ArgmaxAuthor: Vahe Hagopian, Taka Hasegawa, Farrukh Rahman
A show where three machine learning enthusiasts talk about recent papers and developments in machine learning. Watch our video on YouTube https://www.youtube.com/@argmaxfm Language: en-us Genres: Mathematics, Science Contact email: Get it Feed URL: Get it iTunes ID: Get it |
Listen Now...
Mixture of Experts
Tuesday, 8 October, 2024
In this episode we talk about the paper "Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean.