Training an LLMs with the hundreds-of-mini-cupcakes approach
Jonathan Frankle
Chief Scientist at MosaicML (acquired by Databricks)
Get this Episode:
Season 2 Episode 2
MosaicML was just acquired for $1.3 Billion. Mosaic Chief Scientist, Jonathan Frankle joins Robb and Josh for a conversation packed with practical thinking and perspective on the incredible work being done at Mosaic. They pick Jonathan’s brain and exchange ideas in a fun and useful conversation for all.
About the guest
Jonathan Frankle serves as the Chief Neural Network Scientist at Databricks, leading the Mosaic Research lab comprised of over 30 research scientists. With a focus on enhancing the efficiency of training modern generative AI models like LLMs and diffusion models, his team conducts empirical studies on neural network learning.
Read more about the Guest