Nuclear Fusion, No Power Lines
Jonathan Frankle
Chief AI Scientist at Databricks
Get this Episode:
Season 7 Episode 11
155.6k views
Most organizations treat a bigger context window like a cheat code: dump every document in, skip the data work, ship. Jonathan Frankle, Chief AI Scientist at Databricks, says that’s still wrong.
This is Jonathan’s return visit to Invisible Machines — a conversation recorded last summer, released ahead of Databricks Data + AI Summit. His first appearance (Season 2) was the MosaicML-era craft conversation: lottery tickets, mixology, mini-cupcakes. This one is the enterprise engineering thread: be a scientist, curate before you scale, and treat specification (what you actually want the system to do) as the bottleneck between raw model power and useful AI.
Robb and Josh press him on the myths that still seduce enterprise teams: million-token windows as a substitute for real data work, hyperscaler résumés as a proxy for talent, and the fantasy that unlocking every PDF in the org automatically makes knowledge useful. Jonathan’s answer is consistent: measure success, test your use case, climb the ladder of techniques, and accept that multimodal is where long context actually earns its keep, not as a universal bypass for curation.
Along the way: the nuclear fusion vs. power lines metaphor; why building a benchmark is a cop-out compared to describing intent; prompts as parameters; chat-only UIs vs. a generation that never wanted buttons; LLM-oriented publishing and static FAQ pages; unlocking PDF at scale when curation gets skipped; early-adopter mistakes we’ll laugh at in ten years; and why separating knowledge from reasoning is the north star, even if we aren’t there yet.
———- Support our show by supporting our sponsors!
This episode is supported by OneReach.ai
Forged over a decade of R&D and proven in 10,000+ deployments, OneReach.ai’s GSX is the first complete AI agent runtime environment (circa 2019) — a hardened AI agent architecture for enterprise control and scale.
Backed by UC Berkeley, recognized by Gartner, and trusted across highly regulated industries, including healthcare, finance, government and telecommunications.
A complete system for accelerating AI adoption — design, train, test, deploy, monitor, and orchestrate neurosymbolic applications (agents).
- Use any AI models
- Build and deploy intelligent agents fast
- Create guardrails for organizational alignment
- Enterprise-grade security and governance
About the guest
Jonathan Frankle serves as Chief AI Scientist at Databricks, where he leads research projects on reinforcement learning, model training, and agent evaluation. He heads Mosaic Research—a lab of more than thirty research scientists focused on enhancing the efficiency of training modern generative AI models, including LLMs and diffusion models, through empirical studies on neural network learning.
Read more about the Guest