Deepmind Reveals Genie 3, A World Model That Could Be The Key To Reaching Agi

Trending 2 hours ago

Google DeepMind has revealed Genie 3, its latest instauration world exemplary that nan AI laboratory says presents a important stepping chromatic connected nan way to artificial wide intelligence, aliases human-like intelligence. 

“Genie 3 is nan first real-time interactive wide intent world model,” Shlomi Fruchter, a investigation head astatine DeepMind, said during a property briefing. “It goes beyond constrictive world models that existed before. It’s not circumstantial to immoderate peculiar environment. It tin make some photo-realistic and imaginary worlds, and everything successful between.”

Genie 3, which is still successful investigation preview and not publically available, builds connected some its predecessor Genie 2 – which tin make caller environments for agents – and DeepMind’s latest video procreation exemplary Veo 3 – which exhibits a heavy knowing of physics. 

Image Credits:Google DeepMind

With a elemental matter prompt, Genie 3 tin make aggregate minutes – up from 10 to 20 seconds successful Genie 2 – of diverse, interactive, 3D environments astatine 24 frames per 2nd pinch a solution of 720p. The exemplary besides features “promptable world events,” aliases nan expertise to usage a punctual to alteration nan generated world.

Perhaps astir importantly, Genie 3’s simulations enactment physically accordant complete clip because nan exemplary is capable to retrieve what it had antecedently generated – an emergent capacity that DeepMind researchers didn’t explicitly programme into nan model. 

Fruchter said that while Genie 3 intelligibly has implications for acquisition experiences and caller generative media for illustration gaming aliases prototyping imaginative concepts, its existent unlock will manifest successful training agents for wide intent tasks, which he said is basal to reaching AGI. 

“We deliberation world models are cardinal connected nan way to AGI, specifically for embodied agents, wherever simulating existent world scenarios is peculiarly challenging,”Jack Parker-Holder, a investigation intelligence connected DeepMind’s open-endedness team, said during a briefing.

Techcrunch event

San Francisco | October 27-29, 2025

Image Credits:Google DeepMind

Genie 3 is designed to lick that bottleneck. Like Veo, it doesn’t trust connected a hard-coded physics engine. Instead, it teaches itself really nan world useful – really objects move, fall, and interact – by remembering what it has generated and reasoning complete agelong clip horizons. 

“The exemplary is auto-regressive, meaning it generates 1 framework astatine a time,” Fruchter told TechCrunch successful a abstracted interview. “It has to look backmost astatine what was generated earlier to determine what’s going to hap next. That’s a cardinal portion of nan architecture.”

That representation creates consistency successful its simulated worlds, and that consistency allows it to create a benignant of intuitive grasp of physics, akin to really humans understand that a solid teetering connected nan separator of a array is astir to fall, aliases that they should duck to debar a falling object.

This expertise to simulate coherent, physically plausible environments complete clip makes Genie 3 overmuch much than a generative model. It becomes an perfect training crushed for general-purpose agents. Not only tin it make endless, divers worlds to explore, but it besides has nan imaginable to push agents to their limits – forcing them to adapt, struggle, and study from their ain acquisition successful a measurement that mirrors really humans study successful nan existent world. 

Image Credits:Google DeepMind

Currently, nan scope of actions an supplier tin return is still limited. For example, nan promptable world events let for a wide scope of biology interventions, but they’re not needfully performed by nan supplier itself. Similarly, it’s still difficult to accurately exemplary analyzable interactions betwixt aggregate independent agents successful a shared environment. Genie 3 tin besides only support a fewer minutes of continuous interaction, erstwhile hours would beryllium basal for due training. 

Still, Genie 3 presents a compelling measurement guardant successful school agents to spell beyond reacting to inputs truthful they tin plan, explore, activity retired uncertainty, and amended done proceedings and correction – nan benignant of self-driven, embodied learning that’s cardinal successful moving towards wide intelligence. 

“We haven’t really had a Move 37 infinitesimal for embodied agents yet, wherever they tin really return caller actions successful nan existent world,” Parker-Holder said, referring to nan legendary infinitesimal successful nan 2016 crippled of Go betwixt DeepMind’s AI supplier AlphaGo and world champion Lee Sedol, successful which Alpha Go played an unconventional and superb move that became symbolic of AI’s expertise to observe caller strategies beyond quality understanding. 

“But now, we tin perchance usher successful a caller era,” he said. 

Rebecca Bellan is simply a elder newsman astatine TechCrunch, wherever she covers Tesla and Elon Musk’s broader empire, autonomy, AI, electrification, gig activity platforms, Big Tech regulatory scrutiny, and more. She’s 1 of nan co-hosts of nan Equity podcast and writes nan TechCrunch Daily greeting newsletter. Previously, she covered societal media for Forbes.com, and her activity has appeared successful Bloomberg CityLab, The Atlantic, The Daily Beast, Mother Jones, i-D (Vice) and more. Rebecca has invested successful Ethereum.

More