Google Gemini’s Ai Image Model Gets A ‘bananas’ Upgrade

1 week ago

Google is upgrading its Gemini chatbot pinch a caller AI image exemplary that gives users finer power complete editing photos, a measurement meant to drawback up pinch OpenAI’s celebrated image tools and tie users from ChatGPT.

The update, called Gemini 2.5 Flash Image, rolls retired starting Tuesday to each users successful nan Gemini app, arsenic good arsenic to developers via nan Gemini API, Google AI Studio, and Vertex AI platforms.

Gemini’s caller AI image exemplary is designed to make much precise edits to images — based connected earthy connection requests from users — while preserving nan consistency of faces, animals, and different details, thing that astir rival devices struggle with. For instance, inquire ChatGPT aliases xAI’s Grok to alteration nan colour of someone’s garment successful a photo, and nan consequence mightiness see a distorted look aliases an altered background.

an animated GIF showing 2 pictures, 1 of an jock and nan different of a dog, successful a caller mixed photograph of nan jock cuddling nan dog.

Google’s caller instrumentality has already drawn attention. In caller weeks, societal media users raved complete an awesome AI image editor successful nan crowdsourced information platform, LMArena. The exemplary appeared to users anonymously nether nan pseudonym “nano-banana.”

Google says it’s down nan exemplary (if it wasn’t obvious already from each nan banana-related hints), which is really nan autochthonal image capacity wrong its flagship Gemini 2.5 Flash AI model. Google says nan image exemplary is state-of-the-art connected LMArena and different benchmarks.

a chart showing imaging editing benchmarks, pinch Gemini 2.5 Flash Image / LMArena performing amended than different rival models.

“We’re really pushing ocular value forward, arsenic good arsenic nan model’s expertise to travel instructions,” said Nicole Brichtova, a merchandise lead connected ocular procreation models astatine Google DeepMind, successful an question and reply pinch TechCrunch.

“This update does a overmuch amended occupation making edits much seamlessly, and nan models outputs are usable for immoderate you want to usage them for,” said Brichtova.

AI image models person go a captious conflict crushed for Big Tech. When OpenAI launched GPT-4o’s autochthonal image generator successful March, it drove ChatGPT’s usage done nan tile acknowledgment to a frenzy of AI-generated Studio Ghibli memes that, according to OpenAI CEO Sam Altman, near nan company’s GPUs “melting.”

To support up pinch OpenAI and Google, Meta announced past week that it would license AI image models from nan startup Midjourney. Meanwhile, nan a16z-backed German unicorn Black Forest Labs continues to predominate benchmarks pinch its FLUX AI image models.

Perhaps Gemini’s awesome AI image editor tin thief Google adjacent its personification spread pinch OpenAI. ChatGPT now logs much than 700 million play users. On Google’s net telephone successful July, nan tech giant’s CEO Sundar Pichai revealed that Gemini had 450 million monthly users — implying play users are moreover lower.

Brichtova says Google specifically designed nan image exemplary pinch user usage cases in-mind, specified arsenic helping users visualize their location and plot projects. The exemplary besides has amended “world knowledge” and tin harvester aggregate references successful a azygous prompt; for example, merging an image of a sofa, a surviving room photo, and a colour palette into 1 cohesive render.

an animated GIF showing an image of an quiet surviving room, pinch prompts displayed connected surface specified arsenic "add paint" — and nan room overgarment changes color. "Add sofa," and a sofa is added. The demo shows nan AI prompts changing nan image successful real-time.

While Gemini’s caller AI image generator makes it easier for users to make and edit realistic images, nan institution has safeguards that limit what users tin create. Google has struggled pinch AI image generator safeguards successful nan past. At 1 point, nan institution apologized for Gemini generating historically inaccurate image of people, and rolled back nan AI image generator altogether.

Now, Google feels that it’s struck a amended balance.

“We want to springiness users imaginative power truthful that they tin get from nan models what they want,” said Brichtova. “But it’s not for illustration thing goes.”

The generative AI conception of Google’s position of work prohibits users from generating “non-consensual friendly imagery.” Those aforesaid kinds of safeguards don’t look to beryllium for Grok, which allowed users to create AI-generated explicit images resembling celebrities, specified arsenic Taylor Swift.

To reside nan emergence of deepfake imagery, which tin make it difficult for users to discern what’s existent online, Brichtova says that Google applies ocular watermarks to AI-generated images, arsenic good arsenic identifiers successful its metadata. However, personification scrolling past an image connected societal media whitethorn not look for specified identifiers.

English (US) ·

Indonesian (ID) ·

· · ·

↑

Google Gemini’s Ai Image Model Gets A ‘bananas’ Upgrade

Related Article

Apple Is Playing A Dangerous Game With The Iphone 17 - And It Might Just Pay Off

The 9 Coolest Gadgets I Saw At Ifa Berlin 2025 (including Picks You Can Actually Buy)

Signal Lets You Back Up Your Chats For Free Now - Plus Its First-ever Paid Feature

Popular Article

The Best Wireless Headphones For 2025: Bluetooth Options For Every Budget

New Travel Turmoil As American Airlines, United, Jetblue, And Avelo Slashing Flights And Routes – What You Need To Know

American, Delta, Southwest And Alaska Connecting Chicago, Philadelphia, Raleigh-durham, San Diego, Santa Maria, Sun Valley With New Winter Airline Rou...

Thousands Of Air Canada Flights At Risk As Potential Strike Threat Set To Disrupt Global Travel

Google Is Experimenting With Machine-learning Powered Age Estimation Tech In The U.s.