I tested Google's "nano banana" AI image generator after massive upgrade


Google’s artificial intelligence (AI) assistant has undergone a massive upgrade, introducing Gemini 2.5 Flash Image, otherwise known as “nano-banana.”

Google has revealed its “state-of-the-art” image generation and editing model within its chatbot Gemini.

The upgrade allows users to create consistent characters, combine images to create a fully realized scene, and edit using natural language.

ADVERTISEMENT

The tech giant launched Gemini 2.0 in early 2025, which was apparently well-received by the public. Yet users wanted tweaks like “higher-quality images” and “more powerful creative control,” Google said.

Gemini 2.5 Flash Image can maintain character consistency across multiple prompts, meaning that users don’t lose physical features that help us distinguish different characters.

Niamh Ancell BW jurgita Ernestas Naprys vilius
Don't miss our latest stories on Google News

One X user was given early access to Google’s Gemini 2.5 model, and they were “impressed by how it maintained consistency of character as well as small details.”

Only one image was fed into the “nano banana,” and the user prompted the AI model to place the man in different scenarios.

The user then used Veo3, Google’s video generation model, to fix the flow and motion within the scenes.

ADVERTISEMENT

Character consistency in Gemini 2.5 goes beyond displaying people. Users can also show objects from various angles and environments.

Alongside character consistency, the model is also good at sticking to templates, which is useful for businesses when adhering to brand guidelines.

Google has said it’s already witnessed developers use the AI model to create business cards, employee badges, and product mock-ups.

google-flash-image-cards
AI images generated by Gemini 2.5/Google

When editing images, Google’s AI now uses natural language to remove or add elements to an image.

Using a single text prompt, the model can blur backgrounds, remove stains on a t-shirt, or remove a person from an image entirely.

Google has supposedly enabled its newest model to “unlock new use cases” such as becoming a tutor by drawing on “Gemini’s world knowledge.”

This version of Gemini can now merge images to create one succinct scene. Users can see what a new lamp might look like on their bedside table or see what a new shirt might look like.

People have already started playing around with Gemini 2.5 Flash Image, with one person even asking to see what they’d look like with bangs.

ADVERTISEMENT

Cybernews asked “ nano banana” to create an image of a cat wearing a sombrero, which it did in just over five seconds.

cat-wearing-hat
AI image by Google Gemini 2.5 Flash Image

We wanted to test out the character consistency element of the upgrade, and asked Gemini to generate the same image but instead, have the cat holding maracas.

The result left much to be desired, as, typical of AI image generation, the model created the same cat just with six legs instead of two.

cat-wearing-hat-with-maracas
AI image by Google Gemini 2.5 Flash Image

However, the model is only available in preview via the Gemini API and Google AI Studio and “will be stable in the coming weeks,” Google said.

Gemini 2.5 Flash Image is priced at $30 per one million tokens, and each image is roughly 1290 tokens, which Google estimates is around 40 cents per image.

ADVERTISEMENT