Experiment with the generation of native image Gemini 2.0 Flash

Share

IN December First, we introduced a native image output in Gemini 2.0 Flash into trusted testers. Today we make it available for a programmer experiments all regions Currently served by Google AI Studio. You can test this fresh ability using the experimental version of Gemini 2.0 flash (Gemini-2.0-Flash-Exp) In Google AI Studio and via API Gemini.

Gemini 2.0 Flash combines multimodal input data, improved reasoning and understanding of the natural language to create images.

Here are some examples in which the 2.0 flash multimodal outputs shine:

1. Text and images Together

Apply Flash Gemini 2.0 to tell the story and illustrates it with photos, maintaining the consistency of characters and settings. Give him an opinion, and the model in addition to history or changes the style of its drawings.

Sorry, your browser does not support the playback of this movie

Story and generating illustrations in Google AI Studio

2. Edition of image conversation

Gemini 2.0 Flash helps to edit images by many corners of dialogue in a natural language, ideal for iteration towards an ideal image or jointly discovering various ideas.

Sorry, your browser does not support the playback of this movie

Multi-Turn Conversion Editing Image Maintenance of context during a conversation at Google AI Studio

3. World understanding

Unlike many other models of generating images, Gemini 2.0 Flash uses world knowledge and improved reasoning to create Normal picture. This makes it ideal for creating detailed photos that are realistic – like an illustrating recipe. Although he strives for accuracy, like all language models, his knowledge is wide and general, not absolute or complete.

Sorry, your browser does not support the playback of this movie

Interspersed for the initial data of the text and image for the recipe at Google AI Studio

4. Text rendering

Most models of generating images try to thoroughly draw long text sequences, often causing poorly formatted or illegible characters or incorrect. Internal references show that 2.0 Flash has stronger rendering compared to leading competitive models and perfect for creating ads, social posts and even invitations.

Sorry, your browser does not support the playback of this movie

Image outputs with long text rendering in Google AI Studio

Start taking photos from Gemini today

Start from Gemini 2.0 Flash via API Gemini. Read more about generating images in ours documents.

from google import genai
from google.genai import types

client = genai.Client(api_key="GEMINI_API_KEY")

response = client.models.generate_content(
    model="gemini-2.0-flash-exp",
    contents=(
        "Generate a story about a cute baby turtle in a 3d digital art style. "
        "For each scene, generate an image."
    ),
    config=types.GenerateContentConfig(
        response_modalities=["Text", "Image"]
    ),
)

Regardless of whether you are building AI agents, you develop applications with stunning visualizations such as Illustrated Interactive Stories, or brainstorming in conversation, Gemini 2.0 Flash allows you to add text and image generation with one model. We are elated to see what developers create with a native image output and yours feedback It will soon support us finalize the ready for production version.

The AI Sckool

Categories

Experiment with the generation of native image Gemini 2.0 Flash

1. Text and images Together

2. Edition of image conversation

3. World understanding

4. Text rendering

Start taking photos from Gemini today

Van allows users to have a piece of AI models trained on their data

What anime memes AI tell us about the future of art and humanity

Opeli says that CHATGPT users have generated over 700 m photos from last week

Google notebook can now find its own sources

Assessment of potential threats of advanced cyber security AI

More News

Assessment of potential threats of advanced cyber security AI

Following a responsible road to Aga

Gemini 2.5: Our most knowledgeable AI model

We present Gemma 3: The most talented model that you can start on one GPU or TPU

Van allows users to have a piece of AI models trained on their data

What anime memes AI tell us about the future of art and humanity

Opeli says that CHATGPT users have generated over 700 m photos from last week