This model is basic/gemini-2.5-flash-image-preview, which offers the same image quality at a lower price.
How to use
- Open this URL: https://aiview.gongxiangai.top
- Set up your API to get started.
Introducing Gemini 2.5 Flash Image
Gemini 2.5 Flash Image (aka nano-banana), a state-of-the-art model for image generation and editing, was announced on August 26, 2025. Its key capabilities include blending multiple images into a single image, maintaining character consistency for storytelling, performing targeted edits using natural language, and drawing on Gemini’s world knowledge for both generation and editing tasks.
This release builds upon the native image generation feature first launched in Gemini 2.0 Flash. While users appreciated the low latency, cost-effectiveness, and ease of use of the previous version, feedback indicated a need for higher-quality images and more powerful creative control, which this new model aims to address.
Availability and Pricing
Gemini 2.5 Flash Image is available immediately for developers via the Gemini API and Google AI Studio, and for enterprise use through Vertex AI. The pricing is set at $30.00 per 1 million output tokens. Since each generated image corresponds to 1290 output tokens, the cost per image is approximately $0.039. Pricing for all other input and output modalities aligns with the standard Gemini 2.5 Flash pricing structure.
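The per-image figure follows directly from the two numbers quoted above. As a minimal sketch, the arithmetic can be checked as follows; the constant and function names are illustrative, not part of any API, and the calculation covers output tokens for generated images only.

```python
# Figures taken from the pricing paragraph above.
PRICE_PER_MILLION_OUTPUT_TOKENS_USD = 30.00
OUTPUT_TOKENS_PER_IMAGE = 1290


def image_cost_usd(num_images: int) -> float:
    """Estimate the output-token cost of generating `num_images` images."""
    total_tokens = num_images * OUTPUT_TOKENS_PER_IMAGE
    return total_tokens * PRICE_PER_MILLION_OUTPUT_TOKENS_USD / 1_000_000


if __name__ == "__main__":
    print(f"1 image:     ${image_cost_usd(1):.4f}")     # $0.0387, i.e. about $0.039
    print(f"1000 images: ${image_cost_usd(1000):.2f}")  # $38.70
```

Input tokens (the prompt and any uploaded images) are billed separately at the standard Gemini 2.5 Flash rates and are not included in this estimate.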
Developer Experience in Google AI Studio
To facilitate building with the new model, significant updates have been made to the “build mode” in Google AI Studio. Developers can quickly test the model’s capabilities with custom AI-powered apps, remix existing templates, or bring new ideas to life with a single prompt. Once an application is ready, it can be deployed directly from Google AI Studio or its code can be saved to GitHub.
Key Model Capabilities
Maintain Character Consistency
A common challenge in image generation is preserving a character’s appearance across multiple images. Gemini 2.5 Flash Image addresses this by allowing users to place the same character in different environments, showcase a product from various angles in new settings, or generate consistent brand assets. The model is also proficient at adhering to visual templates, making it useful for creating items like real estate listing cards, uniform employee badges, or dynamic product mockups from a single design.
Prompt-based Image Editing
The model enables precise, localized edits through natural language instructions. Users can perform targeted transformations with simple prompts, such as blurring the background of an image, removing a stain from a t-shirt, deleting a person from a photo, altering a subject’s pose, or adding color to a black-and-white picture.
Native World Knowledge
Unlike traditional image generation models that often focus solely on aesthetics, Gemini 2.5 Flash Image benefits from Gemini’s deep, semantic understanding of the real world. This integration of world knowledge unlocks new use cases. For example, it can power an interactive educational tool that reads and understands hand-drawn diagrams, assists with real-world questions, and follows complex editing instructions in a single step.
Multi-image Fusion
Gemini 2.5 Flash Image can understand and merge multiple input images. This allows users to seamlessly place an object into a new scene, restyle a room with a different color scheme or texture, or fuse multiple images together using a single prompt to create a new, photorealistic image.
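To illustrate what "multiple images plus a single prompt" might look like in practice, here is a rough sketch that assembles a request body with one text part and several inline image parts. It assumes the JSON shape of the public Gemini API generateContent method (contents, parts, inline_data); the gateway listed at the top of this page may expect a different format. The function name build_fusion_request is hypothetical, and the sketch only builds the payload without sending it.

```python
import base64
import json


def build_fusion_request(prompt: str, images: list[tuple[str, bytes]]) -> dict:
    """Build a request body with one text prompt and several inline images.

    `images` is a list of (mime_type, raw_bytes) pairs. The layout assumes the
    Gemini API generateContent JSON format; adjust it if your endpoint differs.
    """
    parts = [{"text": prompt}]
    for mime_type, raw_bytes in images:
        parts.append(
            {
                "inline_data": {
                    "mime_type": mime_type,
                    # Image bytes are carried as base64 text inside the JSON body.
                    "data": base64.b64encode(raw_bytes).decode("ascii"),
                }
            }
        )
    return {"contents": [{"parts": parts}]}


if __name__ == "__main__":
    # Placeholder bytes stand in for real image files read from disk.
    body = build_fusion_request(
        "Place the sofa from the first image into the room in the second image.",
        [("image/png", b"sofa-bytes"), ("image/jpeg", b"room-bytes")],
    )
    print(json.dumps(body, indent=2))
```

The same payload shape also covers prompt-based editing: pass a single image together with an instruction such as "blur the background".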