Google launches its faster AI model Gemini 3 Flash

Alphabet’s (GOOG) (GOOGL) Google has globally launched its latest AI model called Gemini 3 Flash, making it the default model in the Gemini app, replacing 2.5 Flash.

Last month, the company launched Gemini 3 with Gemini 3 Pro and Gemini 3 Deep Think mode.

“With Gemini 3, we introduced frontier performance across complex reasoning, multimodal and vision understanding and agentic and vibe coding tasks. Gemini 3 Flash retains this foundation, combining Gemini 3’s Pro-grade reasoning with Flash-level latency, efficiency and cost. It not only enables everyday tasks with improved reasoning, but also is our most impressive model for agentic workflows,” said Tulsee Doshi, senior director, Product Management, Google DeepMind, in a blog post on Tuesday.

Gemini 3 Flash is rolling out — for developers in the Gemini API in Google AI Studio, Gemini CLI and our new agentic development platform Google Antigravity; for everyone via the Gemini app and in AI Mode in Search; and for enterprises in Vertex AI and Gemini Enterprise.

Google said Gemini 3 Flash delivers frontier performance on PhD-level reasoning and knowledge benchmarks like GPQA Diamond (90.4%) and Humanity’s Last Exam (33.7% without tools), rivaling larger frontier models, and significantly outperforming even the best 2.5 model, Gemini 2.5 Pro, across a number of benchmarks.

The new model also reaches state-of-the-art performance with a score of 81.2% on MMMU Pro, comparable to Gemini 3 Pro, according to the company.

Google noted that Gemini 3 Flash is able to modulate how much it thinks. It may think longer for more complex use cases, but it also uses 30% fewer tokens on average than 2.5 Pro, as measured on typical traffic, to accurately complete everyday tasks with higher performance.

Gemini 3 Flash outperforms 2.5 Pro while being three times faster (based on Artificial Analysis benchmarking) at a fraction of the cost. Gemini 3 Flash is priced at $0.50/1M input tokens and $3/1M output tokens (audio input remains at $1/1M input tokens), the company added.

On SWE-bench Verified, a benchmark for evaluating coding agent capabilities, Gemini 3 Flash achieves a score of 78%, outperforming the 2.5 series and Gemini 3 Pro.

“Because of Gemini 3 Flash’s incredible multimodal reasoning capabilities, you can use it to help you see, hear and understand any type of information faster. For example, you can ask Gemini to understand your videos and images and turn that content into a helpful and actionable plan in just a few seconds,” said Doshi.

Last week, Google’s AI rival OpenAI (OPENAI) launched GPT-5.2, its latest flagship large language model, just one month following the release of GPT-5.1.

Leave a Reply Cancel reply