Skip to content
Google

Gemini 3 Flash

flagship
Google · released 2025-10-01 · text
currently routing · 4.2k rpm
1000M tokens
Context
— / 1M
Input
— / 1M
Output
— t/s
Speed
proprietary
License
/ ABOUT

Google Gemini 3 Flash is the latest generation of Google's fast and efficient multimodal AI model. Designed for high-throughput applications, it delivers strong performance across text generation, reasoning, coding, and visual understanding tasks with significantly reduced latency and cost compared to the Pro variant.

Gemini 3 Flash supports a large context window and can process text, images, audio, and video inputs. It excels at tasks requiring quick responses like chat, real-time assistance, content generation, and code completion while maintaining quality close to the larger Pro models.

This model is recommended for production applications where speed and cost-efficiency are priorities, serving as the workhorse of the Gemini 3 family for most common AI tasks.

BENCHMARKS Artificial Analysis Index
Intelligence 46

Providers for Gemini 3 Flash

2 routes · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.

Provider
Context
Quant
Uptime · 30d
bf16
0.00%
bf16
0.00%