Google Reveals Dev-Focused Gemini 3.1 Flash Lite, Promises 'Best-in-Class Intelligence for Your Highest-Volume Workloads'
Gemini 3.1 Flash-Lite offers adjustable thinking levels and a 1-million token context window, enabling fast, cost-efficient processing for high-throughput developer tasks.
- On Tuesday, Google launched Gemini 3.1 Flash-Lite, two weeks after Gemini 3.1 Pro, making it available in preview via the Gemini API in Google AI Studio and Vertex AI.
- Designed for high-volume developer workloads, Flash-Lite serves low-latency tasks like email summarizing and data extraction with adjustable 'Thinking' levels for accuracy trade-offs.
- Benchmarking shows Gemini 3.1 Flash-Lite can produce up to 363 tokens per second and supports a 1-million token context window for multimodal vision and video processing.
- Google set pricing at $0.25 per 1 million input tokens and $1.50 per 1 million output tokens, making Flash-Lite 12x–16x cheaper for high-context workloads.
- In a departure from prior launches, Google surprised some observers by releasing a Flash-Lite variant first instead of a more capable flagship, and despite being three tokens slower than Gemini 2.5 Flash-Lite, Flash-Lite still outpaces competitors as Google withheld agent benchmarks.
21 Articles
21 Articles
Google releases Gemini 3.1 Flash Lite at 1/8th the cost of Pro
Google's newest AI model is here: Gemini 3.1 Flash-Lite, and the biggest improvements this time around come in cost and speed, especially for enterprises and developers seeking to leverage powerful reasoning and multimodal capabilities from the U.S. search and cloud giant.Positioning it as the most cost-efficient and responsive model in the Gemini 3 series, Google is offering a solution built specifically for intelligence at scale. This launch a…
Google launches Gemini 3.1 Flash-Lite, its fastest Gemini 3 model yet
Two weeks after launching Gemini 3.1 Pro, its most capable AI model yet, Google on Tuesday launched Gemini 3.1 Flash-Lite, the fastest model in the Gemini 3 family so far. At $0.25/$1.50 per million input/output tokens, it’s also Google’s most affordable Gemini 3 model yet. The model is meant for what Google describes as “high-volume developer workloads at scale” and is now available in preview in the Gemini API in Google AI Studio and Vertex AI…
Google announced a new artificial intelligence model in the Gemini 3 series • The model will be the cheapest yet and will provide an initial answer 2.5 times faster than existing models • Everything we know so far
GNT is the French Hi-Tech portal dedicated to new technologies (internet, software, hardware, mobility, company) and video game PC and consoles.
Coverage Details
Bias Distribution
- 100% of the sources are Center
Factuality
To view factuality data please Upgrade to Premium










