Gemini 2.5 Flash-Lite

Google gemini-2-5-flash-lite

Model Information
Slug gemini-2-5-flash-lite
Release Date June 17, 2025
Aliases gemini-2.5-flash-lite gemini-2-5-flash-lite
Organization
Name Google
Description

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the [Reasoning API parameter](https://openrouter.ai/docs/use-cases/reasoning-tokens) to selectively trade off cost for intelligence. Context: 1048576

Available at 5 Providers
Provider Input Price ($/1M) Output Price ($/1M) Free
Poe $0.07 $0.28
Helicone $0.10 $0.40
ZenMux $0.10 $0.40
OpenRouter $0.10 $0.40
AIHubMix $0.10 $0.40