
Gemini 2.5 Flash is a hybrid reasoning model: developers can turn thinking on or off and set a thinking budget to balance quality, cost, and latency. It is available in preview through the Gemini API in Google AI Studio and Vertex AI, and offers performance comparable to leading models at a fraction of the cost and size.
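
As a rough illustration of the on/off switch and the budget, the sketch below builds a JSON request body without sending it. The helper `build_request` is hypothetical. The field names `generationConfig`, `thinkingConfig`, and `thinkingBudget` and the 24576-token upper limit are assumptions based on the public Gemini API documentation, not on the text above, so check them against the current API reference before relying on them.

```python
import json
from typing import Optional

# Assumed upper limit for the thinking budget, in tokens (not stated in the text).
MAX_THINKING_BUDGET = 24576


def build_request(prompt: str, thinking_budget: Optional[int] = None) -> dict:
    """Build a generateContent-style request body (hypothetical helper).

    thinking_budget=None -> leave the budget unset so the model uses its default
    thinking_budget=0    -> turn thinking off
    thinking_budget=N>0  -> cap thinking at N tokens
    """
    body = {"contents": [{"parts": [{"text": prompt}]}]}
    if thinking_budget is not None:
        if not 0 <= thinking_budget <= MAX_THINKING_BUDGET:
            raise ValueError(
                f"thinking_budget must be between 0 and {MAX_THINKING_BUDGET}"
            )
        # Assumed field names; verify against the API reference.
        body["generationConfig"] = {
            "thinkingConfig": {"thinkingBudget": thinking_budget}
        }
    return body


# Thinking off: lowest cost and latency.
fast = build_request("Translate 'hello' to French.", thinking_budget=0)

# Thinking capped at 1024 tokens: more reasoning for a harder prompt.
careful = build_request("Plan a three-step database migration.", thinking_budget=1024)

print(json.dumps(careful, indent=2))
```

A budget of 0 favors cost and latency, while a larger budget gives the model more room to reason on harder prompts. Leaving the budget unset defers the choice to the model's default behavior.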