Gemini 2.5 Flash and Pro Pricing and Availability Changes

Premium Models Move to General Availability

Exciting news for the AI development community! Google Cloud has announced that Gemini 2.5 Flash and Gemini 2.5 Pro have officially reached General Availability (GA) as of June 17, 2025. This marks a significant milestone, bringing enhanced capabilities and stability to these powerful models.

This update includes important changes for developers, including new GA endpoints, a deprecation timeline for preview endpoints, and revised pricing for Gemini 2.5 Flash. Let's dive into the details.

What's New and What's Changing?


New GA Endpoints for Stable Production Use

Effective June 17, 2025, new GA endpoints are available for gemini-2.5-flash and gemini-2.5-pro. The "-001" suffix previously used to mark a stable version has been dropped for these GA endpoints, so the model ID is simply the model name. These are the endpoints to use for production applications.

Farewell to Preview Endpoints: A Migration is Required

All existing Gemini 2.5 Flash and Pro preview endpoints will continue to function with their current preview pricing until July 15, 2025. After this date, these preview endpoints will be shut down. This includes:

  • gemini-2.5-flash-preview-04-17
  • gemini-2.5-flash-preview-05-20
  • gemini-2.5-pro-preview-03-25
  • gemini-2.5-pro-preview-05-06
  • gemini-2.5-pro-preview-06-05

To ensure uninterrupted service, it is crucial to update your applications to use the new GA endpoints (gemini-2.5-flash and gemini-2.5-pro) before July 15, 2025.
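The rename can be captured in a small lookup table. The following is a minimal sketch rather than an official migration tool: the model IDs are the ones listed above, while the names `PREVIEW_TO_GA` and `to_ga_model` are hypothetical and chosen only for illustration.

```python
# Preview model IDs named in this post, mapped to their GA replacements.
# The dictionary and function names are illustrative, not part of any Google SDK.
PREVIEW_TO_GA = {
    "gemini-2.5-flash-preview-04-17": "gemini-2.5-flash",
    "gemini-2.5-flash-preview-05-20": "gemini-2.5-flash",
    "gemini-2.5-pro-preview-03-25": "gemini-2.5-pro",
    "gemini-2.5-pro-preview-05-06": "gemini-2.5-pro",
    "gemini-2.5-pro-preview-06-05": "gemini-2.5-pro",
}


def to_ga_model(model_id: str) -> str:
    """Return the GA model ID for a known preview ID; leave other IDs unchanged."""
    return PREVIEW_TO_GA.get(model_id, model_id)
```

Routing every configured model ID through a helper like this keeps the change in one place, and IDs that are already GA pass through untouched.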

Important Note on Preview Endpoint Updates: To facilitate a smoother transition, starting June 19, 2025, some preview endpoints will begin serving the GA versions of the models. For example, gemini-2.5-flash-preview-04-17 will serve the Gemini 2.5 Flash model version released on 05-20 (which is now GA), and similarly for certain Pro preview endpoints. This helps ensure continuity during your migration period.

Updated Pricing for Gemini 2.5 Flash GA

The pricing for Gemini 2.5 Flash GA is being adjusted to reflect its improved quality and to simplify how output tokens are billed. Input and output tokens are still priced separately.

Previously, there were different prices for "thinking" vs. "non-thinking" output tokens. Now, the pricing for "thinking" and "non-thinking" output will be unified, simplifying cost management.

Here's a quick look at the price adjustments for Gemini 2.5 Flash:

| Endpoint(s) | Task Type | Price before June 17, 2025 (per 1M tokens) | Price after June 17, 2025 (per 1M tokens) |
|---|---|---|---|
| /gemini-2.5-flash-preview-04-17 (Retired 7/15/2025), /gemini-2.5-flash-preview-05-20 (Retired 7/15/2025) | Input (text, image, video) | $0.15 | $0.15 |
| | Audio Output | $1 | $1 |
| | Text output (no thinking) | $0.60 | $0.60 |
| | Text output (thinking) | $3.50 | $3.50 |
| /gemini-2.5-flash (GA) | Input (text, image, video) | N/A | $0.30 |
| | Audio Output | N/A | $1 (no change) |
| | Text output (no thinking) | N/A | $2.50 |
| | Text output (thinking) | N/A | $2.50 |

The new pricing applies to the GA endpoint from its launch on June 17, 2025. Preview pricing continues only on the existing preview endpoints, and only until they shut down on July 15, 2025.
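To see what the change means for a given workload, the per-1M-token prices quoted in this post can be plugged into a short calculation. This is a rough sketch for text input and text output only. The tier labels `"preview"` and `"ga"`, the function name `estimate_cost`, and the assumption that `output_tokens` is the full count billed at the output rate are all illustrative, so check the official pricing page before relying on the numbers.

```python
from decimal import Decimal

# USD per 1M tokens, as quoted in this post (text input and text output only).
PRICES = {
    "preview": {
        "input": Decimal("0.15"),
        "output": Decimal("0.60"),           # text output, no thinking
        "output_thinking": Decimal("3.50"),  # text output, thinking
    },
    "ga": {
        "input": Decimal("0.30"),
        "output": Decimal("2.50"),           # unified output price
        "output_thinking": Decimal("2.50"),
    },
}

ONE_MILLION = Decimal(1_000_000)


def estimate_cost(tier: str, input_tokens: int, output_tokens: int, thinking: bool) -> Decimal:
    """Estimate the USD cost of a workload under the given pricing tier."""
    rates = PRICES[tier]
    output_rate = rates["output_thinking"] if thinking else rates["output"]
    total = input_tokens * rates["input"] + output_tokens * output_rate
    return total / ONE_MILLION
```

For a hypothetical workload of 10M input tokens and 2M output tokens, this gives $2.70 on preview without thinking, $8.50 on preview with thinking, and $8.00 on GA either way. At that mix, non-thinking workloads cost more on GA while thinking workloads cost slightly less.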

Additionally, Google has introduced Gemini 2.5 Flash-Lite, a new, even more cost-efficient Gemini 2.5 model, now available in public preview. This model is optimized for low-latency use cases and high throughput.

Provisioned Throughput (PT) Updates

For those utilizing Provisioned Throughput, all new PT purchases will now be for GA endpoints only. If you have existing PT for a specific preview version, it will continue to work for that preview. However, you must migrate your existing PT to the GA endpoint or purchase new PT for the GA endpoint by July 15, 2025. Instructions for migrating PT can be found in the Google Cloud documentation.

Essential Actions for Developers:

To ensure a seamless transition and leverage the latest Gemini 2.5 capabilities, Google Cloud recommends the following:

  • Migrate to new GA endpoints: Update your applications to use gemini-2.5-flash and gemini-2.5-pro before July 15, 2025.

  • Review price changes: Familiarize yourself with the updated pricing details for Gemini 2.5 Flash GA.

  • Update Provisioned Throughput: If applicable, update your PT assignments to the GA endpoint by July 15, 2025.
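As a starting point for the first action, a quick scan of configuration or source text can reveal which preview model IDs are still in use. This is a minimal sketch: the function name `find_preview_ids` is hypothetical, and the pattern assumes preview IDs follow the dated form of the five listed in this post, so it will also match any other ID of that shape.

```python
import re

# Matches dated preview IDs of the form used in this post,
# e.g. gemini-2.5-pro-preview-06-05.
PREVIEW_ID = re.compile(r"gemini-2\.5-(?:flash|pro)-preview-\d{2}-\d{2}")


def find_preview_ids(text: str) -> list[str]:
    """Return distinct preview model IDs found in text, in order of first appearance."""
    # dict.fromkeys removes duplicates while preserving insertion order.
    return list(dict.fromkeys(PREVIEW_ID.findall(text)))
```

Running this over each config file or deployment manifest gives a list of references that still need to change before July 15, 2025.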

Google Cloud is committed to supporting developers through these changes. If you have questions, you can reach out to your account team, Google Cloud Support, or refer to the official pricing page for the latest information.

These advancements in Gemini 2.5 Flash and Pro open up new possibilities for building intelligent applications with enhanced performance and optimized costs. Make sure to update your integrations to benefit from these exciting GA releases!

Joshua Berkowitz June 18, 2025