Google Gemini API
  1. models
Google Gemini API
  • Get API key
  • Release notes
  • Libraries
  • Run Gemini on Google Cloud
  • Model Capabilities
    • Overview
    • Long context
    • Structured output
    • Document understanding
    • Image understanding
    • Video understanding
    • Audio understanding
    • Text generation
      • Text input
      • Image input
      • Streaming output
      • Multi-turn conversations
      • Multi-turn conversations (Streaming)
      • Configuration parameters
    • Generate images
      • Generate images using Gemini
      • Image editing with Gemini
      • Generate images using Imagen 3
    • Gemini thinking
      • Use thinking models
      • Set budget on thinking models
    • Function calling
      • Function Calling with the Gemini API
  • models
    • All Model
    • Pricing
    • Rate limits
    • Billing info
  • Safety
    • Safety settings
    • Safety guidance
  1. models

Pricing

The Gemini API "free tier" is offered through the API service with lower rate limits for testing purposes. Google AI Studio usage is completely free in all available countries. The Gemini API "paid tier" comes with higher rate limits, additional features, and different data handling.
Upgrade to the Paid Tier

Gemini 2.5 Flash Preview#

Try it in Google AI Studio
Our first hybrid reasoning model which supports a 1M token context window and has thinking budgets.
Preview models may change before becoming stable and have more restrictive rate limits.
Free TierPaid Tier, per 1M tokens in USD
Input priceFree of charge0.15(text/image/video)1.00 (audio)
Output priceFree of chargeNon-thinking: 0.60Thinking:3.50
Context caching priceComing soon!Coming soon!
Context caching (storage)Coming soon!Coming soon!
Grounding with Google SearchFree of charge, up to 500 RPD1,500 RPD (free), then $35 / 1,000 requests
Used to improve our productsYesNo

Gemini 2.5 Pro Preview#

Try it in Google AI Studio
Our state-of-the-art multipurpose model, which excels at coding and complex reasoning tasks.
Preview models may change before becoming stable and have more restrictive rate limits.
Free TierPaid Tier, per 1M tokens in USD
Input priceFree of charge, use "gemini-2.5-pro-exp-03-25"1.25,prompts<=200ktokens2.50, prompts > 200k tokens
Output price (including thinking tokens)Free of charge, use "gemini-2.5-pro-exp-03-25"10.00,prompts<=200ktokens15.00, prompts > 200k
Context caching priceNot available0.31,prompts<=200ktokens0.625, prompts > 200k $4.50 / 1,000,000 tokens per hour
Grounding with Google SearchFree of charge, up to 500 RPD1,500 RPD (free), then $35 / 1,000 requests
Used to improve our productsYesNo

Gemini 2.0 Flash#

Try it in Google AI Studio
Our most balanced multimodal model with great performance across all tasks, with a 1 million token context window, and built for the era of Agents.
Free TierPaid Tier, per 1M tokens in USD
Input priceFree of charge0.10(text/image/video)0.70 (audio)
Output priceFree of charge$0.40
Context caching priceFree of charge0.025/1,000,000tokens(text/image/video)0.175 / 1,000,000 tokens (audio)
Context caching (storage)Free of charge, up to 1,000,000 tokens of storage per hour$1.00 / 1,000,000 tokens per hour
Tuning priceNot availableNot available
Grounding with Google SearchFree of charge, up to 500 RPD1,500 RPD (free), then $35 / 1,000 requests
Live APIFree of chargeInput: 0.35(text),2.10 (audio / image [video]) Output: 1.50(text),8.50 (audio)
Used to improve our productsYesNo

Gemini 2.0 Flash-Lite#

Try it in Google AI Studio
Our smallest and most cost effective model, built for at scale usage.
Free TierPaid Tier, per 1M tokens in USD
Input priceFree of charge$0.075
Output priceFree of charge$0.30
Context caching priceNot availableNot available
Context caching (storage)Not availableNot available
Tuning priceNot availableNot available
Grounding with Google SearchNot availableNot available
Used to improve our productsYesNo

Imagen 3#

Try it in ImageFX
Our state-of-the-art image generation model, available to developers on the paid tier of the Gemini API.
Free TierPaid Tier, per Image in USD
Image priceNot available$0.03
Used to improve our productsYesNo

Veo 2#

Try the API
Our state-of-the-art video generation model, available to developers on the paid tier of the Gemini API.
Free TierPaid Tier, per second in USD
Video priceNot available$0.35
Used to improve our productsYesNo

Gemma 3#

Try Gemma 3
Our lightweight, state-of the art, open model built from the same technology that powers our Gemini models.
Free TierPaid Tier, per 1M tokens in USD
Input priceFree of chargeNot available
Output priceFree of chargeNot available
Context caching priceFree of chargeNot available
Context caching (storage)Free of chargeNot available
Tuning priceNot availableNot available
Grounding with Google SearchNot availableNot available
Used to improve our productsYesNo

Gemini 1.5 Flash#

Try it in Google AI Studio
Our fastest multimodal model with great performance for diverse, repetitive tasks and a 1 million token context window.
Free TierPaid Tier, per 1M tokens in USD
Input priceFree of charge0.075,prompts<=128ktokens0.15, prompts > 128k tokens
Output priceFree of charge0.30,prompts<=128ktokens0.60, prompts > 128k tokens
Context caching priceFree of charge, up to 1 million tokens of storage per hour0.01875,prompts<=128ktokens0.0375, prompts > 128k tokens
Context caching (storage)Free of charge$1.00 per hour
Tuning priceToken prices are the same for tuned models Tuning service is free of charge.Token prices are the same for tuned models Tuning service is free of charge.
Grounding with Google SearchNot available$35 / 1K grounding requests
Used to improve our productsYesNo

Gemini 1.5 Flash-8B#

Try it in Google AI Studio
Our smallest model for lower intelligence use cases, with a 1 million token context window.
Free TierPaid Tier, per 1M tokens in USD
Input priceFree of charge0.0375,prompts<=128ktokens0.075, prompts > 128k tokens
Output priceFree of charge0.15,prompts<=128ktokens0.30, prompts > 128k tokens
Context caching priceFree of charge, up to 1 million tokens of storage per hour0.01,prompts<=128ktokens0.02, prompts > 128k tokens
Context caching (storage)Free of charge$0.25 per hour
Tuning priceToken prices are the same for tuned models Tuning service is free of charge.Token prices are the same for tuned models Tuning service is free of charge.
Grounding with Google SearchNot available$35 / 1K grounding requests
Used to improve our productsYesNo

Gemini 1.5 Pro#

Try it in Google AI Studio
Our highest intelligence Gemini 1.5 series model, with a breakthrough 2 million token context window.
Free TierPaid Tier, per 1M tokens in USD
Input priceFree of charge1.25,prompts<=128ktokens2.50, prompts > 128k tokens
Output priceFree of charge5.00,prompts<=128ktokens10.00, prompts > 128k tokens
Context caching priceNot available0.3125,prompts<=128ktokens0.625, prompts > 128k tokens
Context caching (storage)Not available$4.50 per hour
Tuning priceNot availableNot available
Grounding with Google SearchNot available$35 / 1K grounding requests
Used to improve our productsYesNo

Text Embedding 004#

Our state-of-the-art text embedding model.
Free TierPaid Tier, per 1M tokens in USD
Input priceFree of chargeNot available
Output priceFree of chargeNot available
Tuning priceNot availableNot available
Used to improve our productsYesNo
[*] Google AI Studio usage is free of charge in all available regions. See Billing FAQs for details.
[**] Prices may differ from the prices listed here and the prices offered on Vertex AI. For Vertex prices, see the Vertex AI pricing page.
[***] If you are using dynamic retrieval to optimize costs, only requests that contain at least one grounding support URL from the web in their response are charged for Grounding with Google Search. Costs for Gemini always apply. Rate limits are subject to change.
Previous
All Model
Next
Rate limits
Built with