Gemini 2.5 Flash

2.5 Flash

Gemini 2.5 Flash is our best model in terms of price and performance, and offers well-rounded capabilities. It is our first Flash model with thinking capabilities, which let you see the thinking process that the model goes through when generating its response.


Note: To use the "Deploy example app" feature, you need a Google Cloud project with billing and Vertex AI API enabled.
Model ID: gemini-2.5-flash-preview-05-20
Supported inputs & outputs
  • Inputs:
    Text, Code, Images, Audio, Video
  • Outputs:
    Text
Token limits
  • Maximum input tokens: 1,048,576
  • Maximum output tokens: 65,535
Capabilities
Usage types
Technical specifications
Images
  • Maximum images per prompt: 3,000
  • Maximum image size: 7 MB
  • Supported MIME types:
    image/png, image/jpeg, image/webp
Documents
  • Maximum number of files per prompt: 3,000
  • Maximum number of pages per file: 1,000
  • Maximum file size per file for the API or Cloud Storage imports: 50 MB
  • Maximum file size per file for direct uploads through the console: 7 MB
  • Supported MIME types:
    application/pdf, text/plain
Video
  • Maximum video length (with audio): Approximately 45 minutes
  • Maximum video length (without audio): Approximately 1 hour
  • Maximum number of videos per prompt: 10
  • Supported MIME types:
    video/x-flv, video/quicktime, video/mpeg, video/mpegs, video/mpg, video/mp4, video/webm, video/wmv, video/3gpp
Audio
  • Maximum audio length per prompt: Approximately 8.4 hours, or up to 1 million tokens
  • Maximum number of audio files per prompt: 1
  • Speech understanding for: Audio summarization, transcription, and translation
  • Supported MIME types:
    audio/x-aac, audio/flac, audio/mp3, audio/m4a, audio/mpeg, audio/mpga, audio/mp4, audio/opus, audio/pcm, audio/wav, audio/webm
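The per-prompt limits above can be checked on the client before a request is sent. The following is a minimal sketch, not an official validator: the `Attachment` class and `validate_attachments` function are hypothetical names, the counts and MIME types are copied from the specifications above, and the byte limits assume that 1 MB means 1024 * 1024 bytes, which this page does not state.

```python
from dataclasses import dataclass

# MIME types and per-prompt counts copied from the specifications above.
SUPPORTED_TYPES = {
    "image": {"image/png", "image/jpeg", "image/webp"},
    "document": {"application/pdf", "text/plain"},
    "video": {
        "video/x-flv", "video/quicktime", "video/mpeg", "video/mpegs",
        "video/mpg", "video/mp4", "video/webm", "video/wmv", "video/3gpp",
    },
    "audio": {
        "audio/x-aac", "audio/flac", "audio/mp3", "audio/m4a", "audio/mpeg",
        "audio/mpga", "audio/mp4", "audio/opus", "audio/pcm", "audio/wav",
        "audio/webm",
    },
}
MAX_FILES_PER_PROMPT = {"image": 3000, "document": 3000, "video": 10, "audio": 1}

MB = 1024 * 1024  # assumption: the page does not say if MB is binary or decimal
# The document limit used here is the API / Cloud Storage one (50 MB).
MAX_BYTES = {"image": 7 * MB, "document": 50 * MB}


@dataclass
class Attachment:
    """Hypothetical description of one file attached to a prompt."""
    mime_type: str
    size_bytes: int


def kind_of(mime_type: str):
    """Return 'image', 'document', 'video', or 'audio', or None if unsupported."""
    for kind, types in SUPPORTED_TYPES.items():
        if mime_type in types:
            return kind
    return None


def validate_attachments(attachments) -> list:
    """Return a list of human-readable problems; an empty list means none found."""
    problems = []
    counts = {}
    for index, att in enumerate(attachments):
        kind = kind_of(att.mime_type)
        if kind is None:
            problems.append(f"attachment {index}: unsupported MIME type {att.mime_type}")
            continue
        counts[kind] = counts.get(kind, 0) + 1
        limit = MAX_BYTES.get(kind)
        if limit is not None and att.size_bytes > limit:
            problems.append(f"attachment {index}: {kind} exceeds {limit} bytes")
    for kind, count in counts.items():
        if count > MAX_FILES_PER_PROMPT[kind]:
            problems.append(f"{count} {kind} files, limit is {MAX_FILES_PER_PROMPT[kind]}")
    return problems
```

The sketch does not cover page counts, video duration, or audio duration, because checking those requires parsing the files themselves.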
Parameter defaults
  • Temperature: 0-2 (supported range)
  • topP: 0.95
  • topK: 64 (fixed)
  • candidateCount: 1-8 (supported range)
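As an illustration of how the parameter values above might be applied, here is a minimal sketch that builds a generation config dictionary and rejects out-of-range values. The function name `build_generation_config` is hypothetical, `maxOutputTokens` is assumed to be the field name for the output token limit, and the 0 to 1 bound on topP is the usual probability range rather than something stated on this page. topK is left out because it is listed as fixed at 64.

```python
MAX_OUTPUT_TOKENS = 65_535  # from the token limits listed for this model


def build_generation_config(*, temperature, top_p=0.95, candidate_count=1,
                            max_output_tokens=MAX_OUTPUT_TOKENS):
    """Return a dict of generation parameters after range checks.

    temperature has no default here because this page gives only its range.
    candidate_count=1 is an illustrative choice (the low end of the 1-8 range).
    """
    if not 0 <= temperature <= 2:
        raise ValueError("temperature must be between 0 and 2")
    if not 0 <= top_p <= 1:
        raise ValueError("top_p must be between 0 and 1")
    if not 1 <= candidate_count <= 8:
        raise ValueError("candidate_count must be between 1 and 8")
    if not 1 <= max_output_tokens <= MAX_OUTPUT_TOKENS:
        raise ValueError(f"max_output_tokens must be between 1 and {MAX_OUTPUT_TOKENS}")
    return {
        "temperature": temperature,
        "topP": top_p,
        "candidateCount": candidate_count,
        "maxOutputTokens": max_output_tokens,  # assumed field name
    }


config = build_generation_config(temperature=0.7)
```

Validating on the client keeps range errors out of the request path, but the service remains the authority on what it accepts.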
Knowledge cutoff date: January 2025
Versions
  • gemini-2.5-flash-preview-05-20
    • Launch stage: Public preview
    • Release date: May 20, 2025
  • gemini-2.5-flash-preview-04-17
    • Launch stage: Public preview
    • Release date: April 17, 2025
Supported regions

Model availability

  • Global
    • global
  • United States
    • us-central1
See Data residency for more information.
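The region list above determines which endpoint a request is sent to. The sketch below builds a request URL for a given location. The URL pattern is an assumption based on the general Vertex AI REST convention and is not taken from this page; `generate_content_url` and `my-project` are hypothetical names.

```python
# Locations listed under "Model availability" for gemini-2.5-flash-preview-05-20.
SUPPORTED_LOCATIONS = {"global", "us-central1"}
MODEL_ID = "gemini-2.5-flash-preview-05-20"


def generate_content_url(project_id: str, location: str, model_id: str = MODEL_ID) -> str:
    """Build a generateContent URL (assumed pattern) for a supported location."""
    if location not in SUPPORTED_LOCATIONS:
        raise ValueError(f"{model_id} is not listed as available in {location}")
    # Assumption: the global location uses the host without a region prefix.
    if location == "global":
        host = "aiplatform.googleapis.com"
    else:
        host = f"{location}-aiplatform.googleapis.com"
    return (
        f"https://{host}/v1/projects/{project_id}/locations/{location}"
        f"/publishers/google/models/{model_id}:generateContent"
    )


print(generate_content_url("my-project", "us-central1"))
```

Rejecting unlisted locations early gives a clearer error than a failed request, although the list here reflects only what this page states.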
Security controls
See Security controls for more information.
Pricing: See Pricing.

Live API native audio

Gemini 2.5 Flash with Live API native audio is a preview model that features our cutting-edge native audio functionality for Live API. In addition to the standard Live API features, this preview model includes:

  • Enhanced voice quality and adaptability: Live API native audio provides richer, more natural voice interactions with 30 HD voices in 24 languages.
  • Introducing Proactive Audio: When Proactive Audio is enabled, the model responds only when it's relevant. The model generates text transcripts and audio responses proactively only for queries directed to the device, and does not respond to queries that are not directed to the device.
  • Introducing Affective Dialog: Models using Live API native audio can understand and respond appropriately to users' emotional expressions for more nuanced conversations.

For more information on Live API, see our standalone Live API documentation.


Note: To use the "Deploy example app" feature, you need a Google Cloud project with billing and Vertex AI API enabled.
Model ID: gemini-2.5-flash-preview-native-audio-dialog
Supported inputs & outputs
  • Inputs:
    Audio, Video
  • Outputs:
    Text, Audio
Token limits
  • Maximum input tokens: 1,048,576
  • Maximum output tokens: 128K
Capabilities
Usage types
Technical specifications
Video
  • Maximum screenshare length: Approximately 10 minutes
  • Supported MIME types:
    video/x-flv, video/quicktime, video/mpeg, video/mpegs, video/mpg, video/mp4, video/webm, video/wmv, video/3gpp
Audio
  • Maximum conversation length: Approximately 10 minutes
  • Speech understanding for: Audio summarization, transcription, and translation
  • Supported MIME types:
    audio/x-aac, audio/flac, audio/mp3, audio/m4a, audio/mpeg, audio/mpga, audio/mp4, audio/opus, audio/pcm, audio/wav, audio/webm
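Because conversations and screenshares are limited to approximately 10 minutes, a client may want to track elapsed time and wrap up before the limit. The following is a minimal sketch: `SessionBudget` is a hypothetical helper, the 600-second figure treats "approximately 10 minutes" as exact, and the 30-second margin is an arbitrary illustrative choice.

```python
import time

MAX_SESSION_SECONDS = 10 * 60  # "approximately 10 minutes", treated as exact here


class SessionBudget:
    """Tracks time elapsed since creation against a session length limit."""

    def __init__(self, clock=time.monotonic, limit=MAX_SESSION_SECONDS, margin=30.0):
        self._clock = clock      # injectable so the class can be tested without waiting
        self._start = clock()
        self.limit = limit
        self.margin = margin     # illustrative safety margin, not from the page

    def remaining(self) -> float:
        """Seconds left before the limit, never negative."""
        elapsed = self._clock() - self._start
        return max(0.0, self.limit - elapsed)

    def should_wrap_up(self) -> bool:
        """True once the remaining time is within the safety margin."""
        return self.remaining() <= self.margin


budget = SessionBudget()
if budget.should_wrap_up():
    print("Session is close to the length limit; finish the conversation.")
```

Since the limit is approximate, the service may end a session earlier or later than this estimate suggests.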
Parameter defaults
  • Temperature: 0-2 (supported range)
  • topP: 0.95
  • topK: 64 (fixed)
  • candidateCount: 1-8 (supported range)
Knowledge cutoff date: January 2025
Versions
  • gemini-2.5-flash-preview-native-audio-dialog
    • Launch stage: Private preview
    • Release date: May 20, 2025
Supported regions

Model availability

  • United States
    • us-central1
See Data residency for more information.
Security controls
See Security controls for more information.
Pricing: See Pricing.