Combining speed and performance, 2.0 Flash Thinking Experimental also excels in science and math, showing its thinking to solve complex problems.
Thinking, revealed
-
Enhanced performance
Improvements on math and science benchmarks.
-
Long context
A one-million token context window enables deeper analysis of long-form text.
-
Improved thinking
More consistency between thoughts and answers.
-
Tool use
Turn on code execution to run and evaluate code.
Benchmarks
Enhanced abilities across math, science and multimodal reasoning.
Benchmark | Gemini 1.5 Pro 002 |
Gemini 2.0 Flash Exp |
Gemini 2.0 Flash Thinking Exp 01-21 |
---|---|---|---|
AIME2024 (Math) | 19.3% | 35.5% | 73.3% |
GPQA Diamond (Science) | 57.6% | 58.6% | 74.2% |
MMMU (Multimodal reasoning) | 64.9% | 70.7% | 75.4% |
Model information
Model deployment status | Experimental |
Supported data types for input | Text, Image |
Supported data types for output | Text |
Supported # tokens for input | 1M |
Supported # tokens for output | 64k |
Knowledge cutoff | June 2024 |
Tool use | Code execution |
Best for | Complex tasks without the need for low latency |
Availability |
Google AI Studio Gemini API Vertex AI Gemini App |