Codesota · Models · Gemini 3 FlashGoogle9 results · 8 benchmarks
Model card

Gemini 3 Flash.

GoogleapiUndisclosed params
§ 01 · Benchmarks

Every benchmark Gemini 3 Flash has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01LiveCodeBenchComputer Code · Code Generationpass@190.8%#2/302026-03-15source ↗
02GPQAReasoning · Multi-step Reasoningaccuracy90.4%#3/33source ↗
03ParseBenchComputer Vision · Document Parsingaccuracy71.0%#3/14source ↗
04MMLU-ProReasoning · Commonsense Reasoningaccuracy89.0%#4/202026-04-20source ↗
05SWE-Bench VerifiedComputer Code · Code Generationresolve-rate78.0%#8/392025-12-17source ↗
06SWE-bench VerifiedAgentic AI · SWE-benchresolve-rate78.0%#9/81source ↗
07SWE-BenchComputer Code · Code Generationresolve-rate-agentic75.8%#13/252026-02-01unverified
08SWE-BenchComputer Code · Code Generationresolve-rate75.8%#14/322026-02-01source ↗
09MMLUReasoning · Commonsense Reasoningaccuracy89.6%#17/412026-01-01source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where Gemini 3 Flash actually performs.

Computer Vision
1
benchmark
avg rank #3.0
Reasoning
3
benchmarks
avg rank #8.0
Agentic AI
1
benchmark
avg rank #9.0
Computer Code
3
benchmarks
avg rank #9.3
§ 03 · Papers

1 paper with results for Gemini 3 Flash.

  1. 2023-10-10· Computer Code· 1 result

    SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

    Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao et al.
§ 04 · Related models

Other Google models scored on Codesota.

Gemini 2.5 Pro
16 results · 3 SOTA
Gemini 3 Pro
Undisclosed params · 13 results · 2 SOTA
Gemini 1.5 Pro
12 results · 1 SOTA
Gemini 3.1 Pro
3 results · 1 SOTA
ViT-H/14
632M params · 2 results · 1 SOTA
CoCa (finetuned)
2.1B params · 1 result · 1 SOTA
Gemini 2.0 Flash
1 result · 1 SOTA
Gemini 3.1 Pro Preview
1 result · 1 SOTA
§ 05 · Sources & freshness

Where these numbers come from.

google-blog
2
results
vendor
1
result
blog-post
1
result
pricepertoken
1
result
editorial
1
result
mini-swe-agent-v2
1
result
swebench-leaderboard
1
result
codesota-shadow-mmlu
1
result
5 of 9 rows marked verified. · first result 2025-12-17, latest 2026-04-20.