Model card
Qwen2.5-72B-Instruct.
Alibabaopen-source72B paramsDense Transformer
Qwen2.5-72B-Instruct. Released September 2024. Strong open-source model. Instruct-tuned version of the Qwen2.5-72B base. Top open-source model on many reasoning benchmarks at release.
§ 01 · Benchmarks
Every benchmark Qwen2.5-72B-Instruct has a recorded score for.
| # | Benchmark | Area · Task | Metric | Value | Rank | Date | Source |
|---|---|---|---|---|---|---|---|
| 01 | GSM8K | Reasoning · Mathematical Reasoning | accuracy | 95.8% | #18 | — | source ↗ |
| 02 | MATH | Reasoning · Mathematical Reasoning | accuracy | 83.1% | #24 | — | source ↗ |
| 03 | GPQA | Reasoning · Multi-step Reasoning | accuracy | 49.0% | #30 | — | source ↗ |
| 04 | MMLU | Reasoning · Commonsense Reasoning | accuracy | 86.1% | #33 | — | source ↗ |
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area
Where Qwen2.5-72B-Instruct actually performs.
§ 04 · Related models
Other Alibaba models scored on Codesota.
§ 05 · Sources & freshness
Where these numbers come from.
qwen25-tech-report
4
results
4 of 4 rows marked verified.