math
Unknown
OCR benchmark
5
Total Results
5
Models Tested
1
Metrics
2025-12-19
Last Updated
accuracy
Higher is better
| Rank | Model | Score | Source |
|---|---|---|---|
| 1 | o1-preview Competition mathematics. Massive improvement over GPT-4. | 94.8 | openai-blog |
| 2 | deepseek-v3 | 90.2 | deepseek-blog |
| 3 | gpt-4o | 76.6 | openai-blog |
| 4 | claude-35-sonnet | 71.1 | anthropic-blog |
| 5 | gemini-15-pro | 67.7 | google-blog |