mbpp
Unknown
Code generation benchmark
Total Results: 2
Models Tested: 2
Metrics: 1
Last Updated: 2025-12-19
Metric: pass@1 (higher is better)
| Rank | Model | Score | Source |
|---|---|---|---|
| 1 | claude-35-sonnet | 89.2 | anthropic-blog |
| 2 | gpt-4o | 87.8 | openai-blog |
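
The page does not say how the listed scores were computed. As a rough illustration only, pass@k is commonly estimated per problem from `n` generated samples, of which `c` pass the unit tests, and then averaged over all problems. For k = 1 this reduces to the fraction of passing samples. The function names below (`pass_at_k`, `benchmark_score`) are hypothetical and not taken from the page or from either vendor's evaluation code.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem.

    n: number of samples generated for the problem
    c: number of those samples that passed all tests
    k: sample budget being scored (k = 1 for pass@1)
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # If fewer than k samples failed, any draw of k samples contains a pass.
    if n - c < k:
        return 1.0
    # Probability that at least one of k samples drawn without replacement passes.
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_score(results: list[tuple[int, int]], k: int = 1) -> float:
    """Mean pass@k over all problems, as a percentage (0 to 100).

    results: one (n, c) pair per problem.
    """
    if not results:
        raise ValueError("results must not be empty")
    return 100.0 * sum(pass_at_k(n, c, k) for n, c in results) / len(results)


# Hypothetical outcomes for four problems with one sample each (not real data).
example = [(1, 1), (1, 0), (1, 1), (1, 1)]
score = benchmark_score(example, k=1)
```

With one sample per problem, the score is simply the percentage of problems solved on the first attempt, which matches the 0 to 100 scale used in the table.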