ARC-Challenge

Unknown

7,787 science questions requiring reasoning. Challenge set contains harder questions that retrieval fails on.

Benchmark Stats

Models4
Papers4
Metrics1

SOTA History

Not enough data to show trend.

Only 4 models on this benchmark

Help build the community leaderboard — submit your model results.

accuracy

accuracy

Higher is better

RankModelSourceScoreYearPaper
1claude-35-sonnetEditorial96.72025Source
2gpt-4o

Grade-school science questions (challenge set).

Editorial96.42025Source
3gemini-15-proEditorial94.82025Source
4llama-3-70bEditorial932025Source

Submit a Result