ARC-Challenge
Unknown
7,787 science questions requiring reasoning. Challenge set contains harder questions that retrieval fails on.
Benchmark Stats
Models4
Papers4
Metrics1
SOTA History
Not enough data to show trend.
Only 4 models on this benchmark
Help build the community leaderboard — submit your model results.
accuracy
accuracy
Higher is better