State of the Art, Verified

Independent ML benchmarks across 17 research areas. Track progress, find implementations, compare models.

Vision, NLP, reasoning, code, speech, medical, robotics, and more. All results verified with source links.

286+ benchmark results
17 research areas
143 models tracked
Links to implementations
New Explainer

The Ralph Loop: AGI's Dumbest Secret

What if the secret to AGI is while(true)? Interactive visualizations of Wolfram Rule 30, OODA loops, and the $50K→$297 technique.

Free PDF Download

The Zen of AI Composition

Building intelligent systems from first principles. A philosophical guide to AI transformations, modular composition, and evidence-based prompting.

New Feature

AI Building Blocks

Stop searching. Start building. See which tools transform your data - with production-ready implementations.

11 Paradoxes

Interactive Explainers

3Blue1Brown-style explanations of paradoxes and counterintuitive results. Play with simulations instead of just reading about them.

Can I trust these numbers?

Numbers come from published papers and are verified with our own tests where possible. No marketing claims, no sponsored rankings.

Which model fits my use case?

Compare accuracy, speed, cost, and deployment complexity. We show you the tradeoffs that matter for production.

Can I use this data?

Yes. All benchmark data is available as JSON. Build dashboards, cite it in papers, or integrate it into your tools.

286+ benchmark results
17 research areas
86 datasets tracked
143 models compared
Open Data

Use This Data

All benchmark data available as JSON

Build dashboards, cite it in papers, or integrate it into your tools. No API key needed. Updated weekly with new results.

Free to use
Source links included
Updated weekly

Frequently Asked Questions

What is CodeSOTA?

CodeSOTA is an independent ML benchmark tracking platform. We provide verified state-of-the-art results across 17 research areas, including computer vision, NLP, reasoning, code generation, speech, medical AI, and robotics.

Is this a Papers with Code replacement?

CodeSOTA builds on the Papers with Code legacy after Meta shut it down in July 2025. We track 286+ benchmark results with links to implementations. Read the full story.

Are these benchmarks verified?

Yes. We run benchmarks independently where possible, rather than just aggregating paper claims. All data includes source URLs and access dates for verification. See our methodology.

Can I use this benchmark data?

Yes. All benchmark data is available as JSON at /data/benchmarks.json. Build dashboards, cite it in papers, or integrate it into your tools.
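As a rough illustration of working with that file, here is a minimal Python sketch. Several things in it are assumptions, not documented facts: the full URL (the site address from the citation joined with the path above), the shape of the JSON (a top-level list of records, or an object holding lists of records), and the field name "task", which is purely hypothetical. Check the real file and adjust before relying on it.

```python
import json
from collections import Counter
from urllib.request import urlopen

# Assumption: the /data/benchmarks.json path is served from codesota.com.
DATA_URL = "https://codesota.com/data/benchmarks.json"


def fetch_benchmarks(url: str = DATA_URL, timeout: float = 10.0):
    """Download and parse the benchmark JSON. No API key is sent."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


def iter_records(payload):
    """Yield dict records from a top-level list or from lists inside a dict.

    The real schema is not documented here, so both shapes are handled
    and anything else yields nothing.
    """
    if isinstance(payload, list):
        groups = [payload]
    elif isinstance(payload, dict):
        groups = [value for value in payload.values() if isinstance(value, list)]
    else:
        groups = []
    for group in groups:
        for item in group:
            if isinstance(item, dict):
                yield item


def count_by(payload, field: str) -> Counter:
    """Count records per string value of `field`, skipping records without it."""
    return Counter(
        record[field]
        for record in iter_records(payload)
        if isinstance(record.get(field), str)
    )


if __name__ == "__main__":
    data = fetch_benchmarks()
    # "task" is a hypothetical field name; replace it with a real key.
    print(count_by(data, "task").most_common(5))
```

The parsing helpers are kept separate from the download so they can be tried on a saved copy of the file or on hand-written sample data without touching the network.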

What People Say

Anonymous

AI Consultant, Voice-AI at scale

"Outstanding work. Just yesterday I was searching for good OCR comparisons and found only marketing BS. Good job!"

December 2025

Anonymous

Senior Architect

"Super clean, slop-free UI, but most importantly the copy: very precise positioning and project overview."

December 2025

Cite CodeSOTA

If you use CodeSOTA in your research, please cite:

@misc{wikiel2025codesota,
  author = {Wikieł, Kacper},
  title = {CodeSOTA: Independent ML Benchmark Tracking},
  year = {2025},
  url = {https://codesota.com},
  note = {Accessed: 2025}
}

Or in plain text: Wikieł, K. (2025). CodeSOTA: Independent ML Benchmark Tracking. https://codesota.com
