GPT-5.2.

OpenAIapi

§ 01 · Benchmarks

Every benchmark GPT-5.2 has a recorded score for.

#	Benchmark	Area · Task	Metric	Value	Rank	Date	Source
01	MMLU	Reasoning · Commonsense Reasoning	accuracy	92.4%	#2/41	2026-02-01	source ↗
02	MMMU-Pro	Multimodal · Visual Question Answering	accuracy	81.0%	#2/5	2025-12-11	source ↗
03	Tau2-Bench	Agentic AI · Tool Use	pass_rate	73.0%	#2/8	2025-12-11	source ↗

Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.

§ 02 · Strengths by area

Where GPT-5.2 actually performs.

§ 04 · Related models

Other OpenAI models scored on Codesota.

GPT-4o

Undisclosed params · 35 results · 9 SOTA

Undisclosed params · 8 results

§ 05 · Sources & freshness

Where these numbers come from.

codesota-shadow-mmlu

result

artificialanalysis.ai

result

editorial

result

2 of 3 rows marked verified. · first result 2025-12-11, latest 2026-02-01.