openai/gpt-oss-120b (API).

openai · open-source · 120B params

§ 01 · Benchmarks

Every benchmark openai/gpt-oss-120b (API) has a recorded score for.

#	Benchmark	Area · Task	Metric	Value	Rank	Date	Source
01	CPTU-Bench	Natural Language Processing · Polish Text Understanding	tricky-questions	3.9%	#11/93	—	source ↗
02	CPTU-Bench	Natural Language Processing · Polish Text Understanding	language-understanding	4.0%	#17/93	—	source ↗
03	CPTU-Bench	Natural Language Processing · Polish Text Understanding	average	3.8%	#20/93	—	source ↗
04	CPTU-Bench	Natural Language Processing · Polish Text Understanding	phraseology	3.5%	#28/93	—	source ↗
05	CPTU-Bench	Natural Language Processing · Polish Text Understanding	sentiment	3.9%	#32/93	—	source ↗

The Rank column shows this model’s position against all other models scored on the same benchmark and metric, with the competitor count after the slash. A #1 shown in red marks the current state of the art (SOTA). Rows are sorted by rank, then by newest result.
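The rank cells and the sort order described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `parse_rank` and the `rows` list are hypothetical names, not part of the site, and the secondary sort key (newest result) is left out because every Date cell in the table is empty.

```python
import re


def parse_rank(cell: str) -> tuple[int, int]:
    """Split a rank cell such as '#11/93' into (position, number after the slash)."""
    match = re.fullmatch(r"#(\d+)/(\d+)", cell.strip())
    if match is None:
        raise ValueError(f"unrecognised rank cell: {cell!r}")
    return int(match.group(1)), int(match.group(2))


# (metric, rank cell) pairs copied from the CPTU-Bench rows, deliberately shuffled.
rows = [
    ("sentiment", "#32/93"),
    ("tricky-questions", "#11/93"),
    ("phraseology", "#28/93"),
    ("average", "#20/93"),
    ("language-understanding", "#17/93"),
]

# Sort by position only; a date key would be added as a tie-breaker if dates were present.
ordered = sorted(rows, key=lambda row: parse_rank(row[1])[0])

for metric, cell in ordered:
    position, after_slash = parse_rank(cell)
    print(f"{metric}: position {position}, {after_slash} after the slash")
```

Sorting on the parsed position rather than the raw string matters because a plain string sort would place "#100" before "#11".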

§ 02 · Strengths by area

Where openai/gpt-oss-120b (API) actually performs.

Natural Language Processing · benchmark · avg rank #21.6
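The average rank is consistent with an unweighted mean of the five CPTU-Bench ranks listed in § 01. The short check below assumes that this is how the figure is derived; the page itself does not state the formula.

```python
from statistics import mean

# Rank positions from the five CPTU-Bench rows in § 01.
ranks = [11, 17, 20, 28, 32]

# Assumption: the area-level figure is a plain (unweighted) mean of these ranks.
avg_rank = mean(ranks)

print(f"avg rank #{avg_rank:.1f}")  # prints "avg rank #21.6"
```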

§ 05 · Sources & freshness

Where these numbers come from.

SpeakLeash/CPTU-Bench

results

5 of 5 rows marked verified.