Codesota · Models · meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 (API)meta-llama5 results · 1 benchmarks
Model card

meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 (API).

meta-llamaopen-source402B params
§ 01 · Benchmarks

Every benchmark meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 (API) has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01CPTU-BenchNatural Language Processing · Polish Text Understandingsentiment4.4%#6/93source ↗
02CPTU-BenchNatural Language Processing · Polish Text Understandinglanguage-understanding4.1%#11/93source ↗
03CPTU-BenchNatural Language Processing · Polish Text Understandingaverage3.9%#16/93source ↗
04CPTU-BenchNatural Language Processing · Polish Text Understandingtricky-questions3.8%#16/93source ↗
05CPTU-BenchNatural Language Processing · Polish Text Understandingphraseology3.5%#30/93source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 (API) actually performs.

Natural Language Processing
1
benchmark
avg rank #15.8
§ 04 · Related models

Other meta-llama models scored on Codesota.

Llama-3.3-70B-Instruct
70.6B params · 1 result
Llama-2-7b-chat-hf
0 results
Llama-2-7b-hf
0 results
Llama-3.2-1B
0 results
Llama-3.2-1B-Instruct
1.24B params · 0 results
Llama-3.2-3B
0 results
Llama-3.2-3B-Instruct
3.21B params · 0 results
Llama-4-Scout-17B-16E
0 results
§ 05 · Sources & freshness

Where these numbers come from.

SpeakLeash/CPTU-Bench
5
results
5 of 5 rows marked verified.