Codesota · Models · EuroLLM-9BUTTER7 results · 1 benchmarks
Model card

EuroLLM-9B.

UTTERopen-source
§ 01 · Benchmarks

Every benchmark EuroLLM-9B has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01PLCCNatural Language Processing · Polish Cultural Competencygeography54.0%#123/165source ↗
02PLCCNatural Language Processing · Polish Cultural Competencyculture-and-tradition40.0%#127/165source ↗
03PLCCNatural Language Processing · Polish Cultural Competencyart-and-entertainment30.0%#135/165source ↗
04PLCCNatural Language Processing · Polish Cultural Competencyaverage41.0%#136/165source ↗
05PLCCNatural Language Processing · Polish Cultural Competencyvocabulary34.0%#139/165source ↗
06PLCCNatural Language Processing · Polish Cultural Competencyhistory49.0%#141/165source ↗
07PLCCNatural Language Processing · Polish Cultural Competencygrammar39.0%#146/165source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where EuroLLM-9B actually performs.

Natural Language Processing
1
benchmark
avg rank #135.3
§ 05 · Sources & freshness

Where these numbers come from.

sdadas/PLCC
7
results
7 of 7 rows marked verified.