Model card

glm-4-9b-chat

THUDM · open-source · 9.4B params · 14 results · 3 benchmarks
§ 01 · Benchmarks

Every benchmark glm-4-9b-chat has a recorded score for.

| # | Benchmark | Area · Task | Metric | Value | Rank |
|---|---|---|---|---|---|
| 01 | Polish EQ-Bench | Natural Language Processing · Polish Emotional Intelligence | eq-score | 61.8% | #29/101 |
| 02 | CPTU-Bench | Natural Language Processing · Polish Text Understanding | language-understanding | 3.5% | #53/93 |
| 03 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | eq-bench | 53.8% | #55/299 |
| 04 | CPTU-Bench | Natural Language Processing · Polish Text Understanding | sentiment | 3.6% | #56/93 |
| 05 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | dyk | 67.0% | #59/489 |
| 06 | CPTU-Bench | Natural Language Processing · Polish Text Understanding | average | 3.0% | #65/93 |
| 07 | CPTU-Bench | Natural Language Processing · Polish Text Understanding | tricky-questions | 2.0% | #65/93 |
| 08 | CPTU-Bench | Natural Language Processing · Polish Text Understanding | phraseology | 2.8% | #68/93 |
| 09 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | polemo2-in | 81.6% | #86/490 |
| 10 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | average | 56.6% | #88/491 |
| 11 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | belebele | 86.6% | #89/490 |
| 12 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | ppc | 73.6% | #147/490 |
| 13 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | cbd | 27.9% | #177/490 |
| 14 | Open PL LLM Leaderboard | Natural Language Processing · Polish LLM General | polqa-open-book | 82.7% | #250/489 |
The Rank column shows this model’s position among all models scored on the same benchmark and metric, with the number of competitors after the slash. A #1 rank (shown in red) marks the current SOTA. Rows are sorted by rank, then by newest result.
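The rank notation can be unpacked programmatically. The following is a minimal sketch, assuming rank strings always take the form `#position/total` as in the rows above. The helper names `parse_rank` and `rank_fraction` are hypothetical, and dividing position by the number after the slash is an illustrative choice, not a metric Codesota defines.

```python
import re


def parse_rank(rank: str) -> tuple[int, int]:
    """Split a rank string such as '#29/101' into (position, total)."""
    match = re.fullmatch(r"#(\d+)/(\d+)", rank.strip())
    if match is None:
        raise ValueError(f"unrecognized rank format: {rank!r}")
    return int(match.group(1)), int(match.group(2))


def rank_fraction(rank: str) -> float:
    """Position divided by the number after the slash (lower is better).

    Assumption: the number after the slash is used as-is as the denominator.
    """
    position, total = parse_rank(rank)
    return position / total


if __name__ == "__main__":
    # Best and worst ranks from the listing above.
    for rank in ("#29/101", "#250/489"):
        position, total = parse_rank(rank)
        print(rank, position, total, round(rank_fraction(rank), 3))
```

Normalizing by the denominator makes ranks from benchmarks of different sizes roughly comparable: #29 of 101 and #250 of 489 are far apart as raw positions but closer as fractions.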
§ 02 · Strengths by area

Where glm-4-9b-chat actually performs.

Natural Language Processing · 3 benchmarks · avg rank #91.9
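The average rank appears consistent with an unweighted mean of the 14 rank positions listed in § 01. That reading is an inference from the numbers, not something the page documents. A short check, with the positions copied by hand from the Rank column:

```python
# Rank positions (the number before the slash) for rows 01-14 in § 01.
ranks = [29, 53, 55, 56, 59, 65, 65, 68, 86, 88, 89, 147, 177, 250]

# Unweighted mean: every benchmark/metric row counts equally,
# regardless of how many models were compared on it.
avg_rank = sum(ranks) / len(ranks)

print(f"avg rank #{avg_rank:.1f}")  # prints "avg rank #91.9"
```

Because the mean ignores the number of competitors per row, a #65 of 93 on CPTU-Bench weighs the same as a #59 of 489 on the Open PL LLM Leaderboard.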
§ 04 · Related models

Other THUDM models scored on Codesota.

chatglm3-6b · 0 results
chatglm3-6b-base · 0 results
glm-4-9b · 0 results
§ 05 · Sources & freshness

Where these numbers come from.

speakleash/open_pl_llm_leaderboard · 8 results
SpeakLeash/CPTU-Bench · 5 results
SpeakLeash/Polish-EQ-Bench · 1 result
14 of 14 rows marked verified.