Codesota · Models · Grok-3-Beta · xAI · 7 results · 1 benchmark
Model card

Grok-3-Beta.

xAI · open-source
§ 01 · Benchmarks

Every benchmark Grok-3-Beta has a recorded score for.

#  | Benchmark | Area · Task | Metric | Value | Rank
01 | PLCC | Natural Language Processing · Polish Cultural Competency | culture-and-tradition | 90.0% | #15/165
02 | PLCC | Natural Language Processing · Polish Cultural Competency | art-and-entertainment | 71.0% | #38/165
03 | PLCC | Natural Language Processing · Polish Cultural Competency | history | 85.0% | #40/165
04 | PLCC | Natural Language Processing · Polish Cultural Competency | average | 77.2% | #43/165
05 | PLCC | Natural Language Processing · Polish Cultural Competency | vocabulary | 69.0% | #46/165
06 | PLCC | Natural Language Processing · Polish Cultural Competency | geography | 83.0% | #50/165
07 | PLCC | Natural Language Processing · Polish Cultural Competency | grammar | 65.0% | #67/165
The Rank column shows this model's position among all models scored on the same benchmark and metric; the number after the slash is the count of competitors. A rank of #1 marks the current state of the art (SOTA). Rows are sorted by rank, then by newest result.
§ 02 · Strengths by area

Where Grok-3-Beta actually performs.

Natural Language Processing: 1 benchmark, avg rank #42.7
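The page does not say how the average rank is derived. As a rough check, the sketch below assumes it is the unweighted mean of the seven PLCC ranks listed in § 01 (including the "average" metric row), and that the "average" score is the unweighted mean of the six category scores. Both assumptions reproduce the displayed figures, but the function names and the aggregation rule are illustrative, not Codesota's documented method.

```python
from statistics import mean

# Ranks and scores copied from the PLCC rows in § 01.
plcc_rows = {
    "culture-and-tradition": (90.0, 15),
    "art-and-entertainment": (71.0, 38),
    "history": (85.0, 40),
    "average": (77.2, 43),
    "vocabulary": (69.0, 46),
    "geography": (83.0, 50),
    "grammar": (65.0, 67),
}


def average_rank(rows):
    """Unweighted mean of all ranks (assumed aggregation), to one decimal."""
    return round(mean(rank for _, rank in rows.values()), 1)


def average_score(rows):
    """Unweighted mean of the category scores, excluding the 'average' row."""
    scores = [score for name, (score, _) in rows.items() if name != "average"]
    return round(mean(scores), 1)


print(average_rank(plcc_rows))   # 42.7
print(average_score(plcc_rows))  # 77.2
```

Under these assumptions the ranks sum to 299 across seven rows, giving 42.7, and the six category scores sum to 463, giving 77.2.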
§ 04 · Related models

Other xAI models scored on Codesota.

Grok 2: 4 results
Grok 4: 4 results
Grok 3: 1 result
Grok Code Fast 1: 1 result
Grok-2-1212: 0 results
Grok-3-Mini-Beta: 0 results
Grok-4-Fast: 0 results
Grok-4.1-Fast: 0 results
§ 05 · Sources & freshness

Where these numbers come from.

sdadas/PLCC: 7 results
7 of 7 rows marked verified.