Codesota · Tasks · Polish Emotional Intelligence

Polish Emotional Intelligence.

Evaluating language models on emotional intelligence in Polish: understanding emotional states, predicting emotional responses, and nuanced sentiment analysis.

1 dataset · 101 results · canonical metric: eq-score

§ 02 · Canonical benchmark

The reference dataset.

Polish EQ-Bench

Evaluates LLMs on emotional intelligence in Polish. Based on the EQ-Bench v2 methodology, adapted for the Polish language. Models predict changes in emotional intensity across 171 questions. The score is adjusted for parseability: Benchmark Score × (Parseable / 171), so a model loses points in proportion to the share of its answers that could not be parsed. Created by SpeakLeash.
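The parseability adjustment can be illustrated with a short sketch. It assumes that "Parseable" is the count of questions whose answers could be parsed, out of the 171 stated above. The function name `adjusted_eq_score` and its parameters are hypothetical and are not taken from the SpeakLeash code.

```python
TOTAL_QUESTIONS = 171  # question count stated in the benchmark description


def adjusted_eq_score(benchmark_score: float, parseable: int,
                      total: int = TOTAL_QUESTIONS) -> float:
    """Scale a raw benchmark score by the fraction of parseable answers.

    Hypothetical helper: implements Benchmark Score x (Parseable / 171)
    as described on this page, nothing more.
    """
    if not 0 <= parseable <= total:
        raise ValueError("parseable must be between 0 and total")
    return benchmark_score * (parseable / total)


# A model with every answer parseable keeps its raw score.
full = adjusted_eq_score(72.0, 171)

# With 152 of 171 answers parseable (a fraction of 8/9), the score
# drops to approximately 64.0.
partial = adjusted_eq_score(72.0, 152)
```

Under this adjustment, two models with the same raw score can rank differently when one of them fails to produce parseable output for some questions.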

Primary metric: eq-score


§ 03 · Top 10

Leading models.

Leading models on Polish EQ-Bench.

#	Model	eq-score	Year	Source
★	mistralai/Mistral-Large-Instruct-2407✓	78.1	2026	paper ↗
2	mistralai/Mistral-Large-Instruct-2411✓	77.3	2026	paper ↗
3	Meta-Llama-3.1-405B-Instruct-FP8✓	77.2	2026	paper ↗
4	GPT-4o-2024-08-06✓	75.2	2026	paper ↗
5	gpt-4-turbo-2024-04-09✓	74.6	2026	paper ↗
6	speakleash/Bielik-11B-v2.6-Instruct✓	73.7	2026	paper ↗
7	deepseek-ai/DeepSeek-V3-0324 (API)✓	73.5	2026	paper ↗
8	Mistral-Small-Instruct-2409✓	72.8	2026	paper ↗
9	CYFRAGOVPL/Llama-PLLuM-70B-chat✓	72.6	2026	paper ↗
10	meta-llama/Meta-Llama-3.1-70B-Instruct✓	72.5	2026	paper ↗


§ 04 · All datasets

Tracked datasets.

1 dataset tracked for this task.

Polish EQ-Bench

CANONICAL

101 results · eq-score

Top: mistralai/Mistral-Large-Instruct-2407 — 78.1

§ 05 · Related tasks

Other tasks in Natural Language Processing.

Feature Extraction · Fill-Mask · Named Entity Recognition · Natural Language Inference · Polish Conversation Quality · Polish Cultural Competency · Polish LLM General · Polish Text Understanding
