KITAB-Bench

MBZUAI

8,809 Arabic text samples across 9 domains. Tests Arabic script recognition.

Benchmark Stats

Models8
Papers8
Metrics1

SOTA History

Not enough data to show trend.

Character Error Rate

Levenshtein distance between predicted and ground truth (lower is better)

Lower is better

RankModelSourceScoreYearPaper
1gemini-20-flash

Arabic OCR - Character Error Rate (lower is better). 8,809 samples, 9 domains

Editorial0.132025Source
2ain-7bEditorial0.22025Source
3gpt-4oEditorial0.312025Source
4gpt-4o-miniEditorial0.432025Source
5azure-ocrEditorial0.522025Source
6tesseractEditorial0.542025Source
7easyocrEditorial0.582025Source
8paddleocrEditorial0.792025Source

Submit a Result