Codesota · Tasks · Document ParsingHome/Tasks/Computer Vision/Document Parsing

Computer Vision

Document Parsing.

Parsing document structure and content

3

Datasets

149

Results

composite

Canonical metric

§ 02 · Canonical benchmark

The reference dataset.

OmniDocBench

981 annotated PDF pages across 9 document categories. Tests end-to-end document parsing including text, tables, and formulas.

Primary metric: composite

View full leaderboard →

§ 03 · Top 10

Leading models.

Leading models on OmniDocBench.

#	Model	layout-map	Year	Source
★	MinerU 2.5	97.5	2025	paper ↗
2	GLM-OCR	94.6	2026	paper ↗
3	GLM-OCR	94.6	2026	paper ↗
4	PaddleOCR-VL-1.5	94.5	2026	paper ↗
5	PaddleOCR-VL	93.5	2025	paper ↗
6	Qianfan-OCR	93.1	2026	paper ↗
7	Qianfan-OCR	93.1	2026	paper ↗
8	FireRed-OCR-2B	92.9	2026	paper ↗
9	PaddleOCR-VL	92.9	2025	paper ↗
10	PaddleOCR-VL 0.9B	92.6	2025	paper ↗

What were you looking for on Document Parsing?

Didn't find the model, metric, or dataset you needed? Tell us in one line. We read every message and reply within 48 hours.

§ 04 · All datasets

Tracked datasets.

3 datasets tracked for this task.

61 results · composite

Top: MinerU 2.5 — 97.5

74 results · pass-rate

Top: Chandra v0.1.0 — 99.9

14 results · accuracy

Top: LlamaParse Agentic — 84.9

§ 05 · Related tasks

Other tasks in Computer Vision.

3D Understanding Depth estimation Document Image Classification Document Layout Analysis Document Understanding General OCR Capabilities Handwriting Recognition Image Classification

Reply within 48 hours · No newsletter

Didn't find what you came for?

Still looking for something on Document Parsing? A missing model, a stale score, a benchmark we should cover — drop it here and we'll handle it.

Real humans read every message. We track what people are asking for and prioritize accordingly.