Qianfan-OCR.

Baidu Qianfanopen-source4B paramsEnd-to-end VLM (4B params)Apache 2.03 current SOTA

Unified end-to-end document intelligence model. Highest overall score among end-to-end models on olmOCR-Bench (79.8).

§ 01 · Benchmarks

Every benchmark Qianfan-OCR has a recorded score for.

#	Benchmark	Area · Task	Metric	Value	Rank	Date	Source
01	OmniDocBench	Computer Vision · Document Parsing	formula-cdm	92.4%	#1/1	—	source ↗
02	olmOCR-Bench	Computer Vision · Document Parsing	multi-column	92.2%	#1/4	—	source ↗
03	olmOCR-Bench	Computer Vision · Document Parsing	old-scans	73.1%	#1/5	—	source ↗
04	OmniDocBench	Computer Vision · Document Parsing	table-teds	91.0%	#2/4	—	source ↗
05	OmniDocBench	Computer Vision · Document Parsing	composite	93.1%	#3/33	—	source ↗
06	OmniDocBench	Computer Vision · Document Parsing	text-edit-distance	0.0%	#3/3	—	source ↗
07	olmOCR-Bench	Computer Vision · Document Parsing	base	99.6%	#3/4	—	source ↗
08	olmOCR-Bench	Computer Vision · Document Parsing	long-tiny-text	80.4%	#4/4	—	source ↗
09	olmOCR-Bench	Computer Vision · Document Parsing	headers-footers	42.0%	#4/4	—	source ↗
10	olmOCR-Bench	Computer Vision · Document Parsing	arxiv	80.1%	#5/5	—	source ↗
11	olmOCR-Bench	Computer Vision · Document Parsing	tables	81.6%	#5/5	—	source ↗
12	olmOCR-Bench	Computer Vision · Document Parsing	pass-rate	79.8%	#7/21	—	source ↗

Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.

§ 02 · Strengths by area

Where Qianfan-OCR actually performs.

Computer Vision

benchmarks

avg rank #3.3 · 3 SOTA

§ 05 · Sources & freshness

Where these numbers come from.

paper

results

Hugging Face

results

0 of 12 rows marked verified.