Model card
Qianfan-OCR.
Baidu Qianfan · open-source · End-to-end VLM (4B params) · Apache 2.0 · 3 current SOTA
Unified end-to-end document intelligence model. Highest overall score among end-to-end models on olmOCR-Bench (79.8).
§ 01 · Benchmarks
Every benchmark Qianfan-OCR has a recorded score for.
| # | Benchmark | Area · Task | Metric | Value | Rank | Date | Source |
|---|---|---|---|---|---|---|---|
| 01 | OmniDocBench | Computer Vision · Document Parsing | formula-cdm | 92.4% | #1 | — | source ↗ |
| 02 | olmOCR-Bench | Computer Vision · Document Parsing | multi-column | 92.2% | #1 | — | source ↗ |
| 03 | olmOCR-Bench | Computer Vision · Document Parsing | old-scans | 73.1% | #1 | — | source ↗ |
| 04 | OmniDocBench | Computer Vision · Document Parsing | table-teds | 91.0% | #2 | — | source ↗ |
| 05 | OmniDocBench | Computer Vision · Document Parsing | composite | 93.1% | #3 | — | source ↗ |
| 06 | OmniDocBench | Computer Vision · Document Parsing | text-edit-distance | 0.0% | #3 | — | source ↗ |
| 07 | olmOCR-Bench | Computer Vision · Document Parsing | base | 99.6% | #3 | — | source ↗ |
| 08 | olmOCR-Bench | Computer Vision · Document Parsing | long-tiny-text | 80.4% | #4 | — | source ↗ |
| 09 | olmOCR-Bench | Computer Vision · Document Parsing | headers-footers | 42.0% | #4 | — | source ↗ |
| 10 | olmOCR-Bench | Computer Vision · Document Parsing | arxiv | 80.1% | #5 | — | source ↗ |
| 11 | olmOCR-Bench | Computer Vision · Document Parsing | tables | 81.6% | #5 | — | source ↗ |
| 12 | olmOCR-Bench | Computer Vision · Document Parsing | pass-rate | 79.8% | #7 | — | source ↗ |
The Rank column shows this model's position against all other models scored on the same benchmark and metric. #1 marks the current SOTA. Rows are sorted by rank, then by newest result.
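The ordering described in the note above can be sketched in a few lines. This is a minimal illustration, not the page's actual implementation: the `Row` type, the `sort_key` helper, and the treatment of undated rows (placed last within a rank) are assumptions. The sample rows are a subset copied from the table, with `None` standing in for the dash in the Date column.

```python
from datetime import date
from typing import List, NamedTuple, Optional


class Row(NamedTuple):
    # Hypothetical record type for one table row.
    benchmark: str
    metric: str
    value: float                  # percentage as shown in the table
    rank: int
    result_date: Optional[date]   # None where the table shows a dash


# Subset of the table above, deliberately out of order.
ROWS = [
    Row("olmOCR-Bench", "pass-rate", 79.8, 7, None),
    Row("OmniDocBench", "table-teds", 91.0, 2, None),
    Row("OmniDocBench", "formula-cdm", 92.4, 1, None),
    Row("olmOCR-Bench", "old-scans", 73.1, 1, None),
    Row("olmOCR-Bench", "multi-column", 92.2, 1, None),
    Row("OmniDocBench", "composite", 93.1, 3, None),
]


def sort_key(row: Row):
    # Rank ascending; within a rank, newer dates first.
    # Assumption: undated rows get ordinal 0, so they sort after dated ones.
    ordinal = row.result_date.toordinal() if row.result_date else 0
    return (row.rank, -ordinal)


def sorted_rows(rows: List[Row]) -> List[Row]:
    # sorted() is stable, so rows with equal keys keep their input order.
    return sorted(rows, key=sort_key)


def sota_count(rows: List[Row]) -> int:
    # Number of rows where the model holds rank #1.
    return sum(1 for r in rows if r.rank == 1)


if __name__ == "__main__":
    for r in sorted_rows(ROWS):
        print(f"#{r.rank}  {r.benchmark}  {r.metric}  {r.value}%")
    print("current SOTA rows:", sota_count(ROWS))
```

Counting the rank #1 rows this way gives the same figure as the "3 current SOTA" tag in the header.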
§ 02 · Strengths by area
Where Qianfan-OCR actually performs.
§ 05 · Sources & freshness
Where these numbers come from.
paper: 8 results
Hugging Face: 4 results
0 of 12 rows marked verified.