Codesota · Models · HTR-ConvTextDAIR-Group6 results · 3 benchmarks
Model card

HTR-ConvText.

DAIR-Groupunknown65.9M paramsCNN+Transformer hybrid (ConvText block)

Handwritten Text Recognition model combining convolution and textual information. Uses a convolutional feature extractor with cross-attention over character embeddings to improve recognition of historical handwriting. 65.9M parameters.

§ 01 · Benchmarks

Every benchmark HTR-ConvText has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01read2016(line-level)Computer Vision · Optical Character Recognitiontest-wer15.7%#5/52024-12-06source ↗
02read2016(line-level)Computer Vision · Optical Character Recognitiontest-cer3.6%#6/62024-12-06source ↗
03IAMComputer Vision · Handwriting Recognitionwer12.9%#6/10source ↗
04lam(line-level)Computer Vision · Optical Character Recognitiontest-cer2.7%#7/72024-12-06source ↗
05lam(line-level)Computer Vision · Optical Character Recognitiontest-wer7.0%#7/72024-12-06source ↗
06IAMComputer Vision · Handwriting Recognitioncer4.0%#13/22source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where HTR-ConvText actually performs.

Computer Vision
3
benchmarks
avg rank #7.3
§ 03 · Papers

1 paper with results for HTR-ConvText.

  1. 2024-12-06· 4 results

    HTR-ConvText: Leveraging Convolution and Textual Information for Handwritten Text Recognition

§ 05 · Sources & freshness

Where these numbers come from.

arxiv
6
results
6 of 6 rows marked verified.