Codesota · Models · ABINet-LV · Fang et al. · 5 results · 5 benchmarks
Model card

ABINet-LV.

Fang et al. · open-source · Unknown params · ResNet + Bidirectional Language Model (LV)

ABINet Language-Vision model from CVPR 2021. arxiv:2103.04049.

§ 01 · Benchmarks

Every benchmark ABINet-LV has a recorded score for.

| #  | Benchmark  | Area · Task                                | Metric   | Value | Rank   | Date       | Source   |
|----|------------|--------------------------------------------|----------|-------|--------|------------|----------|
| 01 | icdar-2013 | Computer Vision · Scene Text Detection     | accuracy | 97.0% | #14/15 | 2021-03-06 | source ↗ |
| 02 | svtp       | Computer Vision · Scene Text Recognition   | accuracy | 89.5% | #15/19 | 2021-03-06 | source ↗ |
| 03 | iiit5k     | Computer Vision · Scene Text Recognition   | accuracy | 96.4% | #17/21 | 2021-03-06 | source ↗ |
| 04 | cute80     | Computer Vision · Scene Text Recognition   | accuracy | 89.2% | #18/20 | 2021-03-06 | source ↗ |
| 05 | svt        | Computer Vision · Scene Text Recognition   | accuracy | 93.4% | #22/40 | 2021-03-06 | source ↗ |

Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
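The rank notation and sort order described above can be sketched in a few lines of Python. This is only an illustration under assumptions: the `rows` list is hand-copied from the table, and `parse_rank` and `sort_key` are hypothetical helper names, not part of any Codesota API.

```python
from datetime import date

# Rows hand-copied from the benchmark table: (benchmark, rank string, date).
rows = [
    ("svt", "#22/40", date(2021, 3, 6)),
    ("cute80", "#18/20", date(2021, 3, 6)),
    ("icdar-2013", "#14/15", date(2021, 3, 6)),
    ("iiit5k", "#17/21", date(2021, 3, 6)),
    ("svtp", "#15/19", date(2021, 3, 6)),
]

def parse_rank(rank: str) -> tuple[int, int]:
    """Split a string like '#14/15' into (position, number after the slash)."""
    position, competitors = rank.lstrip("#").split("/")
    return int(position), int(competitors)

def sort_key(row):
    """Order by rank position first, then by newest date (assumed tie-break)."""
    _, rank, recorded = row
    position, _ = parse_rank(rank)
    return (position, -recorded.toordinal())

ordered = sorted(rows, key=sort_key)
for name, rank, recorded in ordered:
    print(name, rank, recorded.isoformat())
```

With the five rows above, the loop prints the benchmarks in the same order as the table, since all dates are identical and only the rank position decides the order.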
§ 02 · Strengths by area

Where ABINet-LV actually performs.

Computer Vision · 5 benchmarks · avg rank #17.2
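The average rank is consistent with a plain arithmetic mean of the five rank positions in the benchmark table. A minimal check, assuming that is how the figure is derived (the page does not state the formula):

```python
from statistics import mean

# Rank positions copied from the benchmark table (the number before the slash).
ranks = {
    "icdar-2013": 14,
    "svtp": 15,
    "iiit5k": 17,
    "cute80": 18,
    "svt": 22,
}

# Assumed definition: unweighted mean over all benchmarks in the area.
avg_rank = round(mean(ranks.values()), 1)
print(f"avg rank #{avg_rank}")
```

The sum of the positions is 86, and 86 / 5 = 17.2, which matches the figure shown for Computer Vision.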
§ 03 · Papers

1 paper with results for ABINet-LV.

  1. 2021-03-06 · Computer Vision · 5 results

    Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition

§ 04 · Related models

Other Fang et al. models scored on Codesota.

DPNet (ResNet-50, 736px)
Unknown params · 0 results
§ 05 · Sources & freshness

Where these numbers come from.

arxiv · 5 results
5 of 5 rows marked verified.