Codesota · Models · CLIP4STR-BResearch12 results · 12 benchmarks
Model card

CLIP4STR-B.

ResearchunknownUnknown paramsUnknown

CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model. Base variant. Exploits CLIP pre-training for robust scene text features. Strong on Union14M benchmark. arXiv 2305.14014.

§ 01 · Benchmarks

Every benchmark CLIP4STR-B has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01Union14MComputer Vision · Scene Text Detectionaccuracy70.8%#1/8source ↗
02hostComputer Vision · Scene Text Recognition1-1-accuracy79.8%#2/32023-05-23source ↗
03coco-textComputer Vision · Scene Text Detection1-1-accuracy81.1%#3/32023-05-23source ↗
04ic19-artComputer Vision · Scene Text Detectionaccuracy85.8%#3/42023-05-23source ↗
05uber-textComputer Vision · Scene Text Recognitionaccuracy86.8%#3/32023-05-23source ↗
06cute80Computer Vision · Scene Text Recognitionaccuracy99.3%#4/202023-05-23source ↗
07wostComputer Vision · Scene Text Recognition1-1-accuracy87.0%#4/52023-05-23source ↗
08icdar2013Computer Vision · Optical Character Recognitionaccuracy98.3%#5/362023-05-23source ↗
09svtpComputer Vision · Scene Text Recognitionaccuracy97.2%#5/192023-05-23source ↗
10iiit5kComputer Vision · Scene Text Recognitionaccuracy99.2%#6/212023-05-23source ↗
11svtComputer Vision · Scene Text Recognitionaccuracy98.3%#7/402023-05-23source ↗
12icdar2015Computer Vision · Optical Character Recognitionaccuracy90.6%#8/292023-05-23source ↗
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where CLIP4STR-B actually performs.

Computer Vision
12
benchmarks
avg rank #4.3
§ 03 · Papers

1 paper with results for CLIP4STR-B.

  1. 2023-05-23· Computer Vision· 11 results

    CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model

§ 04 · Related models

Other Research models scored on Codesota.

DenseNet-121 (Chest X-ray)
8M params · 4 results · 2 SOTA
SimpleNet
2 results · 2 SOTA
DGN
1 result · 1 SOTA
DeepASD
1 result · 1 SOTA
DefectDet (ResNet)
1 result · 1 SOTA
PROXI
1 result · 1 SOTA
ASD-SWNet
2 results
ASDFormer
2 results
§ 05 · Sources & freshness

Where these numbers come from.

papers-with-code
11
results
arxiv-paper
1
result
11 of 12 rows marked verified.