Codesota · Models · DAT-DET
Wan et al. (Baidu) · 3 results · 1 benchmark
Model card

DAT-DET

Wan et al. (Baidu) · open-source · unknown params
Interactive attention transformer for multi-granularity text detection

Detection head of DAT (Dual-granularity Attention Transformer), a unified model for text detection at the stroke, word, line, and paragraph levels. ICML 2024. arXiv:2405.19765.

§ 01 · Benchmarks

Every benchmark DAT-DET has a recorded score for.

#    Benchmark    Area · Task                               Metric      Value   Rank    Date         Source
01   Total-Text   Computer Vision · Scene Text Detection    f-measure   91.0%   #2/33   2024-05-30   source ↗
02   Total-Text   Computer Vision · Scene Text Detection    precision   94.0%   #2/30   2024-05-30   source ↗
03   Total-Text   Computer Vision · Scene Text Detection    recall      88.2%   #3/30   2024-05-30   source ↗
The Rank column shows this model's position among all models scored on the same benchmark and metric (competitor count after the slash). #1 in red marks the current SOTA. Rows are sorted by rank, then by newest result.
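As a concrete reading of that notation, a minimal sketch with hypothetical competitor names and scores (only DAT-DET's 91.0% f-measure comes from the table above):

```python
# Hypothetical Total-Text f-measure leaderboard; only DAT-DET's 91.0 is real,
# "model_a" and "model_b" are made-up competitors for illustration.
scores = {"DAT-DET": 91.0, "model_a": 91.3, "model_b": 90.5}

# Rank = 1 + number of models with a strictly higher score on this metric.
rank = 1 + sum(s > scores["DAT-DET"] for s in scores.values())
print(f"#{rank}/{len(scores)}")  # prints "#2/3"
```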
§ 02 · Strengths by area

Where DAT-DET actually performs.

Computer Vision · 1 benchmark · avg rank #2.3

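The average rank follows directly from the three Total-Text rows; a minimal sketch using the ranks from the benchmarks table:

```python
# Per-metric ranks for DAT-DET on Total-Text, taken from the benchmarks table.
ranks = {"f-measure": 2, "precision": 2, "recall": 3}

# Average rank across all scored metrics: (2 + 2 + 3) / 3.
avg_rank = sum(ranks.values()) / len(ranks)
print(f"avg rank #{avg_rank:.1f}")  # prints "avg rank #2.3"
```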
§ 03 · Papers

1 paper with results for DAT-DET.

  1. 2024-05-30 · Computer Vision · 3 results

     Towards Unified Multi-granularity Text Detection with Interactive Attention

§ 04 · Related models

Other Wan et al. (Baidu) models scored on Codesota.

DAT-SEG
Unknown params · 0 results

§ 05 · Sources & freshness

Where these numbers come from.

arxiv · 3 results
3 of 3 rows marked verified.