Model card

AudioCaps baseline (TopDown+Align).

Kim et al. · open-source · Unknown params · VGGish + Top-Down attention + alignment loss · 1 current SOTA

Original AudioCaps paper baseline (NAACL 2019).

§ 01 · Benchmarks

Every benchmark AudioCaps baseline (TopDown+Align) has a recorded score for.

# | Benchmark | Area · Task | Metric | Value | Rank | Date | Source
01 | AudioCaps | Audio · Audio Captioning | spider | 0.4% | #1/3 | | source

The Rank column shows this model's position against all other models scored on the same benchmark and metric (competitors after the slash). A rank of #1 marks the current SOTA. Rows are sorted by rank, then by newest result.

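The ranking rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Result` record, the `rank_label` and `sort_rows` helpers, and the sample scores are all hypothetical, higher values are assumed to be better, and the number after the slash is assumed to be the total count of models scored on that benchmark and metric.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class Result:
    # Hypothetical record shape; not the site's actual schema.
    model: str
    benchmark: str
    metric: str
    value: float
    recorded: Optional[date] = None  # a result may have no date


def rank_label(results, model, benchmark, metric, higher_is_better=True):
    """Return '#position/total' for one model on one benchmark + metric."""
    peers = [r for r in results
             if r.benchmark == benchmark and r.metric == metric]
    peers.sort(key=lambda r: r.value, reverse=higher_is_better)
    position = next(i for i, r in enumerate(peers, start=1)
                    if r.model == model)
    return f"#{position}/{len(peers)}"


def sort_rows(rows):
    """Sort (position, recorded) rows by rank, then newest result first.

    Undated rows are placed after dated rows of the same rank.
    """
    def key(row):
        position, recorded = row
        newest_first = -recorded.toordinal() if recorded else 0
        return (position, newest_first)
    return sorted(rows, key=key)


# Made-up scores, for illustration only.
demo = [
    Result("model-a", "demo-bench", "score", 0.3),
    Result("model-b", "demo-bench", "score", 0.5),
    Result("model-c", "demo-bench", "score", 0.4),
    Result("model-a", "other-bench", "score", 0.9),
]

print(rank_label(demo, "model-b", "demo-bench", "score"))
```

With the made-up scores above, `model-b` has the highest value among the three models on `demo-bench`, so it is ranked first of three; results on `other-bench` do not affect that rank.
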
§ 02 · Strengths by area

Where AudioCaps baseline (TopDown+Align) actually performs.

Audio: 1 benchmark · avg rank #1.0 · 1 SOTA

§ 05 · Sources & freshness

Where these numbers come from.

editorial: 1 result
0 of 1 rows marked verified.