Translating spoken audio directly to another language.
Multilingual Speech Translation Corpus built from TED talks. The English-German tst-COMMON split is the de-facto benchmark for end-to-end speech translation. BLEU on tst-COMMON is the primary metric.
Didn't find the model, metric, or dataset you needed? Tell us in one line. We read every message and reply within 48 hours.
Still looking for something on Speech Translation? A missing model, a stale score, a benchmark we should cover — drop it here and we'll handle it.
Real humans read every message. We track what people are asking for and prioritize accordingly.