Codesota · Benchmark · LibriSpeech
Johns Hopkins University

LibriSpeech.

Roughly 1,000 hours of read English speech from LibriVox audiobooks. Standard benchmark for automatic speech recognition, with a clean test split (test-clean) and a harder, noisier one (test-other).

§ 01 · SOTA history

Year over year.

§ 02 · Leaderboard

Results by metric.

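WER

WER (word error rate) is the reported evaluation metric for LibriSpeech: the number of word substitutions, deletions, and insertions needed to turn a model's transcript into the reference, divided by the number of reference words. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.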

Lower is better
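
As an illustration of how a WER score is typically derived, here is a minimal sketch that computes word-level edit distance between a reference and a hypothesis transcript. The function name `word_error_rate` and the normalization (lowercasing and whitespace splitting) are assumptions made for this example; the scores in the table may have been produced with different text normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a percentage: 100 * (S + D + I) / N, with N reference words."""
    # Assumed normalization: lowercase and split on whitespace.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    # One deleted word out of six reference words.
    print(round(word_error_rate("the cat sat on the mat", "the cat sat on mat"), 2))
```

Because insertions count as errors, WER can exceed 100 when the hypothesis is much longer than the reference.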

Trust tiers for WER: verified, paper, vendor, community, unverified
| Rank | Model | Trust | Score | Year | Links |
|---|---|---|---|---|---|
| 01 | Granite 4.0 1B Speech | unverified | 2.85 | 2025 | Paper, Source |
| 02 | Granite Speech 3.3 8B | unverified | 2.86 | 2025 | Paper, Source |
| 03 | Canary-1B-Flash | unverified | 2.87 | 2025 | Paper, Source |
| 04 | Canary-1B | unverified | 2.93 | 2024 | Paper |
| 05 | Distil-Whisper Large v2 | unverified | 2.94 | 2023 | Paper, Code |
| 06 | Whisper Medium (English) | unverified | 3.02 | 2022 | Paper, Code |
| 07 | Whisper-small.en | unverified | 3.05 | 2022 | Paper, Code, Source |
| 08 | Whisper Small (English) | unverified | 3.05 | 2022 | Paper, Code |
| 09 | Llama 3 Speech (70B) | unverified | 3.10 | 2024 | Paper, Code |
| 10 | Llama 3 (405B, Instruct) | unverified | 3.10 | 2024 | Paper, Code |
| 11 | Canary-Qwen-2.5B | unverified | 3.10 | 2025 | Paper, Source |
| 12 | Parakeet-tdt-0.6b-v2 | unverified | 3.19 | 2023 | Paper, Code, Source |
| 13 | Granite Speech 3.3 2B | unverified | 3.26 | 2025 | Paper, Source |
| 14 | Voxtral-Small-24B-2507 | unverified | 3.26 | 2025 | Paper, Source |
| 15 | Moonshine-base | unverified | 3.38 | 2024 | Paper, Code, Source |
| 16 | Distil-Whisper Small (English) | unverified | 3.48 | 2023 | Paper, Code |
| 17 | Parakeet-ctc-1.1b | unverified | 3.51 | 2023 | Paper, Source |
| 18 | Canary-1b-v2 | unverified | 3.56 | 2025 | Paper |
| 19 | Parakeet-tdt-0.6b-v3 | unverified | 3.59 | 2023 | Paper, Code, Source |
| 20 | Distil-Whisper Medium (English) | unverified | 3.69 | 2023 | Paper, Code |
| 21 | Parakeet-ctc-0.6b | unverified | 3.80 | 2023 | Paper, Source |
| 22 | Phi-4 Multimodal Instruct | unverified | 3.82 | 2025 | Paper, Source |
| 23 | Asr-wav2vec2-librispeech | unverified | 3.83 | 2021 | Paper, Code, Source |
| 24 | Lite-whisper-large-v3-acc | unverified | 3.91 | 2025 | Paper, Code, Source |
| 25 | Whisper Large v3 | unverified | 3.91 | 2022 | Paper, Code, Source |
| 26 | Stt_en_fastconformer_transducer_large | unverified | 3.97 | 2023 | Paper, Source |
| 27 | Stt_en_fastconformer_ctc_large | unverified | 4.04 | 2023 | Paper, Source |
| 28 | Stt_en_conformer_ctc_large | unverified | 4.15 | 2020 | Paper, Code, Source |
| 29 | Whisper Large v3 Turbo | unverified | 4.24 | 2022 | Paper, Code, Source |
| 30 | Asr-conformer-loquacious | unverified | 4.24 | 2025 | Paper |
| 31 | Whisper base | unverified | 4.25 | 2022 | Paper, Code |
| 32 | Canary-180M-Flash | unverified | 4.35 | 2025 | Paper, Source |
| 33 | Lite-whisper-large-v3 | unverified | 4.40 | 2025 | Paper, Code, Source |
| 34 | Qwen3-ASR-0.6B | unverified | 4.45 | 2026 | Paper, Code, Source |
| 35 | SYMPHONY | unverified | 4.48 | 2025 | Paper |
| 36 | Moonshine-streaming-tiny | unverified | 4.50 | 2026 | Paper |
| 37 | Moonshine-tiny | unverified | 4.55 | 2024 | Paper, Code, Source |
| 38 | Lite-whisper-large-v3-turbo-acc | unverified | 4.60 | 2025 | Paper, Code, Source |
| 39 | Owsm_ctc_v4_1B | unverified | 4.89 | 2025 | Paper, Code, Source |
| 40 | Moonshine Streaming Medium | unverified | 5.00 | 2026 | Paper, Source |
| 41 | Distil-large-v3.5 | unverified | 5.04 | 2023 | Paper, Code, Source |
| 42 | Zipformer-transducer-XL-290M | unverified | 5.04 | 2023 | Paper, Code, Source |
| 43 | Whisper Large v2 | unverified | 5.14 | 2022 | Paper, Code, Source |
| 44 | Owsm_ctc_v3.1_1B | unverified | 5.15 | 2024 | Paper, Code, Source |
| 45 | Distil-large-v3 | unverified | 5.19 | 2023 | Paper, Code, Source |
| 46 | Lite-whisper-large-v3-fast | unverified | 5.19 | 2025 | Paper, Code, Source |
| 47 | Parakeet-tdt_ctc-110m | unverified | 5.22 | 2023 | Paper, Code, Source |
| 48 | Voxtral-Mini-4B-Realtime-2602 | unverified | 5.52 | 2026 | Paper, Source |
| 49 | Whisper Large | unverified | 5.54 | 2022 | Paper, Code, Source |
| 50 | Whisper Tiny (English) | unverified | 5.66 | 2022 | Paper, Code |
| 51 | Moshi ASR | unverified | 5.70 | 2024 | Paper, Code |
| 52 | Whisper-medium.en | unverified | 5.85 | 2022 | Paper, Code, Source |
| 53 | Moonshine-streaming-small | unverified | 6.78 | 2026 | Paper, Source |
| 54 | Distil-large-v2 | unverified | 6.84 | 2023 | Paper, Code, Source |
| 55 | Distil-small.en | unverified | 7.73 | 2023 | Paper, Code, Source |
| 56 | Stt_en_conformer_ctc_small | unverified | 7.92 | 2020 | Paper, Code, Source |
| 57 | Distil-medium.en | unverified | 8.35 | 2023 | Paper, Code, Source |
| 58 | Niagara-38m-batch.en | unverified | 9.35 | 2026 | Paper, Source |
| 59 | Whisper-base.en | unverified | 10.35 | 2022 | Paper, Code, Source |
| 60 | Niagara-19m-batch.en | unverified | 11.2 | 2026 | Paper |
| 61 | Hubert-xlarge-ls960-ft | unverified | 12.22 | 2021 | Paper, Code, Source |
| 62 | Wav2vec2-large-960h-lv60-self | unverified | 12.42 | 2020 | Paper, Code, Source |
| 63 | Wav2vec2-conformer-rel-pos-large-960h-ft | unverified | 12.44 | 2020 | Paper, Code, Source |
| 64 | Wav2vec2-base-960h | unverified | 12.53 | 2020 | Paper, Code, Source |
| 65 | Wav2vec2-conformer-rope-large-960h-ft | unverified | 12.54 | 2020 | Paper, Code, Source |
| 66 | Mms-1b-all | unverified | 12.63 | 2023 | Paper, Code, Source |
| 67 | Hubert-large-ls960-ft | unverified | 12.75 | 2021 | Paper, Code, Source |
| 68 | Data2vec-audio-large-960h | unverified | 12.94 | 2022 | Paper, Code, Source |
| 69 | Wav2vec2-large-robust-ft-libri-960h | unverified | 13.76 | 2021 | Paper, Code, Source |
| 70 | Whisper-tiny.en | unverified | 15.45 | 2022 | Paper, Code, Source |
| 71 | wav2vec 2.0 Large (960h) | unverified | 15.46 | 2020 | Paper, Code, Source |
| 72 | Data2vec-audio-base-960h | unverified | 15.48 | 2022 | Paper, Code, Source |
| 73 | Mms-1b-fl102 | unverified | 28.7 | 2023 | Paper, Code, Source |

WER (test-other)

Word error rate on the test-other split, which contains noisier and more heavily accented speech.

Lower is better
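
A single split-level score has to combine many utterances, and this page does not state how each source did so. The sketch below shows one common convention, pooling errors over all utterances before dividing by the total reference word count, and is an assumption rather than a description of how any listed score was computed. The names `edit_distance` and `corpus_wer` are hypothetical.

```python
from typing import Iterable, List, Tuple


def edit_distance(ref: List[str], hyp: List[str]) -> int:
    """Word-level Levenshtein distance (substitutions + deletions + insertions)."""
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost)
        prev = curr
    return prev[-1]


def corpus_wer(pairs: Iterable[Tuple[str, str]]) -> float:
    """Pooled WER in percent over (reference, hypothesis) transcript pairs."""
    total_errors = 0
    total_words = 0
    for reference, hypothesis in pairs:
        ref = reference.split()
        hyp = hypothesis.split()
        total_errors += edit_distance(ref, hyp)
        total_words += len(ref)
    if total_words == 0:
        raise ValueError("references contain no words")
    return 100.0 * total_errors / total_words


if __name__ == "__main__":
    pairs = [
        ("hello", "hullo"),  # 1 error in 1 word
        ("the quick brown fox jumps over the lazy dog",
         "the quick brown fox jumps over the lazy dog"),  # 0 errors in 9 words
    ]
    print(corpus_wer(pairs))
```

In this toy input the pooled score is 10% (1 error over 10 words), whereas averaging the two per-utterance rates would give 50%. Differences of this kind are one reason scores from different sources may not be directly comparable.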

Trust tiers for WER (test-other): verified, paper, vendor, community, unverified
| Rank | Model | Trust | Score | Year | Links | Notes |
|---|---|---|---|---|---|---|
| 01 | Universal-1 | verified | 3.10 | 2024 | Paper | Greedy decoding, no external LM. From AssemblyAI research page Table 1. |
| 02 | Parakeet TDT 0.6B v2 | paper | 3.19 | 2025 | Source | NVIDIA. 0.6B params. FastConformer-TDT. |
| 03 | wav2vec 2.0 Large (960h) | verified | 3.30 | 2020 | Paper | test-other WER (%). wav2vec 2.0 Large, 960h. Source: Table 3, arxiv:2006.11477 |
| 04 | Canary 1B v2 | paper | 3.56 | 2025 | Source | NVIDIA. 1B multilingual ASR+AST. Aug 2025. |
| 05 | Parakeet TDT 0.6B v3 | paper | 3.59 | 2025 | Source | NVIDIA. 0.6B params. Multilingual. Sep 2025. |
| 06 | Whisper Large v3 | paper | 3.60 | 2024 | Source | test-other WER (%). Whisper large-v3. Source: OpenAI model card / arxiv:2212.04356 |
| 07 | HuBERT Large (LS-960) | verified | 3.60 | 2021 | Paper | test-other WER (%). HuBERT Large, 960h. Source: Table 2, arxiv:2106.07447 |
| 08 | Canary-1B | verified | 3.80 | 2026 | Source | test-other WER (%). Canary-1B EN. Source: Table 2, arxiv:2310.09873 |
| 09 | Voxtral Mini 3B | paper | 4.08 | 2025 | Source | Mistral AI. 3B multimodal model. July 2025. |
| 10 | Google USM | verified | 4.10 | 2023 | Paper | test-other WER (%). Google USM 2B. Source: Table 3, arxiv:2303.01037 |
| 11 | Parakeet-CTC-1.1B | verified | 4.20 | 2026 | Source | test-other WER (%). Parakeet-CTC-1.1B. Source: Table 1, arxiv:2311.13251 |
| 12 | Whisper Large v2 | verified | 5.20 | 2026 | Source | test-other WER (%). Whisper large-v2. Source: Table 5, arxiv:2212.04356 |
| 13 | Pulse STT | verified | 5.83 | 2026 | Source | Smallest AI docs: English STT - ESB Dataset (Streaming), LibriSpeech Other WER. |
| 14 | Phi-4-multimodal-instruct | paper | 5.97 | 2025 | Source | Microsoft. 5.6B multimodal model. Feb 2025. |

WER (test-clean)

WER (test-clean) is the word error rate on LibriSpeech's test-clean split. Codesota tracks published model scores on this metric so readers can compare state-of-the-art results across sources and model families.

Lower is better

Trust tiers for WER (test-clean): verified, paper, vendor, community, unverified
| Rank | Model | Trust | Score | Year | Links | Notes |
|---|---|---|---|---|---|---|
| 01 | Whisper Large v2 | verified | 2.70 | 2022 | Paper | test-clean WER (%). Whisper large-v2. Source: Table 5, arxiv:2212.04356 |
| 02 | wav2vec 2.0 Large | paper | 2.90 | 2024 | Source | Meta. 317M params. Self-supervised pre-training on 60k hours of speech. Foundational SSL ASR model. |
| 03 | Pulse STT | verified | 3.22 | 2026 | Source | Smallest AI docs: English STT - ESB Dataset (Streaming), LibriSpeech Clean WER. |

§ 03 · Lineage

LibriSpeech in context.

Predecessors (0)
None. This is where the lineage begins.
This benchmark (1)
LibriSpeech · saturated · 2015-04
Successors (3)
VoxCeleb · active · 2017-06
VoxCeleb covers speaker identity, not transcription — a different task that addresses the 'who spoke' question LibriSpeech ignores. Speaker verification became a standard parallel track in speech evaluation.
CHiME-6 · active · 2020-04
LibriSpeech test-other saturated; CHiME-6's multi-speaker dinner-party setup was the first major challenge where clean-speech progress didn't transfer. Attention moved here once LibriSpeech test-other WER dropped below 4%.
GigaSpeech · active · 2021-06
GigaSpeech is a scale and diversity extension — 10× more data, multi-domain. A training and evaluation resource for robustness rather than a direct successor to LibriSpeech's narrow clean-speech task.