Recent studyBlind TTS Elo is live. Compare two anonymous voice samples, vote after listening, and help separate real preference signal from noise.Vote in the study ->
Codesota · Tasks · Music GenerationHome/Tasks/Audio/Music Generation

Music Generation.

Generating music from text, audio, or other inputs.

1
Datasets
3
Results
fad
Canonical metric
§ 02 · Canonical benchmark

The reference dataset.

MusicCaps

Music generation evaluated on 5.5K expert-annotated music clips

Primary metric: fad
View full leaderboard →
§ 03 · Top 10

Leading models.

Leading models on MusicCaps.

#ModelfadYearSource
MusicLM4.002026paper ↗
2MusicGen Large3.802026paper ↗
3Noise2Music2.132026paper ↗

What were you looking for on Music Generation?

Didn't find the model, metric, or dataset you needed? Tell us in one line. We read every message and reply within 48 hours.

§ 04 · All datasets

Tracked datasets.

1 dataset tracked for this task.

MusicCaps
CANONICAL
3 results · fad
Top: MusicLM 4.00
§ 05 · Related tasks

Other tasks in Audio.

Audio CaptioningAudio-to-AudioSound Event DetectionText-to-AudioVoice Activity Detection
Reply within 48 hours · No newsletter

Didn't find what you came for?

Still looking for something on Music Generation? A missing model, a stale score, a benchmark we should cover — drop it here and we'll handle it.

Real humans read every message. We track what people are asking for and prioritize accordingly.