Recent studyBlind TTS Elo is live. Compare two anonymous voice samples, vote after listening, and help separate real preference signal from noise.Vote in the study ->
Codesota · Tasks · Semantic SegmentationHome/Tasks/Computer Vision/Semantic Segmentation
Computer Vision· image-segmentation

Semantic Segmentation.

Semantic segmentation assigns a class label to every pixel — the dense prediction problem that underpins autonomous driving, medical imaging, and satellite analysis. FCN (2015) showed you could repurpose classifiers for pixel labeling, DeepLab introduced atrous convolutions and CRFs, and SegFormer (2021) proved transformers dominate here too. State-of-the-art on Cityscapes exceeds 85 mIoU, but ADE20K with its 150 classes remains brutally challenging. The frontier has moved toward universal segmentation models like Mask2Former that handle semantic, instance, and panoptic segmentation in a single architecture.

2
Datasets
24
Results
mIoU
Canonical metric
§ 02 · Canonical benchmark

The reference dataset.

ADE20K

20K training, 2K validation images annotated with 150 object categories. Complex scene parsing benchmark.

Primary metric: mIoU
View full leaderboard →
§ 03 · Top 10

Leading models.

Leading models on ADE20K.

#ModelmIoUYearSource
InternImage-H62.92025paper ↗
2BEiT-3 (ViT-L)62.82026paper ↗
3DINOv3 + Mask2Former (simple) 62.62025paper ↗
4DINOv2 (ViT-g) + Linear62.02026paper ↗
5EoMT (ViT-L)58.42025paper ↗
6BEiT-L+57.92021paper ↗
7Mask2Former (Swin-L)57.32025paper ↗
8Mask2Former (Swin-L)57.32026paper ↗
9OneFormer (Swin-L)57.02022paper ↗
10Mask2Former + Swin-L-FaPN56.42021paper ↗

What were you looking for on Semantic Segmentation?

Didn't find the model, metric, or dataset you needed? Tell us in one line. We read every message and reply within 48 hours.

§ 04 · All datasets

Tracked datasets.

2 datasets tracked for this task.

ADE20K
CANONICAL
21 results · mIoU
Top: InternImage-H 62.9
Cityscapes
3 results · mIoU
Top: EoMT (ViT-L) 84.2
§ 05 · Related tasks

Other tasks in Computer Vision.

Document Image ClassificationDocument Layout AnalysisDocument ParsingDocument UnderstandingGeneral OCR CapabilitiesHandwriting RecognitionImage Feature ExtractionImage-to-3D
Reply within 48 hours · No newsletter

Didn't find what you came for?

Still looking for something on Semantic Segmentation? A missing model, a stale score, a benchmark we should cover — drop it here and we'll handle it.

Real humans read every message. We track what people are asking for and prioritize accordingly.