Recent studyBlind TTS Elo is live. Compare two anonymous voice samples, vote after listening, and help separate real preference signal from noise.Vote in the study ->
Codesota · Tasks · Object DetectionHome/Tasks/Computer Vision/Object Detection
Computer Vision· object-detection

Object Detection.

Object Detection is a computer vision task that involves identifying and localizing objects within an image. The goal is to detect instances or objects of a certain class (such as humans, buildings, or cars) in digital images and videos. Object detection models typically output a set of bounding boxes with corresponding predicted class names.

11
Datasets
104
Results
box-map
Canonical metric
§ 02 · Canonical benchmark

The reference dataset.

COCO

Microsoft COCO is the gold standard for large-scale object detection, segmentation, and captioning, with 330k+ images, 1.5M+ object instances, and 80 categories. Primary metric is box mAP averaged over 10 IoU thresholds (0.5:0.95).

Primary metric: box-map
View full leaderboard →
§ 03 · Top 10

Leading models.

Leading models on COCO.

#Modelbox-mapYearSource
ScyllaNet66.12026paper ↗
2DINOv3 + Plain-DETR + TTA66.12025paper ↗
3Co-DETR (Swin-L)66.02022paper ↗
4Co-DETR (Swin-L)66.02026paper ↗
5SenseTime Basemodel66.02026paper ↗
6CW_Detection66.02026paper ↗
7Co-DETR (Swin-L)66.02025paper ↗
8Thinker66.02026paper ↗
9DINOv3 + Plain-DETR65.62025paper ↗
10InternImage-H (OneFormer)65.52026paper ↗

What were you looking for on Object Detection?

Didn't find the model, metric, or dataset you needed? Tell us in one line. We read every message and reply within 48 hours.

§ 04 · All datasets

Tracked datasets.

11 datasets tracked for this task.

COCO
CANONICAL
79 results · box-map
Top: ScyllaNet 66.1
LVIS v1.0
16 results · mask-ap
Top: DINO-X 71.4
Pascal VOC 2012
9 results · mAP
Top: DINOv3 (7B) 86.6
COCO 2014 val
0 results
COCO test-dev
0 results
COCO val2017
0 results
DIOR
0 results
ImageNet Detection (ILSVRC DET)
0 results
ImageNet Localization (ILSVRC LOC)
0 results
PASCAL VOC 2007
0 results
Roboflow100-VL (RF100-VL)
0 results
§ 05 · Related tasks

Other tasks in Computer Vision.

3D Understanding3D generationDepth estimationFew-Shot Image ClassificationImage ClassificationImage editingImage generationImage segmentation
Reply within 48 hours · No newsletter

Didn't find what you came for?

Still looking for something on Object Detection? A missing model, a stale score, a benchmark we should cover — drop it here and we'll handle it.

Real humans read every message. We track what people are asking for and prioritize accordingly.