Computer Visionimage-classification

Image Classification

Image classification is the task that launched modern deep learning — AlexNet's 2012 ImageNet win cut error rates in half overnight and triggered the entire neural network renaissance. The progression from VGGNet to ResNet to Vision Transformers traces the intellectual history of the field itself. Today's frontier models like EVA-02 and SigLIP push top-1 accuracy above 91% on ImageNet, but the real action has shifted to efficiency (MobileNet, EfficientNet) and robustness under distribution shift. Still the default benchmark for new architectures, and the foundation that every other vision task builds on.

4
Datasets
44
Results
top-1-accuracy
Canonical metric
Canonical Benchmark

ImageNet-1K

1.28M training images, 50K validation images across 1,000 object classes. The standard benchmark for image classification since 2012.

Primary metric: top-1-accuracy
View full leaderboard

Top 10

Leading models on ImageNet-1K.

RankModeltop-1-accuracyYearSource
1
coca-finetuned
91.02025paper
2
vit-g-14
90.52025paper
3
EVA-02-L
90.12026paper
4
EVA-Giant
89.82026paper
5
InternImage-H
89.62026paper
6
SigLIP-SO400M
89.42026paper
7
convnext-v2-huge
88.92025paper
8
ViT-H/14 CLIP (LAION-2B)
88.62026paper
9
ConvNeXt-XXLarge (CLIP LAION)
88.62026paper
10
vit-h-14
88.52025paper

All datasets

4 datasets tracked for this task.

Related tasks

Other tasks in Computer Vision.

Run Inference

Looking to run a model? HuggingFace hosts inference for this task type.

HuggingFace