Model card

ViTDet-H.

Meta AI · open-source · Unknown params · Plain ViT-Huge + Cascade Mask R-CNN

Plain, non-hierarchical ViT backbone for object detection. 53.4 box AP on LVIS v1.0. ECCV 2022.

§ 01 · Benchmarks

Every benchmark ViTDet-H has a recorded score for.

# | Benchmark | Area · Task | Metric | Value | Rank | Date | Source
01 | LVIS v1.0 | Computer Vision · Object Detection | mask-ap | 53.4% | #7/9 | | source

The Rank column shows this model's position among all models scored on the same benchmark and metric; the number after the slash counts the competitors. A #1 in red marks the current SOTA. Rows are sorted by rank, then by newest result.

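The ordering rule in the note above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Row` type, `parse_rank`, and `sort_rows` are hypothetical names, not part of Codesota, and treating an undated row as the oldest within its rank is an assumption the page does not state.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional, Tuple


@dataclass
class Row:
    """One benchmark row (hypothetical structure, for illustration only)."""
    benchmark: str
    metric: str
    rank_cell: str                   # as displayed, e.g. "#7/9"
    recorded: Optional[date] = None  # the Date column; may be empty


def parse_rank(cell: str) -> Tuple[int, int]:
    """Split a Rank cell such as '#7/9' into (7, 9)."""
    position, _, after_slash = cell.lstrip("#").partition("/")
    return int(position), int(after_slash)


def sort_rows(rows: List[Row]) -> List[Row]:
    """Sort by rank ascending, then by newest result first.

    Assumption: a row without a date sorts last within its rank.
    """
    def key(row: Row) -> Tuple[int, int]:
        position, _ = parse_rank(row.rank_cell)
        recorded = row.recorded or date.min
        return position, -recorded.toordinal()

    return sorted(rows, key=key)


# The first row comes from the table above; the other two are made-up
# placeholders that only exist to exercise the tie-break on date.
rows = [
    Row("LVIS v1.0", "mask-ap", "#7/9"),
    Row("Placeholder A", "metric-a", "#2/5", date(2023, 1, 1)),
    Row("Placeholder B", "metric-b", "#2/8", date(2024, 1, 1)),
]
ordered = sort_rows(rows)
```

With these rows, the two placeholder entries tie on rank, so the one with the later date comes first, and the LVIS v1.0 row follows because its rank position is larger.
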
§ 02 · Strengths by area

Where ViTDet-H actually performs.

Computer Vision: 1 benchmark · avg rank #7.0

§ 04 · Related models

Other Meta AI models scored on Codesota.

GENRE: 1 result · 1 SOTA
SeamlessM4T v2 Large: 2.3B params · 1 result · 1 SOTA
DINOv2 (ViT-g) + Linear: Unknown params · 1 result
Fairseq S2T (MuST-C): ~150M params · 1 result
Mask2Former (Swin-L): Unknown params · 1 result
MusicGen Large: 3.3B params · 1 result
Voicebox: 330M params · 1 result
convnext_base.fb_in22k_ft_in1k: 1 result

§ 05 · Sources & freshness

Where these numbers come from.

arXiv: 1 result
0 of 1 rows marked verified.