Codesota · Models · InternVL3-78BShanghai AI Lab3 results · 3 benchmarks
Model card

InternVL3-78B.

Shanghai AI Labopen-source78B paramsVision-Language Model
§ 01 · Benchmarks

Every benchmark InternVL3-78B has a recorded score for.

#BenchmarkArea · TaskMetricValueRankDateSource
01MMBenchMultimodal · Visual Question Answeringaccuracy90.1%#2/82025-01-22source ↗
02MME-VideoOCRComputer Vision · General OCR Capabilitiestotal-accuracy67.2%#3/6source ↗
03MMMUMultimodal · Visual Question Answeringaccuracy73.3%#8/182025-01-22unverified
Rank column shows this model’s position vs all other models scored on the same benchmark + metric (competitors after the slash). #1 in red means current SOTA. Sorted by rank, then newest result.
§ 02 · Strengths by area

Where InternVL3-78B actually performs.

Computer Vision
1
benchmark
avg rank #3.0
Multimodal
2
benchmarks
avg rank #5.0
§ 03 · Papers

1 paper with results for InternVL3-78B.

  1. 2025-01-22· Multimodal· 2 results

    InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

§ 04 · Related models

Other Shanghai AI Lab models scored on Codesota.

InternImage-H
2 results · 1 SOTA
InternVL2-76B
76B params · 4 results
InternImage-H
Unknown params · 1 result
InternVL3-76B
1 result
InternVL3.5-241B
1 result
InternImage-XL
0 results
TCP
0 results
§ 05 · Sources & freshness

Where these numbers come from.

arxiv
2
results
alphaxiv-leaderboard
1
result
1 of 3 rows marked verified.