Model card
GPT-4V.
Unknown · multimodal · Unknown params · Transformer
GPT-4 with Vision, the first major multimodal GPT-4 release (September 2023). Evaluated on MMBench, TextVQA, VQA v2.0, and MMMU. Source: GPT-4 Technical Report.
§ 01 · Benchmarks
Every benchmark GPT-4V has a recorded score for.
| # | Benchmark | Area · Task | Metric | Value | Rank | Date | Source |
|---|---|---|---|---|---|---|---|
| 01 | MMBench | Multimodal · Visual Question Answering | accuracy | 75.8% | #6 | 2023-03-15 | source ↗ |
| 02 | TextVQA | Multimodal · Visual Question Answering | accuracy | 78.0% | #6 | 2023-03-15 | source ↗ |
| 03 | VQA v2.0 | Multimodal · Visual Question Answering | accuracy | 77.2% | #7 | 2023-03-15 | source ↗ |
| 04 | MMMU | Multimodal · Visual Question Answering | accuracy | 56.8% | #18 | 2023-03-15 | source ↗ |
The Rank column shows this model's position among all models scored on the same benchmark and metric. A rank of #1 marks the current SOTA. Rows are sorted by rank, then by newest result.
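The ordering rule described above can be sketched in a few lines of Python. This is only an illustration, not the site's actual code: the row tuples are copied from the table, while `sort_rows`, `rank_of`, and the "count strictly higher scores" ranking rule are hypothetical names and assumptions introduced here (the page does not say how ties are ranked or broken).

```python
from datetime import date

# Rows copied from the table above:
# (benchmark, metric, value in %, rank, result date)
rows = [
    ("MMMU", "accuracy", 56.8, 18, date(2023, 3, 15)),
    ("VQA v2.0", "accuracy", 77.2, 7, date(2023, 3, 15)),
    ("MMBench", "accuracy", 75.8, 6, date(2023, 3, 15)),
    ("TextVQA", "accuracy", 78.0, 6, date(2023, 3, 15)),
]


def sort_rows(rows):
    """Sort by rank ascending, then by newest result first.

    Python's sort is stable, so rows with equal rank and date keep
    their input order (an assumption; the page does not specify this).
    """
    return sorted(rows, key=lambda r: (r[3], -r[4].toordinal()))


def rank_of(score, all_scores):
    """Hypothetical ranking rule for a higher-is-better metric:
    1 + the number of strictly higher scores on the same benchmark."""
    return 1 + sum(1 for s in all_scores if s > score)


ordered = [r[0] for r in sort_rows(rows)]
print(ordered)
```

With the four rows above, the two rank-6 benchmarks come first, followed by VQA v2.0 and MMMU. `rank_of` needs the scores of every competing model on a benchmark, which this page does not list, so it is shown only to make the rule concrete.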
§ 03 · Papers
1 paper with results for GPT-4V.
- 2023-03-15 · Natural Language Processing · 4 results
GPT-4 Technical Report
§ 04 · Related models
Other Unknown models scored on Codesota.
fglihai
Unknown params · 6 results · 1 SOTA
CLIP4STR-L
Unknown params · 1 result · 1 SOTA
USYD NLP_CS29-2
Unknown params · 6 results
Corner-based Region Proposals
Unknown params · 3 results
EAST + VGG16
Unknown params · 3 results
SSTD
Unknown params · 3 results
TextBoxes++_MS
Unknown params · 3 results
WordSup (VGG16-synth-coco)
Unknown params · 3 results
§ 05 · Sources & freshness
Where these numbers come from.
arXiv · 4 results
4 of 4 rows marked verified.