Codesota · Tasks · Text-to-3DHome/Tasks/Computer Vision/Text-to-3D

Computer Vision· text-to-3d

Text-to-3D.

Text-to-3D generates 3D assets — meshes, NeRFs, or Gaussian splats — from text descriptions alone, a capability that barely existed before DreamFusion (2022) showed score distillation sampling could lift 2D diffusion priors into 3D. The field moves at breakneck speed: Magic3D added coarse-to-fine generation, Instant3D achieved single-shot inference, and Meshy and Tripo brought commercial quality. Multi-view consistency remains the core challenge — the "Janus problem" where different viewpoints produce contradictory details. The promise of democratizing 3D content creation for games, VR, and e-commerce is driving massive investment.

1

Datasets

0

Results

composite

Canonical metric

§ 02 · Canonical benchmark

The reference dataset.

T3Bench

Evaluates text-to-3D generation quality and text alignment

Primary metric: composite

View full leaderboard →

§ 03 · Top 10

Leading models.

Leading models on T3Bench.

No results yet. Be the first to contribute.

What were you looking for on Text-to-3D?

Didn't find the model, metric, or dataset you needed? Tell us in one line. We read every message and reply within 48 hours.

§ 04 · All datasets

Tracked datasets.

1 dataset tracked for this task.

0 results · composite

§ 05 · Related tasks

Other tasks in Computer Vision.

Document Image Classification Document Layout Analysis Document Parsing Document Understanding General OCR Capabilities Handwriting Recognition Image Feature Extraction Image-to-3D

Reply within 48 hours · No newsletter

Didn't find what you came for?

Still looking for something on Text-to-3D? A missing model, a stale score, a benchmark we should cover — drop it here and we'll handle it.

Real humans read every message. We track what people are asking for and prioritize accordingly.