Recent Papers / arXiv:2606.09169
IMUG-Bench: Benchmarking Unified Multimodal Models on Interleaved Understanding and Generation
Authors pending
Abstract
3,113 samples across static, temporal, and hybrid tasks; exposes exposure bias.
Tasks
editResults
No benchmark results recorded yet.
Benchmark results referencing this paper haven't been added to the registry yet. If you have a reproduction, submit it →
CodeSOTA extraction
Benchmark evidence
- IMUG-Bench: confirm exposure bias gap between understanding and generation in multi-turn settings