Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models Paper • 2601.19834 • Published 9 days ago • 25
Audio-Visual World Models: Towards Multisensory Imagination in Sight and Sound Paper • 2512.00883 • Published Nov 30, 2025
Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models Paper • 2601.19834 • Published 9 days ago • 25
RLVR-World: Training World Models with Reinforcement Learning Paper • 2505.13934 • Published May 20, 2025 • 16