Lars D. Welch, Sven Watkins, Tarun M. Raman, and Massimo Wagner. “Multi-Modal Robotic World Modeling via Physically Consistent Video Generation and Cross-View Representation Alignment”. Computational Intelligence Systems 4, no. 1 (May 15, 2026). Accessed July 12, 2026. https://scivexus.org/index.php/CIS/article/view/365.