Lars D. Welch, Sven Watkins, Tarun M. Raman, & Massimo Wagner. (2026). Multi-Modal Robotic World Modeling via Physically Consistent Video Generation and Cross-View Representation Alignment. Computational Intelligence Systems, 4(1). Retrieved from https://scivexus.org/index.php/CIS/article/view/365