Lars D. Welch, et al. “Multi-Modal Robotic World Modeling via Physically Consistent Video Generation and Cross-View Representation Alignment”. Computational Intelligence Systems, vol. 4, no. 1, May 2026, https://scivexus.org/index.php/CIS/article/view/365.