Maxwell Ashford. “Facilitating Cross-Domain Reasoning Generalization through Conservative Offline Reinforcement Learning Leveraging Pre-Trained Large Language Model Representations”. Computational Intelligence Systems 4, no. 1 (May 19, 2026). Accessed July 12, 2026. https://scivexus.org/index.php/CIS/article/view/196.