Maxwell Ashford. 2026. “Facilitating Cross-Domain Reasoning Generalization through Conservative Offline Reinforcement Learning Leveraging Pre-Trained Large Language Model Representations”. Computational Intelligence Systems 4 (1). https://doi.org/10.66280/cis.v1i1.196.