Maxwell Ashford (2026) “Facilitating Cross-Domain Reasoning Generalization through Conservative Offline Reinforcement Learning Leveraging Pre-trained Large Language Model Representations”, Computational Intelligence Systems, 4(1). doi: 10.66280/cis.v1i1.196.