ADITYA M. ROY; AAKASH D. MISHRA. Reinforcement Learning-Based Reasoning Optimization for Large Language Models in Complex Decision-Making Systems. Computational Intelligence Systems, [S. l.], v. 3, n. 1, 2025. Disponível em: https://www.scivexus.org/index.php/CIS/article/view/344. Acesso em: 8 jun. 2026.