ProRAG: Process-Supervised Reinforcement Learning for Retrieval-Augmented Generation
Paper • Jan 29, 2026 • arxiv.org • Zhao Wang, Ziliang Zhao, Zhicheng Dou
Reinforcement learning (RL) has become a promising paradigm for optimizing Retrieval-Augmented Generation (RAG) in complex reasoning tasks. However, traditional outcome-based RL approaches often su...