ZeroSearch: Incentivize the Search Capability of LLMs without Searching • Paper • 2505.04588 • Published May 7 • 65
Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model • Paper • 2503.24290 • Published Mar 31 • 62
MaziyarPanahi/Llama-Nemotron-Post-Training-Dataset-v1-ShareGPT • Viewer • Updated Jun 2 • 30.2M • 979 • 40