swiss-ai/Apertus-8B-Instruct-2509 Text Generation • 8B • Updated 15 days ago • 310k • 404
FreedomIntelligence/medical-o1-verifiable-problem Viewer • Updated Dec 30, 2024 • 40.6k • 344 • 117
🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 174
nvidia/Nemotron-Research-Reasoning-Qwen-1.5B Text Generation • 2B • Updated 7 days ago • 7.57k • 235
Discovering Preference Optimization Algorithms with and for Large Language Models Paper • 2406.08414 • Published Jun 12, 2024 • 16