Article huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning Oct 27 • 68
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13 • 174
Language Models Can Learn from Verbal Feedback Without Scalar Rewards Paper • 2509.22638 • Published Sep 26 • 67
A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10 • 188
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model Paper • 2509.00676 • Published Aug 31 • 83
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8 • 190
Article What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models Aug 4 • 28
Article Introducing AI Sheets: a tool to work with datasets using open AI models! Aug 8 • 105