Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever. By MaziyarPanahi • 11 days ago • 121
OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models By nvidia and 3 others • 9 days ago • 45
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 191
Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models By AI-MO and 17 others • 17 days ago • 46
AI Companionship: Why We Need to Evaluate How AI Systems Handle Emotional Bonds By giadap and 2 others • 6 days ago • 4
RealPerformance, A Dataset of Language Model Business Compliance Issues By davidberenstein1957 and 1 other • 6 days ago • 4
Detecting Beyond Sight: Building AI-Enabled SAR Intelligence with Synthetic Data By DualityAI-RebekahBogdanoff • 1 day ago • 4
Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever. By MaziyarPanahi • 11 days ago • 121
OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models By nvidia and 3 others • 9 days ago • 45
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 191
Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models By AI-MO and 17 others • 17 days ago • 46
AI Companionship: Why We Need to Evaluate How AI Systems Handle Emotional Bonds By giadap and 2 others • 6 days ago • 4
RealPerformance, A Dataset of Language Model Business Compliance Issues By davidberenstein1957 and 1 other • 6 days ago • 4
Detecting Beyond Sight: Building AI-Enabled SAR Intelligence with Synthetic Data By DualityAI-RebekahBogdanoff • 1 day ago • 4