When Does Reasoning Matter? Unpacking the Contribution of Reasoning to LLM Performance By Nicolas-BZRD and 1 other • 2 days ago • 10
CU-1 for Autonomous UI Agent Systems: An Open Alternative to Proprietary Solutions By paulml • about 19 hours ago • 10
How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons By sherryxychen • 2 days ago • 9
Ground-up efforts to build large datasets for effective and accurate translation of Modi-Script documents into modern Marathi By Arunbiz and 1 other • 7 days ago • 6
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • Feb 11 • 72
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 225