view article Article Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨ By Wauplin and 2 others • 3 days ago • 32
view article Article Back to The Future: Evaluating AI Agents on Predicting Future Events By vinid and 6 others • 11 days ago • 26
view article Article Arc Virtual Cell Challenge: A Primer By FL33TW00D-HF and 1 other • 10 days ago • 39
The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements Paper • 2506.22419 • Published about 1 month ago • 14
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published Apr 29 • 97
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation Paper • 2506.20639 • Published Jun 25 • 27
Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge Paper • 2506.21506 • Published Jun 26 • 49
USAD: Universal Speech and Audio Representation via Distillation Paper • 2506.18843 • Published Jun 23 • 11
TabArena: A Living Benchmark for Machine Learning on Tabular Data Paper • 2506.16791 • Published Jun 20 • 2
view article Article (LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware By derekl35 and 4 others • Jun 19 • 79
EmoNet-Face: An Expert-Annotated Benchmark for Synthetic Emotion Recognition Paper • 2505.20033 • Published May 26 • 4
TabICL: A Tabular Foundation Model for In-Context Learning on Large Data Paper • 2502.05564 • Published Feb 8 • 1
Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity Paper • 2506.09250 • Published Jun 10 • 28