view article Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR By lightonai and 2 others • 10 days ago • 55
nablaNABLA: Neighborhood Adaptive Block-Level Attention Paper • 2507.13546 • Published Jul 17 • 123
view article Article Understanding Gemma 3n: How MatFormer Gives You Many Models in One By rishiraj • Jun 26 • 48
Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding Paper • 2506.16035 • Published Jun 19 • 88
view article Article Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub Jun 12 • 148
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning Paper • 2505.16933 • Published May 22 • 34
view article Article Open-Source Handwritten Signature Detection Model By samuellimabraz • Mar 14 • 119
Running 3.4k 3.4k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
view article Article From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning By NormalUhr • Feb 4 • 16