Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!
PP-OCRv5 on Hugging Face: A Specialized Approach to OCR
How to Choose the Best Open Source LLM for Your Project in 2025
mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL
AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models
"Anemll-style" Root-Mean-Square (RMS) Normalization on the Apple Neural Engine: A Simple Hack
Code a simple RAG from scratch
Unleashing the Full Potential of ERNIE4.5 using FastDeploy
Small Language Models (SLM): A Comprehensive Overview
How to Train an Antibody Developability Model
What kind of environmental impacts are AI companies disclosing? (And can we compare them?)
Finegrain Product Placement LoRA (experiment)
Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth
Use AI on Your PC: Optimize and Deploy a Multimodal Agentic Pipeline on AI PC Powered by Intel
Decoding Strategies in Large Language Models
KV Caching Explained: Optimizing Transformer Inference Efficiency
Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm
Diffusion Language Models: The New Paradigm
Fine-tune Any LLM from the Hugging Face Hub with Together AI
BioClinical ModernBERT: an example of continued pre-training of ModernBERT