RexBERT: Encoders for a brave new world of E-Commerce
By
and 1 other
•
•
33Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!
By
and 1 other
•
•
58AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models
By
and 4 others
•
•
14mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL
By
and 1 other
•
•
21"Anemll-style" Root-Mean-Square (RMS) Normalization on the Apple Neural Engine: A Simple Hack
By
•
•
10Unleashing the Full Potential of ERNIE4.5 using FastDeploy
By
and 3 others
•
•
10How to Train an Antibody Developability Model
By
and 1 other
•
•
9🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎
By
and 1 other
•
•
9SyGra: The One-Stop Framework for Building Data for LLMs and SLMs
By
and 3 others
•
•
9How to Choose the Best Open Source LLM for Your Project in 2025
By
•
•
70Small Language Models (SLM): A Comprehensive Overview
By
•
•
72Code a simple RAG from scratch
By
•
•
199DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
By
•
•
221Use AI on Your PC: Optimize and Deploy a Multimodal Agentic Pipeline on AI PC Powered by Intel
By
and 2 others
•
•
5Finegrain Product Placement LoRA (experiment)
By
•
•
5Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment
By
•
•
69Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm
By
and 5 others
•
•
91From GRPO to DAPO and GSPO: What, Why, and How
By
•
•
30🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders
By
and 1 other
•
•
13Native FP8 Mixed Precision Training for Ling 2.0, Open Sourced!
By
•
•
4