Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs Paper β’ 2505.15210 β’ Published May 21 β’ 18
Running on CPU Upgrade Featured 2.7k The Smol Training Playbook π 2.7k The secrets to building world-class LLMs
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents Paper β’ 2506.03143 β’ Published Jun 3 β’ 53
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper β’ 2411.04905 β’ Published Nov 7, 2024 β’ 127
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning Paper β’ 2411.02337 β’ Published Nov 4, 2024 β’ 36
Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models Paper β’ 2411.00492 β’ Published Nov 1, 2024 β’ 6
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper β’ 2305.18290 β’ Published May 29, 2023 β’ 64
Unbounded: A Generative Infinite Game of Character Life Simulation Paper β’ 2410.18975 β’ Published Oct 24, 2024 β’ 37
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise Paper β’ 2410.03017 β’ Published Oct 3, 2024 β’ 29
Aria: An Open Multimodal Native Mixture-of-Experts Model Paper β’ 2410.05993 β’ Published Oct 8, 2024 β’ 111
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free Paper β’ 2410.10814 β’ Published Oct 14, 2024 β’ 51