Routine: A Structural Planning Framework for LLM Agent System in Enterprise Paper • 2507.14447 • Published 10 days ago • 1
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8 Text Generation • 235B • Updated 3 days ago • 986 • 23
Article Fast LoRA inference for Flux with Diffusers and PEFT By sayakpaul and 1 other • 6 days ago • 27
Qwen/Qwen3-Coder-480B-A35B-Instruct Text Generation • 480B • Updated 5 days ago • 10.1k • 826
Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • 20 days ago • 613
Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search Paper • 2503.04412 • Published Mar 6 • 4
HIRAG: Hierarchical-Thought Instruction-Tuning Retrieval-Augmented Generation Paper • 2507.05714 • Published 21 days ago • 1
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published 27 days ago • 197
WebSailor: Navigating Super-human Reasoning for Web Agent Paper • 2507.02592 • Published 25 days ago • 99
zai-org/GLM-4.1V-9B-Thinking Image-Text-to-Text • 10B • Updated 20 days ago • 81.8k • 673