PERK: Long-Context Reasoning as Parameter-Efficient Test-Time Learning Paper ⢠2507.06415 ⢠Published 19 days ago ⢠6
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Paper ⢠2507.05687 ⢠Published 20 days ago ⢠26
4KAgent: Agentic Any Image to 4K Super-Resolution Paper ⢠2507.07105 ⢠Published 18 days ago ⢠89
A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality Paper ⢠2507.07202 ⢠Published 18 days ago ⢠22
LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS Paper ⢠2507.07136 ⢠Published 19 days ago ⢠34
OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding Paper ⢠2507.07984 ⢠Published 17 days ago ⢠38
Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs Paper ⢠2507.07990 ⢠Published 17 days ago ⢠44
view article Article Upskill your LLMs with Gradio MCP Servers By freddyaboulton ⢠19 days ago ⢠17
view article Article Building the Hugging Face MCP Server By evalstate and 3 others ⢠18 days ago ⢠51
RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy Paper ⢠2503.24388 ⢠Published Mar 31 ⢠31
TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization Paper ⢠2503.19901 ⢠Published Mar 25 ⢠41
Efficient Inference for Large Reasoning Models: A Survey Paper ⢠2503.23077 ⢠Published Mar 29 ⢠47
Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model Paper ⢠2503.24290 ⢠Published Mar 31 ⢠63
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes Paper ⢠2503.23461 ⢠Published Mar 30 ⢠95
MoCha: Towards Movie-Grade Talking Character Synthesis Paper ⢠2503.23307 ⢠Published Mar 30 ⢠136