-
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
Paper • 2504.11536 • Published • 61 -
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper • 2505.24726 • Published • 264 -
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Paper • 2503.12605 • Published • 36 -
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
Paper • 2506.13585 • Published • 259
Av
Avi66
·
AI & ML interests
ML Research , LLMs , Applications
MultiModality
Recent Activity
updated
a collection
about 2 hours ago
Vlm
updated
a collection
about 2 hours ago
TTS
updated
a collection
about 6 hours ago
TTS