Avi66
AI & ML interests
ML Research, LLMs, Applications
MultiModality
Recent Activity
Updated a collection about 20 hours ago: Vlm
Updated a collection about 20 hours ago: TTS
Updated a collection about 24 hours ago: TTS
Organizations
Vlm
- XiaomiMiMo/MiMo-VL-7B-RL
  Image-Text-to-Text • 8B • Updated • 19.8k • 155
- mradermacher/Janus-Pro-7B-LM-GGUF
  7B • Updated • 1.31k • 36
- deepseek-ai/deepseek-vl2-tiny
  Image-Text-to-Text • 3B • Updated • 60.4k • 205
- RedHatAI/Llama-3.2-11B-Vision-Instruct-FP8-dynamic
  Text Generation • 11B • Updated • 1.32k • 24
Spaces
Papers
- ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
  Paper • 2504.11536 • Published • 61
- Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
  Paper • 2505.24726 • Published • 267
- Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
  Paper • 2503.12605 • Published • 36
- MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
  Paper • 2506.13585 • Published • 260
Tamil llm
TTS