Motoki Wu PRO

tokestermw

https://motoki.co

Recent Activity

liked a model 1 day ago

MiniMaxAI/MiniMax-M2

Text Generation • 229B • Updated 4 days ago • 530k • 914

liked 2 models 3 days ago

fixie-ai/ultravox-v0_6-llama-3_1-8b

Audio-Text-to-Text • 0.7B • Updated Jul 5 • 11.4k • 2

fixie-ai/ultraVAD

Feature Extraction • 0.7B • Updated Sep 3 • 799 • 29

liked a model 5 days ago

Qwen/Qwen3-30B-A3B

Text Generation • 31B • Updated Jul 26 • 421k • 808

liked a model 10 days ago

PokeeAI/pokee_research_7b

Text Generation • 8B • Updated 10 days ago • 5.57k • 95

liked a model 12 days ago

ByteDance/Dolphin-1.5

Image-Text-to-Text • 0.4B • Updated 16 days ago • 5.67k • 17

liked a model 13 days ago

zeroentropy/zerank-1

Text Ranking • 4B • Updated Jul 24 • 920 • 62

liked a model 19 days ago

inclusionAI/Ling-1T

Text Generation • 1000B • Updated 5 days ago • 4.51k • 501

upvoted a paper 23 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 306

upvoted an article 23 days ago

Article

mem-agent: Equipping LLM Agents with Memory Using RL

and 1 other • 24 days ago • 32

upvoted a paper 25 days ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published 27 days ago • 112

upvoted a collection about 1 month ago

Qwen3-Omni

Collection

6 items • Updated 24 days ago • 162

upvoted 2 papers about 2 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 189

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 219

upvoted 3 papers 2 months ago

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Paper • 2508.21113 • Published Aug 28 • 109

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

Paper • 2508.16949 • Published Aug 23 • 22

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22 • 154

liked a Space 2 months ago

601

Sheets

🗂

Create and enrich datasets with AI

liked a model 2 months ago

xai-org/grok-2

Updated Aug 24 • 9.53k • 977