黄炜锴

tsrigo

AI & ML interests

Trustworthy AI

Recent Activity

upvoted a paper 6 days ago

Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning

Paper • 2511.19900 • Published 7 days ago • 46

upvoted a paper 7 days ago

Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO

Paper • 2511.13288 • Published 15 days ago • 17

liked a dataset 21 days ago

meta-agents-research-environments/gaia2

Viewer • Updated Sep 25 • 963 • 5.09k • 36

liked a Space 21 days ago

Gaia2 Agents Evaluation Leaderboard

🐠

Display and submit model evaluation results on a leaderboard

upvoted an article 21 days ago

Gaia2 and ARE: Empowering the community to study agents

Article • Sep 22 • 120

upvoted a collection 2 months ago

DeepSeek-V3.2

Collection

4 items • Updated 1 day ago • 477

upvoted a paper 7 months ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 98

liked a model 10 months ago

tsrigo/coconut

Updated Feb 15 • 1

updated a model 10 months ago

tsrigo/coconut

Updated Feb 15 • 1

published a model 10 months ago

tsrigo/coconut

Updated Feb 15 • 1

updated a model 10 months ago

tsrigo/unsloth_model

Text Generation • 3B • Updated Feb 13 • 2

published a model 10 months ago

tsrigo/unsloth_model

Text Generation • 3B • Updated Feb 13 • 2

updated a model 11 months ago

tsrigo/Qwen2.5-1.5B-Instruct-DPO-bad-boy

2B • Updated Jan 20 • 3

published a model 11 months ago

tsrigo/Qwen2.5-1.5B-Instruct-DPO-bad-boy

2B • Updated Jan 20 • 3

updated 2 datasets 11 months ago

tsrigo/btfChinese-DPO-small

Viewer • Updated Jan 20 • 5k • 8

tsrigo/btfChinese-DPO-small

Viewer • Updated Jan 20 • 5k • 8

published a dataset 11 months ago

tsrigo/btfChinese-DPO-small

Viewer • Updated Jan 20 • 5k • 8

updated a model 11 months ago

tsrigo/Qwen2.5-0.5B-Instruct-DPO-bad-boy

Updated Jan 20

published a model 11 months ago

tsrigo/Qwen2.5-0.5B-Instruct-DPO-bad-boy

Updated Jan 20

upvoted a collection 11 months ago

LLM Reasoning Papers

Collection

Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15 • 123
