yuzhe gu's picture

yuzhe gu

vanilla1116

·

https://guyuzhe.site/

Liqu1d-G

AI & ML interests

LLM; Hallucination; Self-Improvement

Recent Activity

authored a paper 6 days ago

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning

upvoted a paper 6 days ago

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning

commented on a paper 6 days ago

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning

View all activity

Organizations

authored a paper 6 days ago

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning

Paper • 2507.16814 • Published 6 days ago • 22

upvoted a paper 6 days ago

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning

Paper • 2507.16814 • Published 6 days ago • 22

commented a paper 6 days ago

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning

Paper • 2507.16814 • Published 6 days ago • 22 •

authored a paper 11 days ago

The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner

Paper • 2507.13332 • Published 11 days ago • 46

upvoted a paper 11 days ago

The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner

Paper • 2507.13332 • Published 11 days ago • 46

commented a paper 11 days ago

The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner

Paper • 2507.13332 • Published 11 days ago • 46 •

liked a Space 3 months ago

Open LMM Subjective Leaderboard

VLMEvalKit Subjectivce Benchmark Results

upvoted 2 papers 3 months ago

MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space

Paper • 2504.13835 • Published Apr 18 • 38

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 279

commented a paper 4 months ago

RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy

Paper • 2503.24388 • Published Mar 31 • 31 •

upvoted 2 papers 4 months ago

RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy

Paper • 2503.24388 • Published Mar 31 • 31

Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM

Paper • 2503.14478 • Published Mar 18 • 49

updated a dataset 5 months ago

opencompass/anah

Viewer • Updated Mar 13 • 783 • 86 • 3

New activity in opencompass/anah 5 months ago

Update dataset card, link to paper, add category

#2 opened 5 months ago by

New activity in opencompass/anah-7b 5 months ago

Add missing metadata and clarify license

#1 opened 5 months ago by

New activity in opencompass/anah-20b 5 months ago

Add missing metadata: `pipeline_tag`, `library_name`, and `license`

#1 opened 5 months ago by

New activity in opencompass/anah-v2 5 months ago

Improve model card with library_name and pipeline_tag

#1 opened 5 months ago by

authored a paper 5 months ago

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

Paper • 2503.02846 • Published Mar 4 • 19

upvoted a paper 5 months ago

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

Paper • 2503.02846 • Published Mar 4 • 19

commented a paper 5 months ago

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

Paper • 2503.02846 • Published Mar 4 • 19 •