Hugging Face
Yiming Jia
jymmmmm
1 follower · 3 following
AI & ML interests
None yet
Recent Activity
Upvoted a paper 22 days ago: UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
Upvoted a paper 22 days ago: LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
Updated a model about 1 month ago: jymmmmm/vlm-fixed-onpolicy-step600
Organizations
jymmmmm's models (14)
jymmmmm/vlm-fixed-onpolicy-step600 • 8B • Updated Aug 23 • 4
jymmmmm/vlm-fixed-onpolicy-step400 • 8B • Updated Aug 23 • 3
jymmmmm/vlm-fixed-onpolicy-step200 • 8B • Updated Aug 20 • 4
jymmmmm/qwen2_5vl_tgrpo_f32_s300 • 8B • Updated May 9 • 5
jymmmmm/qwen2_5vl_tgrpo_f32_s900 • 8B • Updated May 9 • 3
jymmmmm/qwen2_5vl_tgrpo_f32_s1561 • 8B • Updated May 9 • 3
jymmmmm/qwen2_5vl_grpo_f64_s300 • 8B • Updated May 7 • 5
jymmmmm/qwen2_5vl_grpo_f64_s900 • 8B • Updated May 7 • 4
jymmmmm/qwen2_5vl_grpo_f64_s1500 • 8B • Updated May 7 • 5
jymmmmm/qwen2_5vl_grpo_1500 • Updated May 7 • 4
jymmmmm/training1_checkpoint-1100 • 8B • Updated May 5 • 2
jymmmmm/training1_checkpoint-500 • 8B • Updated May 5 • 2
jymmmmm/visualwebinstruct_temp • Updated Apr 14
jymmmmm/pot-r1-grpo-qwen2.5-7b-Instruct • Updated Mar 30