Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
Zhizhang Fu
HarryFu
Follow
0 followers
·
1 following
AI & ML interests
None yet
Recent Activity
updated
a model
3 days ago
HarryFu/Qwen2.5-3B-GRPO-0725
published
a model
3 days ago
HarryFu/Qwen2.5-3B-GRPO-0725
updated
a model
3 days ago
HarryFu/Qwen2.5-3B-SFT-GRPO-0725
View all activity
Organizations
None yet
Papers
1
arxiv:
2502.09100
models
8
Sort: Recently updated
HarryFu/Qwen2.5-3B-GRPO-0725
3B
•
Updated
3 days ago
•
1
HarryFu/Qwen2.5-3B-SFT-GRPO-0725
3B
•
Updated
3 days ago
•
4
HarryFu/Qwen2.5-3B-SFT-GRPO-0724
Updated
6 days ago
HarryFu/Qwen2.5-3B-SFT-GRPO
3B
•
Updated
7 days ago
•
10
HarryFu/Qwen2.5-3B-GRPO
3B
•
Updated
9 days ago
•
14
HarryFu/Qwen2.5-3B-Distill
3B
•
Updated
11 days ago
•
7
HarryFu/Qwen2.5-3B-Distill-GRPO
Updated
14 days ago
HarryFu/DeepSeek-R1-Distill-Qwen-7B-GRPO
Updated
20 days ago
datasets
0
None public yet