Nguyễn Minh Phúc
DatPySci
1 follower · 1 following
AI & ML interests
Reinforcement learning, NLP
Recent Activity
updated a model 2 days ago
DatPySci/RLDI
published a model 17 days ago
DatPySci/RLDI
updated a model 4 months ago
DatPySci/Qwen-2.5-7B-Simple-RL
DatPySci's activity
updated a model 2 days ago
DatPySci/RLDI • 2B • Updated 2 days ago • 22
published a model 17 days ago
DatPySci/RLDI • 2B • Updated 2 days ago • 22
updated a model 4 months ago
DatPySci/Qwen-2.5-7B-Simple-RL • Updated May 3
published 2 models 4 months ago
DatPySci/Qwen-2.5-7B-Simple-RL • Updated May 3
DatPySci/Llama-3.2-3B-sft-mixture • Text Generation • 3B • Updated Feb 10 • 1.12k
updated a model 4 months ago
DatPySci/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • 2B • Updated Apr 28 • 6
updated a model 5 months ago
DatPySci/DeepSeek-Qwen-1.5B-GRPO • 2B • Updated Apr 22 • 6
published 3 models 5 months ago
DatPySci/DeepSeek-Qwen-1.5B-GRPO • 2B • Updated Apr 22 • 6
DatPySci/Qwen-1.5B-Math-GRPO • Updated Apr 22
DatPySci/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • 2B • Updated Apr 28 • 6
updated a dataset 7 months ago
DatPySci/Llama-3.1-8B-rm-anthropic-hh • Viewer • Updated Feb 10 • 140k • 9
published a dataset 7 months ago
DatPySci/Llama-3.1-8B-rm-anthropic-hh • Viewer • Updated Feb 10 • 140k • 9
updated a dataset 7 months ago
DatPySci/Llama-3.1-8B-rm-tldr-pref • Viewer • Updated Feb 10 • 177k • 1
published a dataset 7 months ago
DatPySci/Llama-3.1-8B-rm-tldr-pref • Viewer • Updated Feb 10 • 177k • 1
updated 2 models 7 months ago
DatPySci/Llama-3.2-3B-sft-mixture • Text Generation • 3B • Updated Feb 10 • 1.12k
DatPySci/Llama-3.2-3B-sft-mixture • Text Generation • 3B • Updated Feb 10 • 1.12k