Nguyễn Minh Phúc

DatPySci

AI & ML interests

Reinforcement learning, NLP

Recent Activity

updated a model 2 days ago

DatPySci/RLDI

2B • Updated 2 days ago • 22

published a model 17 days ago

DatPySci/RLDI

2B • Updated 2 days ago • 22

updated a model 4 months ago

DatPySci/Qwen-2.5-7B-Simple-RL

Updated May 3

published 2 models 4 months ago

DatPySci/Qwen-2.5-7B-Simple-RL

Updated May 3

DatPySci/Llama-3.2-3B-sft-mixture

Text Generation • 3B • Updated Feb 10 • 1.12k

updated a model 4 months ago

DatPySci/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

2B • Updated Apr 28 • 6

updated a model 5 months ago

DatPySci/DeepSeek-Qwen-1.5B-GRPO

2B • Updated Apr 22 • 6

published 3 models 5 months ago

updated a dataset 7 months ago

DatPySci/Llama-3.1-8B-rm-anthropic-hh

Viewer • Updated Feb 10 • 140k • 9

published a dataset 7 months ago

DatPySci/Llama-3.1-8B-rm-anthropic-hh

Viewer • Updated Feb 10 • 140k • 9

updated a dataset 7 months ago

DatPySci/Llama-3.1-8B-rm-tldr-pref

Viewer • Updated Feb 10 • 177k • 1

published a dataset 7 months ago

DatPySci/Llama-3.1-8B-rm-tldr-pref

Viewer • Updated Feb 10 • 177k • 1

updated 2 models 7 months ago

DatPySci/Llama-3.2-3B-sft-mixture

Text Generation • 3B • Updated Feb 10 • 1.12k
