Hugging Face
Junyoung
Gredora
0 followers · 1 following
AI & ML interests: None yet
Models (9)
Gredora/LLaDA-sft-s1k-merged • 8B • Updated Jul 30 • 1
Gredora/Qwen2.5-1.5B-GRPO-compress • Updated Mar 31
Gredora/Llama-2-7b-ORM-LoRA • Updated Mar 16
Gredora/qwen7b-grpo • Updated Feb 20
Gredora/mistral7b-grpo • Updated Feb 20
Gredora/Mistral-7B-Instruct-v0.3 • Updated Feb 20
Gredora/Qwen2-0.5B-GRPO-test • Updated Feb 18
Gredora/ppo-Huggy • Reinforcement Learning • Updated Aug 21, 2023 • 11
Gredora/ppo-LunarLander-v2 • Reinforcement Learning • Updated Aug 16, 2023 • 2
Datasets (0)
None public yet