Junyoung

Gredora

AI & ML interests

None yet

models 9

Gredora/LLaDA-sft-s1k-merged

8B • Updated Jul 30 • 1

Gredora/Qwen2.5-1.5B-GRPO-compress

Gredora/Llama-2-7b-ORM-LoRA

Gredora/qwen7b-grpo

Gredora/mistral7b-grpo

Gredora/Mistral-7B-Instruct-v0.3

Gredora/Qwen2-0.5B-GRPO-test

Gredora/ppo-Huggy

Reinforcement Learning • Updated Aug 21, 2023 • 11

Gredora/ppo-LunarLander-v2

Reinforcement Learning • Updated Aug 16, 2023 • 2

datasets 0

None public yet