rgtjf
AI & ML interests
None yet
Organizations
None yet
rgtjf/ppo-Pyramids
Reinforcement Learning
•
Updated
•
9
rgtjf/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
23
rgtjf/Reinforce-2048
Reinforcement Learning
•
Updated
rgtjf/Qwen2-UtK-72B-128K
73B
•
Updated
•
5
rgtjf/LLama3.1-UtK-8B-128K
8B
•
Updated
•
5
rgtjf/Qwen2-UtK-ChatQA2-7B-128K
8B
•
Updated
•
8
rgtjf/Qwen2-UtK-ChatQA2-72B-128K
73B
•
Updated
•
6
rgtjf/Qwen2-UtK-7B-128K
8B
•
Updated
•
6
rgtjf/Reinforce-1024
Reinforcement Learning
•
Updated
rgtjf/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
7
rgtjf/q-Taxi-v3
Reinforcement Learning
•
Updated
rgtjf/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
rgtjf/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
1