BaleChen
AI & ML interests
None yet
Organizations
None yet
BaleChen/checkpoint-4500_merged
Text Classification
•
Updated
•
5
BaleChen/checkpoint-3000_merged
Text Generation
•
Updated
•
3
BaleChen/checkpoint-2000_merged
Text Generation
•
Updated
•
7
BaleChen/llama-rm
Text Classification
•
Updated
•
5
BaleChen/llama-sft
Text Generation
•
Updated
•
3
BaleChen/llama-7b-hf_peft_stack-exchange-paired_rmts__100000_2e-05_peft_last_checkpoint_oct4_merged
Text Classification
•
Updated
•
5
BaleChen/checkpoint-800_merged
Text Generation
•
Updated
•
3
BaleChen/checkpoint-1300_merged
Text Classification
•
Updated
•
5
BaleChen/checkpoint-500_merged
Text Generation
•
Updated
•
3
BaleChen/checkpoint-400_merged
Text Generation
•
Updated
•
3
BaleChen/checkpoint-900_merged
Text Classification
•
Updated
•
5
BaleChen/checkpoint-400-merged
Text Classification
•
Updated
•
5
BaleChen/REINFORCE-pixelcopter-test
Reinforcement Learning
•
Updated
BaleChen/REINFORCE-cartpolev1-test
Reinforcement Learning
•
Updated
BaleChen/dqn-SpaceInvadersNoFrameskip-v4-test
Reinforcement Learning
•
Updated
•
4
BaleChen/taxi-v3-q-test
Reinforcement Learning
•
Updated
BaleChen/frozenlake-v1-noslippery-test-q
Reinforcement Learning
•
Updated
BaleChen/test-ppo-Huggy
Reinforcement Learning
•
Updated
•
36
BaleChen/test_lunarlanderv2_mlp_ppo
Reinforcement Learning
•
Updated
•
2