Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Kallinteris-Andreas
/
TRL-demo-Qwen2.5-0.5B-Reward-max_lenght256-4RA
like
0
PEFT
Safetensors
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
1
Use this model
4a4bc2d
TRL-demo-Qwen2.5-0.5B-Reward-max_lenght256-4RA
1.52 kB
1 contributor
History:
1 commit
Kallinteris-Andreas
initial commit
4a4bc2d
verified
11 months ago
.gitattributes
Safe
1.52 kB
initial commit
11 months ago