NickyNicky/Llama-1B-GRPO_Final
Text Generation, Transformers, Safetensors, llama, conversational, text-generation-inference
NickyNicky: Update README.md (d14d614, verified, 10 months ago)
---
library_name: transformers
tags: []
---
dataset: openai/gsm8k
132 steps