Add model card
#1
by
nielsr
HF Staff
- opened
This PR adds a model card for the paper OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling.
It sets the pipeline_tag, library_name and license.
This PR adds a model card for the paper OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling.
It sets the pipeline_tag, library_name and license.