Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
RLVER
/
PPO-non-thinking
like
1
Safetensors
qwen2
arxiv:
2507.03112
License:
license
Model card
Files
Files and versions
xet
Community
1
RLVER
commited on
Jul 4
Commit
c760724
·
verified
·
1 Parent(s):
d6c4fc6
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+7
-5
README.md
CHANGED
Viewed
@@ -1,5 +1,7 @@
1
-
---
2
-
license: other
3
-
license_name: license
4
-
license_link: LICENSE
5
-
---
1
+
---
2
+
license: other
3
+
license_name: license
4
+
license_link: LICENSE
5
+
base_model:
6
+
- Qwen/Qwen2.5-7B-Instruct
7
+
---