Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
9
Xingtai Lv
XingtaiHF
Follow
lindsay-qu's profile picture
Roizzz's profile picture
aakashbilly's profile picture
3 followers
ยท
5 following
taitel1321401
telxt
AI & ML interests
LLM
Recent Activity
published
a model
16 days ago
XingtaiHF/0705_switch-sft_alr-5e-6_Qwen2.5-Math-7B
upvoted
a
paper
about 1 month ago
RLPR: Extrapolating RLVR to General Domains without Verifiers
upvoted
a
paper
2 months ago
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
View all activity
Organizations
None yet
Papers
10
arxiv:
2503.11224
arxiv:
2502.01456
arxiv:
2412.17739
arxiv:
2412.14689
Expand 10 papers
models
1
XingtaiHF/0705_switch-sft_alr-5e-6_Qwen2.5-Math-7B
Updated
16 days ago
datasets
0
None public yet