Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Xingtai Lv's picture
9

Xingtai Lv

XingtaiHF
lindsay-qu's profile picture Roizzz's profile picture aakashbilly's profile picture
ยท
  • taitel1321401
  • telxt

AI & ML interests

LLM

Recent Activity

published a model 16 days ago
XingtaiHF/0705_switch-sft_alr-5e-6_Qwen2.5-Math-7B
upvoted a paper about 1 month ago
RLPR: Extrapolating RLVR to General Domains without Verifiers
upvoted a paper 2 months ago
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
View all activity

Organizations

None yet

Papers 10

arxiv:2503.11224
arxiv:2502.01456
arxiv:2412.17739
arxiv:2412.14689

models 1

XingtaiHF/0705_switch-sft_alr-5e-6_Qwen2.5-Math-7B

Updated 16 days ago

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs