Hugging Face
SihengLi

Siheng99
  • SihengLi99

AI & ML interests

Artificial Intelligence

Recent Activity

upvoted a paper 30 days ago
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
authored a paper about 2 months ago
Reinforcement Learning on Pre-Training Data
upvoted a paper about 2 months ago
Reinforcement Learning on Pre-Training Data

Organizations

The Chinese University of Hong Kong

Siheng99's models (9)

Siheng99/Qwen3-1.7B-DeepMath-1024samples-RePO

Text Generation • 2B • Updated Jun 6 • 2

Siheng99/Qwen3-1.7B-DeepMath-1024samples-GRPO

Text Generation • 2B • Updated Jun 6 • 2

Siheng99/Qwen2.5-Math-7B-DeepMath-1024samples-RePO

Text Generation • 8B • Updated Jun 6 • 1

Siheng99/Qwen2.5-Math-7B-DeepMath-1024samples-GRPO

Text Generation • 8B • Updated Jun 6 • 3

Siheng99/Qwen2.5-Math-1.5B-DeepMath-1024samples-RePO

Text Generation • 2B • Updated Jun 6 • 3

Siheng99/Qwen2.5-Math-1.5B-DeepMath-1024samples-GRPO

Text Generation • 2B • Updated Jun 6 • 1

Siheng99/Qwen2.5-14B-Instruct-SEALONG

Text Generation • 15B • Updated Nov 10, 2024 • 1 • 1

Siheng99/Qwen2.5-7B-Instruct-SEALONG

Text Generation • 8B • Updated Nov 10, 2024 • 2 • 2

Siheng99/Llama-3.1-8B-Instruct-SEALONG

Text Generation • 8B • Updated Nov 10, 2024 • 1 • 2