Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
10
1
SihengLi
Siheng99
Follow
pt-sk's profile picture
John6666's profile picture
browallia's profile picture
7 followers
·
1 following
SihengLi99
AI & ML interests
Artificial Intelligence
Recent Activity
upvoted
a
paper
30 days ago
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
authored
a paper
about 2 months ago
Reinforcement Learning on Pre-Training Data
upvoted
a
paper
about 2 months ago
Reinforcement Learning on Pre-Training Data
View all activity
Organizations
Siheng99
's models
9
Sort: Recently updated
Siheng99/Qwen3-1.7B-DeepMath-1024samples-RePO
Text Generation
•
2B
•
Updated
Jun 6
•
2
Siheng99/Qwen3-1.7B-DeepMath-1024samples-GRPO
Text Generation
•
2B
•
Updated
Jun 6
•
2
Siheng99/Qwen2.5-Math-7B-DeepMath-1024samples-RePO
Text Generation
•
8B
•
Updated
Jun 6
•
1
Siheng99/Qwen2.5-Math-7B-DeepMath-1024samples-GRPO
Text Generation
•
8B
•
Updated
Jun 6
•
3
Siheng99/Qwen2.5-Math-1.5B-DeepMath-1024samples-RePO
Text Generation
•
2B
•
Updated
Jun 6
•
3
Siheng99/Qwen2.5-Math-1.5B-DeepMath-1024samples-GRPO
Text Generation
•
2B
•
Updated
Jun 6
•
1
Siheng99/Qwen2.5-14B-Instruct-SEALONG
Text Generation
•
15B
•
Updated
Nov 10, 2024
•
1
•
1
Siheng99/Qwen2.5-7B-Instruct-SEALONG
Text Generation
•
8B
•
Updated
Nov 10, 2024
•
2
•
2
Siheng99/Llama-3.1-8B-Instruct-SEALONG
Text Generation
•
8B
•
Updated
Nov 10, 2024
•
1
•
2