SihengLi's picture

4 10 1

SihengLi

Siheng99

·

SihengLi99

AI & ML interests

Artificial Intelligence

Recent Activity

upvoted a paper 30 days ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

authored a paper about 2 months ago

Reinforcement Learning on Pre-Training Data

upvoted a paper about 2 months ago

Reinforcement Learning on Pre-Training Data

View all activity

Organizations

Siheng99 's models 9

Siheng99/Qwen3-1.7B-DeepMath-1024samples-RePO

Text Generation • 2B • Updated Jun 6 • 2

Siheng99/Qwen3-1.7B-DeepMath-1024samples-GRPO

Text Generation • 2B • Updated Jun 6 • 2

Siheng99/Qwen2.5-Math-7B-DeepMath-1024samples-RePO

Text Generation • 8B • Updated Jun 6 • 1

Siheng99/Qwen2.5-Math-7B-DeepMath-1024samples-GRPO

Text Generation • 8B • Updated Jun 6 • 3

Siheng99/Qwen2.5-Math-1.5B-DeepMath-1024samples-RePO

Text Generation • 2B • Updated Jun 6 • 3

Siheng99/Qwen2.5-Math-1.5B-DeepMath-1024samples-GRPO

Text Generation • 2B • Updated Jun 6 • 1

Siheng99/Qwen2.5-14B-Instruct-SEALONG

Text Generation • 15B • Updated Nov 10, 2024 • 1 • 1

Siheng99/Qwen2.5-7B-Instruct-SEALONG

Text Generation • 8B • Updated Nov 10, 2024 • 2 • 2

Siheng99/Llama-3.1-8B-Instruct-SEALONG

Text Generation • 8B • Updated Nov 10, 2024 • 1 • 2