Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
3
Zhikai Lei
Kausal77
Follow
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
1 day ago
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping
upvoted
a
paper
7 days ago
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents
upvoted
a
paper
11 months ago
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
View all activity
Organizations
None yet
Kausal77
's datasets
None public yet