Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
long
seamoke111
Follow
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents
upvoted
a
paper
2 months ago
The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
upvoted
a
paper
6 months ago
Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
View all activity
Organizations
None yet
models
1
seamoke111/HTL-CodeLlama-7B
Text Generation
•
7B
•
Updated
Jun 20, 2024
•
2
datasets
0
None public yet