Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
4
Mark
Makrrr
Follow
0 followers
·
2 following
AI & ML interests
NLP, RLHF, IR
Recent Activity
upvoted
a
paper
about 14 hours ago
GRACE: Generative Representation Learning via Contrastive Policy Optimization
updated
a dataset
about 2 months ago
Makrrr/RolePred
published
a dataset
about 2 months ago
Makrrr/RolePred
View all activity
Organizations
models
13
Sort: Recently updated
Makrrr/qwen3-8B-reasonmed-finetune-extreme
Text Generation
•
8B
•
Updated
Jul 24
•
13
Makrrr/qwen2.5-7B-reasonmed-finetune-extreme
Text Generation
•
8B
•
Updated
Jul 23
•
2
Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl
Reinforcement Learning
•
2B
•
Updated
Jul 5
•
31
•
2
Makrrr/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
May 31
Makrrr/Pyramids
Reinforcement Learning
•
Updated
May 30
•
3
Makrrr/ppo-SnowballTarget
Reinforcement Learning
•
Updated
May 30
•
4
Makrrr/Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
May 29
Makrrr/Cartpole-v1
Reinforcement Learning
•
Updated
May 29
Makrrr/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
May 28
Makrrr/QTable-Taxi-V3
Reinforcement Learning
•
Updated
May 28
View 13 models
datasets
1
Makrrr/RolePred
Viewer
•
Updated
Aug 12
•
854
•
366