Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
174.4
TFLOPS
3
15
91
rasdani
PRO
rasdani
Follow
DavidGF's profile picture
johannhartmann's profile picture
21world's profile picture
19 followers
·
71 following
rasdani_
rasdani
rasdani
AI & ML interests
None yet
Recent Activity
liked
a dataset
6 days ago
R2E-Gym/R2EGym-TestingAgent-SFT-Trajectories
published
a dataset
8 days ago
rasdani/SkyRL-v0-293-data-oracle-4k-context-100-epochs
liked
a model
8 days ago
StringChaos/R2E-TestgenAgent
View all activity
Organizations
rasdani
's models
37
Sort: Recently updated
rasdani/deepseek_r1_qwen14b_swe_rl_8k
15B
•
Updated
19 days ago
•
5
rasdani/deepseek_r1_llama_8b_swe_rl_8k_12_epochs
8B
•
Updated
21 days ago
•
6
rasdani/qwen3_8b_swe_rl_8k
8B
•
Updated
24 days ago
•
18
rasdani/deepseek_r1_7b_gh_patches_2k_fixed_reward
8B
•
Updated
Jun 29
•
6
rasdani/deepseek_r1_7b_gh_patches_2k
8B
•
Updated
Jun 28
•
4
rasdani/crux-eval_math-eval-logs
Updated
Jun 25
rasdani/git-diff-Qwen-4B-10k
4B
•
Updated
Jun 25
•
3
rasdani/git-diff-Qwen-4B-10k-checkpoints
Updated
Jun 25
rasdani/git-diff-Qwen-4B-32k-checkpoints
Updated
Jun 23
rasdani/git-diff-Qwen-4B-30k
4B
•
Updated
Jun 22
•
4
rasdani/git-diff-Qwen-4B
4B
•
Updated
Jun 17
•
4
rasdani/git-diff-Qwen-1.7B
2B
•
Updated
Jun 16
•
2
rasdani/git-diff-Qwen-1.7-B
2B
•
Updated
Jun 16
•
2
rasdani/simple-math-Qwen-1.5B
2B
•
Updated
Jun 15
•
2
rasdani/qwen3_0_6b_function_rm
0.8B
•
Updated
May 22
•
1
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-8192k
0.5B
•
Updated
Apr 8
•
1
rasdani/Qwen2.5-0.5B-simpleRL-Zoo
Text Generation
•
0.5B
•
Updated
Apr 6
•
2
rasdani/smolR1-Qwen2.5-0.5B
Text Generation
•
0.5B
•
Updated
Mar 31
•
6
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-no-KL
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-3072k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-4096k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-2560k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-2048k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-first-try
0.5B
•
Updated
Mar 29
•
3
rasdani/Qwen-1.5B-Distill-GRPO
Text Generation
•
2B
•
Updated
Mar 28
•
6
rasdani/Qwen-0.5B-Instruct-GRPO
Updated
Mar 27
rasdani/gsm8k_qwen2.5-0.5b
0.5B
•
Updated
Mar 11
•
1
rasdani/Qwen2.5-1.5B-Open-R1-Code-GRPO
Updated
Mar 9
rasdani/Qwen2.5-0.5B-Open-R1-Code-GRPO
Text Generation
•
0.6B
•
Updated
Mar 8
•
4
rasdani/Qwen2.5-7B-Instruct-GRPO-unsloth
Text Generation
•
8B
•
Updated
Mar 2
•
10
Previous
1
2
Next