Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
26
Drax
dddraxxx
Follow
AI & ML interests
None yet
Recent Activity
updated
a dataset
1 day ago
dddraxxx/docvqa-augmented-rl
published
a dataset
1 day ago
dddraxxx/docvqa-augmented-rl
updated
a dataset
1 day ago
dddraxxx/docvqa-single-rl
View all activity
Organizations
None yet
dddraxxx
's models
64
Sort: Recently updated
dddraxxx/longref_v1_long_weighted-BOX-Qwen2.5-VL-7B-GRPO-REC
8B
•
Updated
Mar 6
•
2
dddraxxx/longref_v1_long-BOX-Qwen2.5-VL-7B-GRPO-REC_1
8B
•
Updated
Mar 5
•
2
dddraxxx/base_Qwen2.5-VL-7B
8B
•
Updated
Mar 4
•
2
dddraxxx/longref_base_Qwen2.5-VL-7B1
8B
•
Updated
Mar 3
•
2
dddraxxx/longref_v1_shortprompt-BOX-Qwen2.5-VL-7B-GRPO-REC
8B
•
Updated
Mar 3
•
4
dddraxxx/longref_base_Qwen2.5-VL-3B
4B
•
Updated
Mar 3
•
2
dddraxxx/longref_v1_tempconstant_long-BOX-Qwen2.5-VL-3B-GRPO-REC
4B
•
Updated
Mar 3
•
2
dddraxxx/v1_tempconstant_long-FORMAT-BOX-Qwen2.5-VL-7B-GRPO-REC
Updated
Mar 2
dddraxxx/longref_v1_tempconstant_long-FORMAT-BOX-Qwen2.5-VL-3B-GRPO-REC
Updated
Mar 2
dddraxxx/v0_temp_trial-FORMAT-BOX-Qwen2.5-VL-3B-GRPO-REC
4B
•
Updated
Mar 1
•
3
dddraxxx/v0_temptrial_weightediou-FORMAT-BOX-Qwen2.5-VL-3B-GRPO-REC
4B
•
Updated
Mar 1
•
2
dddraxxx/v0_temptrial_long-FORMAT-BOX-Qwen2.5-VL-3B-GRPO-REC
4B
•
Updated
Mar 1
•
2
dddraxxx/v2_tempdecay150-300_lengthcontrol_long-REC-Qwen2.5-VL-3B
4B
•
Updated
Mar 1
•
2
dddraxxx/v2_tempdecay_lengthcontrol_long-REC-Qwen2.5-VL-3B
4B
•
Updated
Feb 28
•
2
dddraxxx/v2_tempdecay_lengthcontrol_long-FORMAT-BOX-Qwen2.5-VL-3B-GRPO-REC
Updated
Feb 28
dddraxxx/Qwen2.5-VL-3B-GRPO-REC_base
Updated
Feb 28
dddraxxx/v0_temp_trial_6-1-FORMAT-BOX-Qwen2.5-VL-3B-GRPO-REC
4B
•
Updated
Feb 28
•
3
dddraxxx/base_Qwen2.5-VL-3B-GRPO-REC
4B
•
Updated
Feb 28
•
3
dddraxxx/base_ref_l_even
Updated
Feb 28
dddraxxx/Qwen2.5-VL-3B-GRPO-REC-FORMAT-BOX_v0_temp
4B
•
Updated
Feb 27
•
3
dddraxxx/Qwen2.5-VL-3B-GRPO-REC-FORMAT-BOX_v0_1_constant_temp
Updated
Feb 27
dddraxxx/Qwen2.5-VL-3B-GRPO-REC-FORMAT-BOX_v0
Updated
Feb 27
dddraxxx/Qwen2.5-VL-3B-GRPO-REC-FORMAT-BOX-v1
Updated
Feb 27
dddraxxx/Qwen2.5-VL-3B-GRPO-REC-FORMAT-BOX_v0_1
Updated
Feb 27
dddraxxx/base_ref_l
Updated
Feb 27
•
3
dddraxxx/Qwen2.5-VL-3B-GRPO-tallyqa
Updated
Feb 25
dddraxxx/Qwen2.5-VL-3B-GRPO-tallyqa-som
Updated
Feb 25
dddraxxx/Qwen2.5-VL-3B-GRPO-REC
4B
•
Updated
Feb 25
•
3
dddraxxx/Qwen2.5-VL-3B-GRPO-CLEVR-spatial
Updated
Feb 24
dddraxxx/qwen2-2b-instruct-trl-sft-thinking
Updated
Feb 18
Previous
1
2
3
Next