Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
21
minju
iaminju
Follow
mberkanbicer's profile picture
gmlwns5176's profile picture
saytes's profile picture
8 followers
·
4 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
8 days ago
Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs
upvoted
a
paper
13 days ago
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
upvoted
a
paper
20 days ago
ACON: Optimizing Context Compression for Long-horizon LLM Agents
View all activity
Organizations
iaminju
's models
13
Sort: Recently updated
iaminju/rlpvr_pref_only
2B
•
Updated
Mar 28
iaminju/rlpvr_math_only
2B
•
Updated
Mar 28
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k_3
Updated
Feb 28
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k_2
2B
•
Updated
Feb 28
•
1
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k
2B
•
Updated
Feb 27
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_10k
Text Generation
•
2B
•
Updated
Feb 26
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_1k
Text Generation
•
2B
•
Updated
Feb 26
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_nq_s_pref
Text Generation
•
2B
•
Updated
Feb 25
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_pref
Text Generation
•
2B
•
Updated
Feb 25
•
1
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_math_nq_s
Updated
Feb 25
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_math_m
Text Generation
•
2B
•
Updated
Feb 25
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_nq_s
Updated
Feb 24
iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
Feb 24