minju's picture

1 21

minju

iaminju

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs

upvoted a paper 13 days ago

When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

upvoted a paper 20 days ago

ACON: Optimizing Context Compression for Long-horizon LLM Agents

View all activity

Organizations

iaminju 's models 13

iaminju/rlpvr_pref_only

2B • Updated Mar 28

iaminju/rlpvr_math_only

2B • Updated Mar 28

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k_3

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k_2

2B • Updated Feb 28 • 1

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_83k

2B • Updated Feb 27

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_10k

Text Generation • 2B • Updated Feb 26

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_1k

Text Generation • 2B • Updated Feb 26

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_nq_s_pref

Text Generation • 2B • Updated Feb 25

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_pref

Text Generation • 2B • Updated Feb 25 • 1

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_math_nq_s

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_math_m

Text Generation • 2B • Updated Feb 25

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_nq_s

iaminju/DeepSeek-R1-Distill-Qwen-1.5B-GRPO