5 4 9

Zafir Stojanovski

zafstojano

AI & ML interests

None yet

Recent Activity

updated a model about 6 hours ago

zafstojano/qwen2.5-3b-knights_knaves_noncurriculum

published a model about 6 hours ago

zafstojano/qwen2.5-3b-knights_knaves_noncurriculum

updated a model about 19 hours ago

zafstojano/qwen2.5-3b-knights_knaves_curriculum

View all activity

Organizations

None yet

updated a model about 6 hours ago

zafstojano/qwen2.5-3b-knights_knaves_noncurriculum

3B • Updated about 6 hours ago • 1

published a model about 6 hours ago

zafstojano/qwen2.5-3b-knights_knaves_noncurriculum

3B • Updated about 6 hours ago • 1

updated a model about 19 hours ago

zafstojano/qwen2.5-3b-knights_knaves_curriculum

3B • Updated about 19 hours ago • 1

published a model about 19 hours ago

zafstojano/qwen2.5-3b-knights_knaves_curriculum

3B • Updated about 19 hours ago • 1

upvoted an article about 2 months ago

Article

GRPO for GUI Grounding Done Right

•

Jun 11

• 30

updated a dataset about 2 months ago

zafstojano/sample-rg-data

Viewer • Updated Jun 9 • 20k • 10

published a dataset about 2 months ago

zafstojano/sample-rg-data

Viewer • Updated Jun 9 • 20k • 10

New activity in zafstojano/Qwen2.5-3B-Instruct-RG-Math about 2 months ago

Add metadata

#1 opened about 2 months ago by

nielsr

upvoted a paper about 2 months ago

Taming LLMs by Scaling Learning Rates with Gradient Grouping

Paper • 2506.01049 • Published Jun 1 • 37

commented a paper about 2 months ago

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30 • 66 •

upvoted 2 papers about 2 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 174

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30 • 66

commented a paper about 2 months ago

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30 • 66 •

updated a model about 2 months ago

zafstojano/Qwen2.5-3B-Instruct-RG-Math

Text Generation • 3B • Updated Jun 4 • 3

liked a Space 2 months ago

446

AI Deadlines

⚡

Manage project deadlines efficiently

published a model 2 months ago

zafstojano/Qwen2.5-3B-Instruct-RG-Math

Text Generation • 3B • Updated Jun 4 • 3

liked a dataset 5 months ago

hkust-nlp/CodeIO-PyEdu-Reasoning

Preview • Updated Jun 18 • 88 • 53

liked a dataset 8 months ago

alpindale/two-million-bluesky-posts

Viewer • Updated Nov 28, 2024 • 2.11M • 430 • 199

liked a Space about 1 year ago

455

Omni-Zero

🧛

Restylize & repose person ID

New activity in HuggingFaceM4/idefics2-8b over 1 year ago

Text model not being loaded with Flash Attention 2

#27 opened over 1 year ago by

zafstojano

Zafir Stojanovski

AI & ML interests

Recent Activity

Organizations

zafstojano's activity

GRPO for GUI Grounding Done Right

Add metadata

AI Deadlines

Omni-Zero

Text model not being loaded with Flash Attention 2