7

mk

mk1111

AI & ML interests

None yet

Recent Activity

authored a paper 7 days ago

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

upvoted a paper 8 days ago

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

upvoted a paper 7 months ago

Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents

Organizations

None yet

Papers 1

arxiv:2604.22748

models 0

None public yet

datasets 4

mk1111/llama3.2-3b-instruct-ultrafeedback-armorm

Viewer • Updated Jul 15, 2025 • 60.7k • 18

mk1111/llama3-8b-instruct-ultrafeedback-armorm

Viewer • Updated Jul 15, 2025 • 59.9k • 4

mk1111/gemma2-2b-it-ultrafeedback-armorm

Viewer • Updated Jul 15, 2025 • 59.7k • 14

mk1111/llama3-8b-instruct-ultrafeedback

Viewer • Updated Jul 15, 2025 • 59.9k • 5