Mark's picture

2 3 4

Mark

Makrrr

·

AI & ML interests

NLP, RLHF, IR

Recent Activity

upvoted a paper about 2 months ago

Adaptation of Agentic AI

upvoted a paper 4 months ago

DeepSeek-OCR: Contexts Optical Compression

new activity 4 months ago

Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl:Can we have the training setting?

View all activity

Organizations

liked a model 7 months ago

Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl

Reinforcement Learning • 2B • Updated Jul 5, 2025 • 5 • 2

liked a Space 9 months ago

Check My Progress Deep RL Course

Check your progress in a Deep RL course

liked 2 Spaces 10 months ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

FineWeb: decanting the web for the finest text data at scale

Read about FineWeb, a large web‑text dataset for LLMs