18 20 51

Alexander Kovrigin

waleko

https://alexkovrigin.me

waleko

AI & ML interests

AI for Code

Recent Activity

upvoted a paper 3 days ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

upvoted a paper 18 days ago

The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management

upvoted a collection 21 days ago

🦫 PIPer

View all activity

Organizations

upvoted a paper 3 days ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

Paper • 2510.23393 • Published 5 days ago • 20

upvoted a paper 18 days ago

The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management

Paper • 2508.21433 • Published Aug 29 • 7

upvoted a collection 21 days ago

🦫 PIPer

Collection

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated about 1 month ago • 2

liked a model 23 days ago

agentica-org/DeepSWE-Preview

Text Generation • 33B • Updated Jul 3 • 1.95k • • 187

upvoted a paper 30 days ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published about 1 month ago • 87

commented a paper 30 days ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29 • 35 •

updated 2 datasets 30 days ago

JetBrains-Research/PIPer-SFT-2500-sharegpt

Viewer • Updated 30 days ago • 2.5k • 43 • 1

JetBrains-Research/PIPer-envbench-zeroshot-rl

Viewer • Updated 30 days ago • 742 • 60 • 1

upvoted a paper 30 days ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29 • 35

authored a paper about 1 month ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29 • 35

liked a model about 1 month ago

JetBrains-Research/PIPer-8B

Text Generation • 8B • Updated about 1 month ago • 12 • 2

updated a dataset about 1 month ago

JetBrains-Research/PIPer-eval

Preview • Updated Sep 30 • 12

updated 4 models about 1 month ago

published a model about 1 month ago

waleko/latent-diffusion-autoencoder-128

Updated Sep 27

Alexander Kovrigin

AI & ML interests

Recent Activity

Organizations

waleko's activity