1 90 105

Jarrod Barnes PRO

Jarrodbarnes

https://arc.computer

AI & ML interests

Continual Learning, Reinforcement Learning

Recent Activity

liked a model about 16 hours ago

Qwen/Qwen3-Coder-Next

liked a dataset about 16 hours ago

facebook/principia-collection

liked a model about 16 hours ago

Qwen/Qwen3-Coder-30B-A3B-Instruct

View all activity

Organizations

upvoted a paper about 19 hours ago

ECHO-2: A Large-Scale Distributed Rollout Framework for Cost-Efficient Reinforcement Learning

Paper • 2602.02192 • Published 11 days ago • 12

upvoted 3 collections about 24 hours ago

upvoted a paper 3 days ago

Surprisal-Guided Selection: Compute-Optimal Test-Time Strategies for Execution-Grounded Code Generation

Paper • 2602.07670 • Published 6 days ago • 1

upvoted an article 6 days ago

Article

Where should test-time compute go? Surprisal-guided selection in verifiable environments

6 days ago

•

upvoted a paper 6 days ago

Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents

Paper • 2601.18217 • Published 18 days ago • 11

upvoted a paper 14 days ago

OpenSec: Measuring Incident Response Agent Calibration Under Adversarial Evidence

Paper • 2601.21083 • Published 16 days ago • 1

upvoted a collection 17 days ago

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano v3. • 8 items • Updated 9 days ago • 63

upvoted an article 21 days ago

Article

Frontier Security Agents Don't Lack Detection. They Lack Restraint

21 days ago

•

upvoted a paper 24 days ago

PhysRVG: Physics-Aware Unified Reinforcement Learning for Video Generative Models

Paper • 2601.11087 • Published 28 days ago • 11

upvoted an article 24 days ago

Article

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

Jan 5

•

upvoted a paper 28 days ago

ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking

Paper • 2601.06487 • Published Jan 10 • 52

upvoted a paper 29 days ago

CausalARC: Abstract Reasoning with Causal World Models

Paper • 2509.03636 • Published Sep 3, 2025 • 1

upvoted a paper 30 days ago

Ministral 3

Paper • 2601.08584 • Published Jan 13 • 53

upvoted an article about 1 month ago

Article

NVIDIA brings agents to life with DGX Spark and Reachy Mini

Jan 5

•

upvoted a collection about 1 month ago

Parakeet

Collection

NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 16 items • Updated 9 days ago • 54

upvoted an article about 1 month ago

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Apr 16, 2025

•

upvoted a paper about 1 month ago

Web World Models

Paper • 2512.23676 • Published Dec 29, 2025 • 27

upvoted an article about 1 month ago

Article

Deriving the DPO Loss from First Principles

Dec 30, 2025

•

Jarrod Barnes PRO

AI & ML interests

Recent Activity

Organizations

Jarrodbarnes's activity

Where should test-time compute go? Surprisal-guided selection in verifiable environments

Frontier Security Agents Don't Lack Detection. They Lack Restraint

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

NVIDIA brings agents to life with DGX Spark and Reachy Mini

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Deriving the DPO Loss from First Principles