602

Joakim Lee

Reinforcement4All

AI & ML interests

None yet

Recent Activity

upvoted a paper about 3 hours ago

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

upvoted a paper about 3 hours ago

BabyVision: Visual Reasoning Beyond Language

upvoted a paper about 3 hours ago

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Organizations

None yet

upvoted 20 papers about 3 hours ago

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Paper • 2601.06943 • Published 2 days ago • 152

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published 3 days ago • 131

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Paper • 2601.05593 • Published 4 days ago • 60

X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests

Paper • 2601.06953 • Published 2 days ago • 27

MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Paper • 2601.07832 • Published about 19 hours ago • 23

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

Paper • 2601.07779 • Published about 20 hours ago • 21

MegaFlow: Large-Scale Distributed Orchestration System for the Agentic Era

Paper • 2601.07526 • Published 1 day ago • 14

DrivingGen: A Comprehensive Benchmark for Generative Video World Models in Autonomous Driving

Paper • 2601.01528 • Published 9 days ago • 16

Dr. Zero: Self-Evolving Search Agents without Training Data

Paper • 2601.07055 • Published 1 day ago • 6

MMFormalizer: Multimodal Autoformalization in the Wild

Paper • 2601.03017 • Published 7 days ago • 97

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Paper • 2601.06002 • Published 4 days ago • 45

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published 4 days ago • 36

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Paper • 2601.05808 • Published 4 days ago • 31

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Paper • 2601.04720 • Published 5 days ago • 28

Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals

Paper • 2601.05848 • Published 4 days ago • 13

SmartSearch: Process Reward-Guided Query Refinement for Search Agents

Paper • 2601.04888 • Published 5 days ago • 7

AnyDepth: Depth Estimation Made Easy

Paper • 2601.02760 • Published 7 days ago • 7

DR-LoRA: Dynamic Rank LoRA for Mixture-of-Experts Adaptation

Paper • 2601.04823 • Published 5 days ago • 4

TCAndon-Router: Adaptive Reasoning Router for Multi-Agent Collaboration

Paper • 2601.04544 • Published 5 days ago • 3

TowerMind: A Tower Defence Game Learning Environment and Benchmark for LLM as Agents

Paper • 2601.05899 • Published 4 days ago • 2