Lize Pirenne

Inversta · Pangasius

AI & ML interests

LLMs, RL

Organizations

None yet

Recent Activity
upvoted a paper 5 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 9 days ago • 194

upvoted a paper 13 days ago

ROOT: Robust Orthogonalized Optimizer for Neural Network Training

Paper • 2511.20626 • Published 15 days ago • 169

upvoted a paper 16 days ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published 29 days ago • 104

upvoted 4 papers 17 days ago

Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising

Paper • 2511.08633 • Published Nov 9 • 53

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published 29 days ago • 112

One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models

Paper • 2511.10629 • Published 27 days ago • 122

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9 • 129

upvoted 2 papers 19 days ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published 29 days ago • 194

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29 • 220

liked a dataset 22 days ago

zai-org/LongBench-v2

Viewer • Updated Dec 20, 2024 • 503 • 13.5k • 26

upvoted 5 papers about 2 months ago

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23 • 68

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 189

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10 • 660

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 88

DINOv3

Paper • 2508.10104 • Published Aug 13 • 285

upvoted 5 papers 2 months ago

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation

Paper • 2507.10524 • Published Jul 14 • 70

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 315

Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency

Paper • 2506.08343 • Published Jun 10 • 54

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published Jun 5 • 133

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 263