5 16

Wenkai Yang

Keven16

https://keven980716.github.io/

keven980716

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

upvoted a paper 2 days ago

NVIDIA Nemotron 3: Efficient and Open Intelligence

upvoted a paper 2 days ago

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

View all activity

Organizations

None yet

upvoted 3 papers 2 days ago

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published 3 days ago • 24

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published 3 days ago • 20

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published 9 days ago • 70

upvoted a paper 24 days ago

Mixture of Horizons in Action Chunking

Paper • 2511.19433 • Published Nov 24 • 17

upvoted a paper about 2 months ago

Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning

Paper • 2510.27623 • Published Oct 31 • 12

commented a paper about 2 months ago

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

Paper • 2510.24320 • Published Oct 28 • 19 •

authored a paper 2 months ago

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16 • 39

upvoted 2 papers 2 months ago

Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance

Paper • 2502.12459 • Published Feb 18 • 2

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16 • 39

updated a collection 2 months ago

LaSeR

Collection

Models from the paper "LaSeR: Reinforcement Learning with Last-Token Self-Rewarding" • 5 items • Updated Oct 17 • 1

commented a paper 2 months ago

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16 • 39 •

updated a collection 2 months ago

LaSeR

Collection

Models from the paper "LaSeR: Reinforcement Learning with Last-Token Self-Rewarding" • 5 items • Updated Oct 17 • 1

published a dataset 2 months ago

Keven16/LaSeR_training_data

Viewer • Updated Oct 16 • 104k • 42 • 2

published 3 models 2 months ago

Wenkai Yang

AI & ML interests

Recent Activity

Organizations

Keven16's activity