Boxin Wang

boxin-wbx

https://boxin.wang

AI & ML interests

None yet

Recent Activity

updated a model 12 days ago

nvidia/Nemotron-Cascade-2-30B-A3B

upvoted a paper 16 days ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

liked a dataset 17 days ago

nvidia/Nemotron-Cascade-2-SFT-Data

upvoted a paper 16 days ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 17 days ago • 66

upvoted a collection 17 days ago

Nemotron-Cascade 2

Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 5 days ago • 46

upvoted a paper 4 months ago

Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models

Paper • 2512.13607 • Published Dec 15, 2025 • 38

upvoted a collection 4 months ago

Nemotron-Cascade

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 14 items • Updated 5 days ago • 54

upvoted a collection about 1 year ago

AceMath

We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. • 11 items • Updated 5 days ago • 17

upvoted a collection over 1 year ago

NVLM 1.0

A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. • 2 items • Updated 5 days ago • 53

upvoted a paper over 1 year ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 74

upvoted a collection about 2 years ago

InstructRetro

InstructRetro is an autoregressive decoder-only language model (LM) with retrieval-augmented pretraining and instruction tuning. • 4 items • Updated 5 days ago • 11