Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models By nvidia and 3 others • 11 days ago • 47
Reward Models Collection Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated 8 days ago • 16
Article Supercharge Edge AI with High Accuracy Reasoning Using Llama Nemotron Nano 4B By nvidia and 3 others • Jun 10 • 7
AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset Paper • 2504.16891 • Published Apr 23 • 24
OpenMathReasoning Collection Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" • 7 items • Updated 8 days ago • 42
Countering Language Drift with Seeded Iterated Learning Paper • 2003.12694 • Published Mar 28, 2020 • 1
Reward-aware Preference Optimization: A Unified Mathematical Framework for Model Alignment Paper • 2502.00203 • Published Jan 31 • 2
Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models Paper • 2504.03624 • Published Apr 4 • 13
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 8 days ago • 61
Llama Nemotron Collection Open, Production-ready Enterprise Models • 9 items • Updated 4 days ago • 62
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment Paper • 2405.01481 • Published May 2, 2024 • 31