nvidia/NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B-NVFP4 Text Generation • 18B • Updated 1 day ago • 40 • 1
nvidia/NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B-BF16 Text Generation • 32B • Updated 1 day ago • 39 • 3
nvidia/NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B-NVFP4 Text Generation • 18B • Updated 1 day ago • 40 • 1
nvidia/NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B-BF16 Text Generation • 32B • Updated 1 day ago • 39 • 3
Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs Paper • 2511.16664 • Published Nov 20, 2025 • 29
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model Paper • 2508.14444 • Published Aug 20, 2025 • 48
Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models Paper • 2504.03624 • Published Apr 4, 2025 • 18
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning Paper • 2504.11409 • Published Apr 15, 2025 • 9
view article Article Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B Aug 18, 2025 • 32
view article Article Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B Aug 18, 2025 • 32
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning Paper • 2504.11409 • Published Apr 15, 2025 • 9