view article Article Supercharge Edge AI With HighβAccuracy Reasoning Using NVIDIA Nemotron Nano 2 9B By nvidia and 9 others β’ Aug 18 β’ 30
nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1 Text Generation β’ 5B β’ Updated 8 days ago β’ 44.5k β’ 109
NVILA: Efficient Frontier Visual Language Models Paper β’ 2412.04468 β’ Published Dec 5, 2024 β’ 59
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models Paper β’ 2409.17481 β’ Published Sep 26, 2024 β’ 47
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models Paper β’ 2409.17481 β’ Published Sep 26, 2024 β’ 47
LLM Pruning and Distillation in Practice: The Minitron Approach Paper β’ 2408.11796 β’ Published Aug 21, 2024 β’ 57
nvidia/Mistral-NeMo-Minitron-8B-Base Text Generation β’ 8B β’ Updated Aug 22, 2024 β’ 4.13k β’ 176
LLM Pruning and Distillation in Practice: The Minitron Approach Paper β’ 2408.11796 β’ Published Aug 21, 2024 β’ 57