datasets datasets of interest bigcode/the-stack-dedup Viewer • Updated Aug 17, 2023 • 237M • 4.17k • 366 liwu/MNBVC Updated 2 days ago • 23.2k • 559 code-search-net/code_search_net Updated Jan 18, 2024 • 5.98k • 311 HuggingFaceH4/ultrachat_200k Viewer • Updated Oct 16, 2024 • 515k • 25.2k • 571
paper reading LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21, 2024 • 58
LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21, 2024 • 58
datasets datasets of interest bigcode/the-stack-dedup Viewer • Updated Aug 17, 2023 • 237M • 4.17k • 366 liwu/MNBVC Updated 2 days ago • 23.2k • 559 code-search-net/code_search_net Updated Jan 18, 2024 • 5.98k • 311 HuggingFaceH4/ultrachat_200k Viewer • Updated Oct 16, 2024 • 515k • 25.2k • 571
paper reading LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21, 2024 • 58
LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21, 2024 • 58