ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g128-BitBLAS • Text Generation • 20B • Updated Jul 22, 2024 • 3
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128-BitBLAS • Text Generation • 5B • Updated Jul 22, 2024 • 3
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64-BitBLAS • Text Generation • 3B • Updated Jul 22, 2024 • 2
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128-BitBLAS • Text Generation • 3B • Updated Jul 22, 2024 • 2
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w4g128-BitBLAS • Text Generation • 37B • Updated Jul 22, 2024 • 5
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g64-BitBLAS • Text Generation • 21B • Updated Jul 22, 2024 • 11
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128-GPTQ • Text Generation • 2B • Updated Jul 22, 2024 • 37 • 1
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w4g128-GPTQ • Text Generation • 11B • Updated Jul 22, 2024 • 3
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g128-GPTQ • Text Generation • 7B • Updated Jul 22, 2024 • 13