RedHatAI/Meta-Llama-3.1-70B-Instruct-FP8-dynamic Text Generation • 71B • Updated Oct 19, 2024 • 1.03k • 7
RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w8a8 Text Generation • 71B • Updated Feb 11 • 6.48k • 21
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 Image-Text-to-Text • 402B • Updated May 22 • 125k • • 123
RedHatAI/Llama-4-Scout-17B-16E-Instruct-quantized.w4a16 Image-Text-to-Text • 20B • Updated May 30 • 10.4k • 12
nm-testing/tinyllama-one-shot-static-quant-test-compressed Text Generation • 1B • Updated Oct 9, 2024 • 5