RedHatAI
/

Llama-3.1-8B-Instruct-FP8-block

Text Generation

compressed-tensors

Model card Files Files and versions

Llama-3.1-8B-Instruct-FP8-block

9.1 GB

3 contributors

History: 20 commits

alexmarques's picture

Update README.md

b4040dc verified 9 days ago

.gitattributes

1.61 kB

Add Llama 3.1 8B Instruct FP8-block model weights and tokenizer 28 days ago
README.md

7.42 kB

Update README.md 9 days ago
chat_template.jinja

4.61 kB

Add FP8 block quantized model weights 21 days ago
config.json

2.09 kB
xet

Add FP8 block quantized model weights 21 days ago
generation_config.json

184 Bytes
xet

Add FP8 block quantized model weights 21 days ago
model-00001-of-00002.safetensors

5 GB
LFS

Add FP8 block quantized model weights 21 days ago
model-00002-of-00002.safetensors

4.08 GB
LFS

Add FP8 block quantized model weights 21 days ago
model.safetensors.index.json

43.5 kB
LFS

Add FP8 block quantized model weights 21 days ago
recipe.yaml

134 Bytes

Add FP8 block quantized model weights 21 days ago
special_tokens_map.json

296 Bytes
LFS

Add FP8 block quantized model weights 21 days ago
tokenizer.json

17.2 MB
LFS

Add FP8 block quantized model weights 21 days ago
tokenizer_config.json

50.5 kB
LFS

Add FP8 block quantized model weights 21 days ago