FP8 Quant of TheDrummer/Anubis-70B-v1.1

FP8 quant of TheDrummer/Anubis-70B-v1.1 using llm-compressor for quantization.

Downloading quants with huggingface-cli

Click to view download instructions

Install hugginface-cli:

pip install -U "huggingface_hub[cli]"

Download quant by targeting the specific quant revision (branch):

huggingface-cli download ArtusDev/TheDrummer_Anubis-70B-v1.1-FP8-Dynamic --local-dir ./
Downloads last month
547
Safetensors
Model size
70.6B params
Tensor type
BF16
·
F8_E4M3
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ArtusDev/TheDrummer_Anubis-70B-v1.1-FP8-Dynamic

Quantized
(10)
this model