ArtusDev
/

TheDrummer_Anubis-70B-v1.1-FP8-Dynamic

Text Generation

text-generation-inference

compressed-tensors

Model card Files Files and versions

ArtusDev commited on Jun 28

Commit

c57715e

·

verified ·

1 Parent(s): 0fedf24

Create README.md

Files changed (1) hide show

README.md +33 -0

README.md ADDED Viewed

	@@ -0,0 +1,33 @@

+---
+base_model: TheDrummer/Anubis-70B-v1.1
+base_model_relation: quantized
+quantized_by: ArtusDev
+license: llama3.3
+pipeline_tag: text-generation
+library_name: transformers
+tags:
+- fp8
+- fp8-dynamic
+---
+## FP8 Quant of TheDrummer/Anubis-70B-v1.1
+FP8 quant of [TheDrummer/Anubis-70B-v1.1](https://huggingface.co/TheDrummer/Anubis-70B-v1.1) using <a href="https://github.com/vllm-project/llm-compressor/">llm-compressor</a> for quantization.
+### Downloading quants with huggingface-cli
+<details>
+  <summary>Click to view download instructions</summary>
+Install hugginface-cli:
+```bash
+pip install -U "huggingface_hub[cli]"
+```
+Download quant by targeting the specific quant revision (branch):
+```
+huggingface-cli download ArtusDev/TheDrummer_Anubis-70B-v1.1-FP8-Dynamic --local-dir ./
+```
+</details>