ArtusDev committed on
Commit c57715e · verified · 1 Parent(s): 0fedf24

Create README.md

Files changed (1): README.md +33 -0
README.md ADDED
@@ -0,0 +1,33 @@
+ ---
+ base_model: TheDrummer/Anubis-70B-v1.1
+ base_model_relation: quantized
+ quantized_by: ArtusDev
+ license: llama3.3
+ pipeline_tag: text-generation
+ library_name: transformers
+ tags:
+ - fp8
+ - fp8-dynamic
+ ---
+
+ ## FP8 Quant of TheDrummer/Anubis-70B-v1.1
+
+ FP8 quant of [TheDrummer/Anubis-70B-v1.1](https://huggingface.co/TheDrummer/Anubis-70B-v1.1), made with [llm-compressor](https://github.com/vllm-project/llm-compressor/).
+
+ ### Downloading quants with huggingface-cli
+
+ <details>
+ <summary>Click to view download instructions</summary>
+
+ Install huggingface-cli:
+
+ ```bash
+ pip install -U "huggingface_hub[cli]"
+ ```
+
+ Download the quant into the current directory (pass `--revision <branch>` to target a specific revision):
+
+ ```bash
+ huggingface-cli download ArtusDev/TheDrummer_Anubis-70B-v1.1-FP8-Dynamic --local-dir ./
+ ```
+ </details>
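
The download step can also be wrapped in a small script. This is a minimal sketch under assumptions that the README does not state: it assumes the quant sits on the default `main` branch, and `TARGET_DIR` and the `DRY_RUN` switch are hypothetical names chosen for illustration. By default it only prints the command so it can be checked before a large download starts.

```bash
#!/bin/sh
# Repository id taken from the README's download command.
REPO="ArtusDev/TheDrummer_Anubis-70B-v1.1-FP8-Dynamic"
# Assumption: the quant is on the default branch; change if the repo uses another one.
REVISION="main"
# Hypothetical target directory; the README itself downloads into ./
TARGET_DIR="./Anubis-70B-v1.1-FP8-Dynamic"
# Hypothetical switch: 1 prints the command, 0 runs it.
DRY_RUN="${DRY_RUN:-1}"

# Build the command as one string; none of the values contain spaces,
# so plain word splitting is safe when it is executed below.
download_cmd="huggingface-cli download $REPO --revision $REVISION --local-dir $TARGET_DIR"

if [ "$DRY_RUN" = "1" ]; then
  # Print only, so the command can be reviewed first.
  echo "$download_cmd"
else
  # Run the download (requires huggingface_hub[cli] to be installed).
  $download_cmd
fi
```

Saved under a name of your choice, the script performs the actual download when run with `DRY_RUN=0`. A 70B model is large even at FP8, so it is worth confirming there is enough free disk space in the target directory first.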