GGUF
English
qwen_vl
video
real-time
multimodal
LLM
imatrix
conversational
Mungert committed on
Commit
a9828ad
·
verified ·
1 Parent(s): 4036e07

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +9 -0
README.md CHANGED
@@ -18,6 +18,14 @@ tags:
18
 
19
  # <span style="color: #7FFF7F;">LiveCC-7B-Instruct GGUF Models</span>
20
 
21
  ## <span style="color: #7FFF7F;">Ultra-Low-Bit Quantization with IQ-DynamicGate (1-2 bit)</span>
22
 
23
  Our latest quantization method introduces **precision-adaptive quantization** for ultra-low-bit models (1-2 bit), with benchmark-proven improvements on **Llama-3-8B**. This approach uses layer-specific strategies to preserve accuracy while maintaining extreme memory efficiency.
@@ -72,6 +80,7 @@ All tests conducted on **Llama-3-8B-Instruct** using:
72
  ✔ **Research** into ultra-low-bit quantization
73
 
74
 
 
75
  ## **Choosing the Right Model Format**
76
 
77
  Selecting the correct model format depends on your **hardware capabilities** and **memory constraints**.
 
18
 
19
  # <span style="color: #7FFF7F;">LiveCC-7B-Instruct GGUF Models</span>
20
 
21
+
22
+ ## <span style="color: #7F7FFF;">Model Generation Details</span>
23
+
24
+ This model was generated using [llama.cpp](https://github.com/ggerganov/llama.cpp) at commit [`e291450`](https://github.com/ggerganov/llama.cpp/commit/e291450b7602d7a36239e4ceeece37625f838373).
25
+
26
+
27
+
28
+
29
  ## <span style="color: #7FFF7F;">Ultra-Low-Bit Quantization with IQ-DynamicGate (1-2 bit)</span>
30
 
31
  Our latest quantization method introduces **precision-adaptive quantization** for ultra-low-bit models (1-2 bit), with benchmark-proven improvements on **Llama-3-8B**. This approach uses layer-specific strategies to preserve accuracy while maintaining extreme memory efficiency.
 
80
  ✔ **Research** into ultra-low-bit quantization
81
 
82
 
83
+
84
  ## **Choosing the Right Model Format**
85
 
86
  Selecting the correct model format depends on your **hardware capabilities** and **memory constraints**.