LLMJapan committed
Commit 9733b94 · verified · 1 Parent(s): d425cc4

Update README.md


Quantization updated

Files changed (1)
  1. README.md +15 -0
README.md CHANGED
@@ -10,6 +10,21 @@ pipeline_tag: text-generation
 library_name: transformers
 ---
 
+## Exllama v2 Quantizations of OlympicCoder-7B
+
+Using <a href="https://github.com/turboderp/exllamav2/releases/tag/v0.2.8">turboderp's ExLlamaV2 v0.2.8</a> for quantization.
+
+average: 8.0bpw
+lm_head: 8.0bpw
+```sh
+python convert.py \
+  -i {path}/OlympicCoder-7B \
+  -o {path}/OlympicCoder-7B/workingdir/ \
+  -cf {path}/OlympicCoder-7B_8.0bpw/ \
+  -b 8.0 \
+  -hb 8
+```
+
 # Model Card for OlympicCoder-7B
 
 OlympicCoder-7B is a code model that achieves strong performance on competitive coding benchmarks such as LiveCodeBench and the 2024 International Olympiad in Informatics.
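
The committed snippet quantizes to 8.0 bits per weight (`-b 8.0`) with an 8-bit output layer (`-hb 8`); the directory passed to `-cf` is the finished EXL2 quant. As a usage note, below is a minimal loading sketch with the exllamav2 Python API, assuming the same v0.2.8 release is installed; the model path is a placeholder for wherever the `-cf` output directory was saved.

```python
# Sketch: load the 8.0bpw EXL2 quant with exllamav2 and run a quick generation.
# The model_dir below is a placeholder for the -cf output directory from convert.py.
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2DynamicGenerator

model_dir = "{path}/OlympicCoder-7B_8.0bpw"  # placeholder path
config = ExLlamaV2Config(model_dir)
model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy=True)       # allocate the cache lazily, then autosplit across GPUs
model.load_autosplit(cache, progress=True)
tokenizer = ExLlamaV2Tokenizer(config)

generator = ExLlamaV2DynamicGenerator(model=model, cache=cache, tokenizer=tokenizer)
output = generator.generate(
    prompt="Write a C++ solution for the two-sum problem.",
    max_new_tokens=256,
    add_bos=True,
)
print(output)
```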