Update README.md
Browse files
README.md
CHANGED
|
@@ -12,7 +12,7 @@ tags:
|
|
| 12 |
|
| 13 |
This is a quick "down and dirty" demo, with full sampler settings (3) to augment operation of "Llama-3.3-70B-Instruct" at "IQ1_S" (ultra low bit).
|
| 14 |
|
| 15 |
-
(can also apply these using IQ1_M, IQ2 quants too AND use for any 70B model at low quant levels.)
|
| 16 |
|
| 17 |
This will allow you to load and run this model on a 16 GB video card fully, at 2048 ctx and achieve 13-15 t/s.
|
| 18 |
|
|
|
|
| 12 |
|
| 13 |
This is a quick "down and dirty" demo, with full sampler settings (3) to augment operation of "Llama-3.3-70B-Instruct" at "IQ1_S" (ultra low bit).
|
| 14 |
|
| 15 |
+
(can also apply these using IQ1_M, IQ2 quants too AND you can use these settings for any 70B model at low quant levels.)
|
| 16 |
|
| 17 |
This will allow you to load and run this model on a 16 GB video card fully, at 2048 ctx and achieve 13-15 t/s.
|
| 18 |
|