Update README.md
README.md
CHANGED
@@ -13,7 +13,7 @@ Exllamav3 quantizations of [Qwen/Qwen3-235B-A22B-Thinking-2507](https://huggingf

 [2.10 bpw h6](https://huggingface.co/MikeRoz/Qwen3-235B-A22B-Thinking-2507-exl3/tree/2.10bpw_H6) 59.287 GiB
 [2.80 bpw h6](https://huggingface.co/MikeRoz/Qwen3-235B-A22B-Thinking-2507-exl3/tree/2.80bpw_H6) 78.295 GiB
-[3.60 bpw h6](https://huggingface.co/MikeRoz/Qwen3-235B-A22B-Thinking-2507-exl3/tree/3.60bpw_H6) 100.116 GiB
+[3.60 bpw h6](https://huggingface.co/MikeRoz/Qwen3-235B-A22B-Thinking-2507-exl3/tree/3.60bpw_H6) 100.116 GiB
 [4.25 bpw h6](https://huggingface.co/MikeRoz/Qwen3-235B-A22B-Thinking-2507-exl3/tree/4.25bpw_H6) 117.803 GiB

 * The 2.10 bpw quant will fit in three 24 GB cards with 45k of context.
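
The fit note for the 2.10 bpw quant can be sanity-checked with simple arithmetic. The following is a minimal sketch and not part of the README: the helper names `headroom_gib` and `min_cards` are hypothetical, and it assumes each card exposes a full 24 GiB, which real hardware does not quite do. It only accounts for the listed weight sizes; the KV cache, activations, and runtime overhead all have to fit in whatever headroom remains.

```python
import math

# Weight sizes in GiB, copied from the README lines in the diff above.
QUANT_SIZES_GIB = {
    "2.10bpw_H6": 59.287,
    "2.80bpw_H6": 78.295,
    "3.60bpw_H6": 100.116,
    "4.25bpw_H6": 117.803,
}


def headroom_gib(quant_size_gib: float, num_cards: int, card_gib: float = 24.0) -> float:
    """VRAM left after loading the weights, in GiB (negative if they do not fit).

    Assumption: every card contributes its full nominal capacity.
    """
    return num_cards * card_gib - quant_size_gib


def min_cards(quant_size_gib: float, card_gib: float = 24.0) -> int:
    """Smallest number of cards whose combined capacity holds the weights alone."""
    return math.ceil(quant_size_gib / card_gib)


if __name__ == "__main__":
    for name, size in QUANT_SIZES_GIB.items():
        cards = min_cards(size)
        spare = headroom_gib(size, cards)
        print(f"{name}: at least {cards} x 24 GiB cards, {spare:.3f} GiB spare")
```

Under these assumptions, three cards leave roughly 12.7 GiB beyond the 2.10 bpw weights, and that remainder is what has to hold the 45k-token context and everything else.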