Update README.md
README.md
CHANGED
@@ -4,7 +4,7 @@ license: mit
 
 Converted [intfloat/multilingual-e5-large](https://huggingface.co/intfloat/multilingual-e5-large) model in onnx int8 format for use with [Vespa Embedding](https://docs.vespa.ai/en/embedding.html).
 
-If your CPU supports the avx-512 VNNI instruction set,
+If your CPU supports the avx-512 VNNI instruction set, `multilingual-e5-large-int8-avx512_vnni.onnx` will perform best.
 
 The model was quantized using the [optimum](https://github.com/huggingface/optimum) toolkit.
 
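
The added README line ties the choice of model file to CPU support for AVX-512 VNNI. A minimal sketch of how that check might look on Linux follows, assuming the kernel reports the feature as the `avx512_vnni` flag in `/proc/cpuinfo`. The helper names `has_avx512_vnni` and `pick_model` and the fallback file name are hypothetical; they are not part of this repository, Vespa, or optimum.

```python
from pathlib import Path

# File name taken from the README line added in this commit.
VNNI_MODEL = "multilingual-e5-large-int8-avx512_vnni.onnx"


def has_avx512_vnni(cpuinfo_text: str) -> bool:
    """Return True if a 'flags' line in /proc/cpuinfo-style text lists avx512_vnni."""
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        # Compare whole tokens so that 'avx_vnni' does not match by accident.
        if key.strip() == "flags" and "avx512_vnni" in value.split():
            return True
    return False


def pick_model(cpuinfo_text: str, fallback: str) -> str:
    """Pick the VNNI-specific file when the CPU supports it, else the caller's fallback."""
    return VNNI_MODEL if has_avx512_vnni(cpuinfo_text) else fallback


if __name__ == "__main__":
    cpuinfo = Path("/proc/cpuinfo")
    text = cpuinfo.read_text() if cpuinfo.exists() else ""
    # The fallback below is a placeholder; substitute whichever other model file you use.
    print(pick_model(text, fallback="<other-model-file>.onnx"))
```

On non-Linux systems `/proc/cpuinfo` does not exist, so this sketch always returns the fallback there; a different detection method would be needed.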