hotchpotch
/

vespa-onnx-intfloat-multilingual-e5-large

Feature Extraction

text-embeddings-inference

Model card Files Files and versions

hotchpotch commited on Mar 23, 2024

Commit

6e9ecc4

·

verified ·

1 Parent(s): 4c2dc50

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -4,7 +4,7 @@ license: mit
 Converted [intfloat/multilingual-e5-large](https://huggingface.co/intfloat/multilingual-e5-large) model in onnx int8 format for use with [Vespa Embedding](https://docs.vespa.ai/en/embedding.html).
-If your CPU supports the avx-512 VNNI instruction set, such as vespa cloud, `multilingual-e5-large-int8-avx512_vnni.onnx` will perform best.
 The model was quantized using the [optimum](https://github.com/huggingface/optimum)  toolkit.

 Converted [intfloat/multilingual-e5-large](https://huggingface.co/intfloat/multilingual-e5-large) model in onnx int8 format for use with [Vespa Embedding](https://docs.vespa.ai/en/embedding.html).
+If your CPU supports the avx-512 VNNI instruction set, `multilingual-e5-large-int8-avx512_vnni.onnx` will perform best.
 The model was quantized using the [optimum](https://github.com/huggingface/optimum)  toolkit.