arthurcollet/Qwen3-Embedding-4B-mlx-nvfp4
The Model arthurcollet/Qwen3-Embedding-4B-mlx-nvfp4 was converted to MLX format from Qwen/Qwen3-Embedding-4B using mlx-lm version 0.0.6.
Use with mlx
pip install mlx-embeddings
from mlx_embeddings import load, generate
import mlx.core as mx
model, tokenizer = load("arthurcollet/Qwen3-Embedding-4B-mlx-nvfp4")
# For text embeddings
output = generate(model, processor, texts=["I like grapes", "I like fruits"])
embeddings = output.text_embeds # Normalized embeddings
# Compute dot product between normalized embeddings
similarity_matrix = mx.matmul(embeddings, embeddings.T)
print("Similarity matrix between texts:")
print(similarity_matrix)
- Downloads last month
- 293
Model size
0.8B params
Tensor type
U8
路
U32 路
F16 路
Hardware compatibility
Log In to add your hardware
Quantized
Model tree for arthurcollet/Qwen3-Embedding-4B-mlx-nvfp4
Base model
Qwen/Qwen3-4B-Base