Visual Document Retrieval
Transformers
Safetensors
ColPali
sentence-transformers
multilingual
feature-extraction
vidore
multimodal-embedding
multilingual-embedding
Text-to-Visual Document (T→VD) retrieval
sentence-similarity
mteb
vllm
custom_code
🇪🇺 Region: EU
Instructions to use jinaai/jina-embeddings-v4 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use jinaai/jina-embeddings-v4 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("jinaai/jina-embeddings-v4", trust_remote_code=True, dtype="auto") - ColPali
How to use jinaai/jina-embeddings-v4 with ColPali:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- sentence-transformers
How to use jinaai/jina-embeddings-v4 with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("jinaai/jina-embeddings-v4", trust_remote_code=True) sentences = [ "The weather is lovely today.", "It's so sunny outside!", "He drove to the stadium." ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [3, 3] - Notebooks
- Google Colab
- Kaggle
feat-rename-vector-type-0622
#21
by nan - opened
- remove redundant
_vectorin the vector types - use Enum for the vector types.
nan changed pull request status to open
LGTM
Minor comment: instead of
single (torch.Tensor): Single-vector embeddings of shape (batch_size, dim).
multi (torch.Tensor): Multi-vector embeddings of shape (batch_size, num_tokens, dim).
it should be single_vec_emb and multi_vec_emb.
nan changed pull request status to merged