Integrate with Sentence Transformers v5.4

by tomaarsen HF Staff - opened Apr 16

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

+99

-17

tomaarsen

Apr 16

•

edited Apr 16

Hello!

Pull Request overview

Integrate this model with Sentence Transformers v5.4+ so it can be loaded via CrossEncoder("ContextualAI/ctxl-rerank-v2-instruct-multilingual-6b").

Details

This is the 6B sibling PR to https://huggingface.co/ContextualAI/ctxl-rerank-v2-instruct-multilingual-1b/discussions/2, with the same changes in place. There is one exception, however: the BOS token is emitted explicitly inside the chat template. Unlike the 1B and 2B, this model has "add_bos_token": true and "bos_token": "<s>", so I've included it in the chat template.

import torch
from sentence_transformers import CrossEncoder

model = CrossEncoder("ContextualAI/ctxl-rerank-v2-instruct-multilingual-6b", model_kwargs={"dtype": torch.bfloat16}, revision="refs/pr/1")

query = "What are the health benefits of exercise?"
instruction = "Prioritize recent medical research"
documents = [
    "Regular exercise reduces risk of heart disease and improves mental health.",
    "A 2024 study shows exercise enhances cognitive function in older adults.",
    "Ancient Greeks valued physical fitness for military training.",
]

pairs = [(query, doc) for doc in documents]
scores = model.predict(pairs, prompt=instruction)
print(scores)
# [ -4.6875   -2.171875 -12.4375  ]

rankings = model.rank(query, documents, prompt=instruction)
print(rankings)
# [{'corpus_id': 1, 'score': np.float32(-2.171875)}, {'corpus_id': 0, 'score': np.float32(-4.6875)}, {'corpus_id': 2, 'score': np.float32(-12.4375)}]

You can run this outright due to the revisionargument. After merging, the revision argument isn't needed anymore.

Note that none of the old behaviour is affected or changed: this only adds an additional way to run the model in a familiar and common format. The raw AutoModelForCausalLM and vLLM paths already documented in the README continue to work unchanged, and the Sentence Transformers path produces identical bfloat16 scores to those paths for every sample I tested (0.0 diff vs. the README's Transformers baseline on 3/3 examples, with and without an instruction).

Tom Aarsen

Integrate with Sentence Transformers v5.4a44964a2

tomaarsen changed pull request status to open Apr 16

sheshansh-ctx changed pull request status to merged 28 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment