Model Card for Model ID

This is a text Reranker model to score if a text is kindergarten-teacher style.

Model Details

Model Description

Useage

from transformers import AutoTokenizer, AutoModelForCausalLM

# Load tokenizer and reranker model (Note that you can also download the models and load with local path.)
tokenizer_reward = AutoTokenizer.from_pretrained("Miao025/Qwen-KinderChatbot-Reward")
reward_model = AutoModelForSequenceClassification.from_pretrained("Miao025/Qwen-KinderChatbot-Reward")

# For each prompt-response pair, get the score
inputs = tokenizer_reward(prompt, response, return_tensors="pt", truncation=True)
with torch.no_grad():
    logits = reward_model(**inputs).logits
    score = torch.softmax(logits, dim=-1)[0,1].item()

Training Data

[Training Dataset Card](to be add) Training process can be found on Github.

Contact

For any questions, please contact the author [email protected]

Downloads last month
7
Safetensors
Model size
67M params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Miao025/Qwen-KinderChatbot-Reward

Finetuned
(9178)
this model

Space using Miao025/Qwen-KinderChatbot-Reward 1