# RealGuardrails Models
This model was trained on the RealGuardrails dataset, an instruction-tuning dataset focused on improving system prompt adherence and precedence. Specifically, it was trained via SFT on the `simplemix` split (~150K examples) using our custom training library, torchllms, and then converted back to a transformers-compatible checkpoint.
## Training Hyperparameters
| Name | Value |
|------|-------|
| optimizer | AdamW |
| batch size | 128 |
| learning rate | 2e-5 |
| lr scheduler | cosine with 200 warmup steps |
| betas | (0.9, 0.999) |
| eps | 1e-8 |
| weight decay | 0 |
| epochs | 1 |
| max grad norm | 1.0 |
| precision | bf16 |
| max length | 4096 |
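The learning-rate schedule above can be sketched as follows. This is a minimal illustration, not the torchllms implementation: it assumes linear warmup over the 200 warmup steps followed by cosine decay to zero, with the peak learning rate of 2e-5 from the table. `total_steps` is an assumption here; with ~150K examples, a batch size of 128, and 1 epoch, it would be roughly 150000 / 128 ≈ 1172 optimizer steps.

```python
import math

# Values from the hyperparameter table above.
PEAK_LR = 2e-5
WARMUP_STEPS = 200

def lr_at(step: int, total_steps: int) -> float:
    """Learning rate at a given optimizer step: linear warmup, then cosine decay.

    This mirrors a common "cosine with warmup" schedule; the exact shape used
    by torchllms may differ (e.g., a nonzero floor or a different warmup curve).
    """
    if step < WARMUP_STEPS:
        # Linear ramp from 0 to the peak learning rate.
        return PEAK_LR * step / WARMUP_STEPS
    # Cosine decay from the peak down to 0 over the remaining steps.
    progress = (step - WARMUP_STEPS) / max(1, total_steps - WARMUP_STEPS)
    return PEAK_LR * 0.5 * (1.0 + math.cos(math.pi * progress))
```

At step 0 the rate is 0, at step 200 it reaches the 2e-5 peak, and it decays to 0 by the final step.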