Qwen2.5-7B-Instruct - OpenThoughts3

This model is the final checkpoint from the exp2199b_redo3 experiment, which fine-tunes Qwen2.5-7B-Instruct on the OpenThoughts3 dataset.

Model Details

Training Hyperparameters

Parameter Value
Epochs 5
Batch Size 512
Learning Rate 8e-5
Max Sequence Length 16384
LR Schedule Cosine
Warmup 10%
Decay 0.9
Weight Decay 0.0
Beta1 0.9
Beta2 0.999
Hardware TPU v4-512

Training Notes

  • Era shuffling enabled (dataset shuffled every epoch)
  • RoPE theta set to 1,000,000 for extended context support
  • Fixed Qwen2.5 chat template applied
Downloads last month
3
Safetensors
Model size
8B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for marin-community/qwen2.5-7b-instruct-openthoughts3

Base model

Qwen/Qwen2.5-7B
Finetuned
(3367)
this model

Dataset used to train marin-community/qwen2.5-7b-instruct-openthoughts3