ReWiz
Collection
The ReWiz series is fine-tuned on a subset of data drawn from three different data sets.
This is a fine-tune of Mistral 7B Instruct (v0.3). Half of the training data is geared towards better reasoning (EvolKit-20k and reasoning-base-20k); the other half is intended to de-censor the model (WizardLM data set).
This Mistral model was trained 2x faster with Unsloth and Hugging Face's TRL library.
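To make the 50/50 split concrete, here is a minimal sketch of how such a mix could be assembled. It is not the actual training pipeline: the `build_mix` helper, the toy records, and the choice to pool the two reasoning sets before sampling are all assumptions for illustration, since the card does not say how the reasoning half was divided between its two sources.

```python
import random


def build_mix(reasoning_sets, decensor_set, total, seed=0):
    """Return `total` examples: half from the reasoning sets, half from decensor_set.

    Hypothetical helper; the pooling strategy is an assumption.
    """
    rng = random.Random(seed)
    half = total // 2
    # Pool all reasoning sources, then draw half of the budget from the pool.
    reasoning_pool = [ex for ds in reasoning_sets for ex in ds]
    mix = rng.sample(reasoning_pool, half) + rng.sample(decensor_set, total - half)
    # Shuffle so the two halves are interleaved rather than concatenated.
    rng.shuffle(mix)
    return mix


# Toy stand-ins for the three data sets (made-up records, not real data).
evolkit = [{"source": "EvolKit-20k", "id": i} for i in range(100)]
reasoning_base = [{"source": "reasoning-base-20k", "id": i} for i in range(100)]
wizardlm = [{"source": "WizardLM", "id": i} for i in range(200)]

mix = build_mix([evolkit, reasoning_base], wizardlm, total=100)
```

With real data, the lists would be replaced by the loaded data sets; the sampling logic stays the same.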
Detailed results can be found here
| Metric | Value |
|---|---|
| Avg. | 17.54 |
| IFEval (0-Shot) | 40.48 |
| BBH (3-Shot) | 23.50 |
| MATH Lvl 5 (4-Shot) | 2.57 |
| GPQA (0-shot) | 3.36 |
| MuSR (0-shot) | 16.74 |
| MMLU-PRO (5-shot) | 18.56 |
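
As a quick sanity check, the `Avg.` row appears to be the unweighted mean of the six benchmark scores. That reading is an assumption, but the sketch below shows the numbers in the table are consistent with it.

```python
# Benchmark scores copied from the table above.
scores = {
    "IFEval (0-Shot)": 40.48,
    "BBH (3-Shot)": 23.50,
    "MATH Lvl 5 (4-Shot)": 2.57,
    "GPQA (0-shot)": 3.36,
    "MuSR (0-shot)": 16.74,
    "MMLU-PRO (5-shot)": 18.56,
}

# Unweighted mean across the six benchmarks (assumed definition of "Avg.").
avg = sum(scores.values()) / len(scores)
print(f"Average: {avg:.3f}")
```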
Base model: mistralai/Mistral-7B-v0.3