File size: 499 Bytes
ace7261
 
 
 
 
233d3b7
ace7261
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
---
license: cc-by-nc-4.0
---

A model made with curated synthetic data and then KTO'd on a small curated set. Total time to train was 4 H100 hours. 
I quite like the results this gave despite the dataset sizes involved. Its also a lot cheaper to iterate.
I plan to hand review the human data I've been using and slowly work that back into the datamix.
Additionally, planning to make a focused instruction following KTO set to improve system prompt adherance and steerability.


Use chatML and minP.