Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_PPO_0

Generated from Trainer

Qwen2-0.5B-Instruct_continual_data_debug_PPO_0 / eval_results.json

End of training

23c34e9 verified 7 months ago

57 Bytes

{
    "dataset": 0,
    "eval_score": 7.055242538452148
}