thwannbe/Llama-3.1-8B-Instruct-GSM8K-RLVR-Distill-Persona-Mixed • Text Generation • 8B • Updated about 4 hours ago
thwannbe/Llama-3.1-8B-Instruct-GSM8K-PO-Distill-Persona-Mixed • Text Generation • 8B • Updated about 4 hours ago
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr-Persona-Mixed • Text Generation • 8B • Updated about 4 hours ago
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Sft-Persona-Mixed • Text Generation • 8B • Updated 4 days ago • 22
thwannbe/Llama-3.1-8B-Instruct-GSM8K-GPT5-mini-Style-distill • Text Generation • 8B • Updated 4 days ago • 19