All checkpoints for our work "Language Imbalance Driven Rewarding for Multilingual Self-improving":
- James-WYang/LIDR_M0_Meta-Llama-3-8B-Instruct_en_es_ru_de_fr
- James-WYang/LIDR_M0_Meta-Llama-3-8B-Instruct_en_th_bn_sw
- James-WYang/LIDR_M0_Meta-Llama-3-8B-Instruct_translate_by_system_en_th_bn_sw
- James-WYang/LIDR_M0_Qwen2-7B-Instruct_en_es_ru_de_fr