abhranil14/Gemma2B_FF_on_qwen14B_wrong_2130_batch256_lr10e-6_warmup0.1_30_epoch_linear_lr Updated 6 days ago
abhranil14/Gemma2B_FF_on_qwen14B_wrong_2130_batch256_lr10e-6_warmup0.1_30_epoch_linear_lr Updated 6 days ago
abhranil14/Gemma_FF_on_Gemma27B_wrong_soln_wrt_human_1_soln_per_qs_6076_batch256_lr10e-6_warmup0.1 Updated 27 days ago
abhranil14/Gemma_FF_on_Gemma27B_wrong_soln_wrt_human_1_soln_per_qs_6076_batch256_lr10e-6_warmup0.1 Updated 27 days ago
abhranil14/Gemma2B_FF_on_qwen14B_gold_6158_batch256_lr10e-6_warmup0.1_10_epoch_linear_lr Updated 27 days ago
abhranil14/Gemma2B_FF_on_qwen14B_gold_6158_batch256_lr10e-6_warmup0.1_10_epoch_linear_lr Updated 27 days ago
abhranil14/Gemma2B_FF_on_gemma2B_self_distill_wrong_7044_batch256_lr10e-6_warmup0.1_10_epoch_linear_lr Updated Jun 26
abhranil14/Gemma2B_FF_on_gemma2B_self_distill_wrong_7044_batch256_lr10e-6_warmup0.1_10_epoch_linear_lr Updated Jun 26
abhranil14/Gemma2B_FF_on_gemma2B_self_distill_wrong_7044_batch256_lr10e-6_warmup0.1_10_epoch_linear_lr Updated Jun 26