adityasoni17/qwen_2_5_7b__sft__swe_bench_extra__16K__lora__r16_a16__adamw_linear_lr2e_4_epochs2_batchsize32 Updated 20 days ago
adityasoni17/qwen_2_5_7b__sft__swe_bench_extra__32K__lora__r16_a16__adamw_linear_lr2e_4_epochs2_batchsize32 Updated 20 days ago
genies-llm/text2sql-grpo-intermediate-reward-h100 Text Generation • 8B • Updated about 1 hour ago • 5