Harsh1729/R1-Distill-Llama-8B-SFT-cotroller_dataset-bespoke-17k-llama3-hehe-ta_and_ps-train-v2 Updated Aug 27 • 3
Harsh1729/R1-Distill-Llama-8B-SFT-cotroller_dataset-bespoke-17k-llama3-active_computation-train Text Generation • 1B • Updated Aug 27 • 5
Harsh1729/R1-Distill-Llama-8B-SFT-cotroller_dataset-bespoke-17k-llama3-uncertainty_management-train Text Generation • 1B • Updated Aug 26 • 4
Harsh1729/R1-Distill-Llama-8B-SFT-cotroller_dataset-bespoke-17k-llama3-plan_generation-train Text Generation • 1B • Updated Aug 26 • 6
Harsh1729/R1-Distill-Llama-8B-SFT-cotroller_dataset-bespoke_openthoughts-llama3-hehe-ta_ps_ct-v2 Text Generation • 8B • Updated Aug 26 • 1
Harsh1729/R1-Distill-Llama-8B-SFT-cotroller_dataset-bespoke-52k_all_cotif-w_partial_soln-w_change_of_thgt Text Generation • 8B • Updated Aug 16 • 4
Harsh1729/R1-8B-SFT-cotroller_dataset-bespoke-52k_all_cotif-v6-w_partial_soln-w_change_of_thgt Text Generation • 8B • Updated Aug 16 • 7
Harsh1729/R1-Distill-Llama-8B-SFT-cotroller_dataset-bespoke-35k_all_cotif-w_partial_soln Text Generation • 8B • Updated Aug 16 • 2
Harsh1729/R1-Distill-Llama-8B-SFT-cotroller_dataset-bespoke-17k-llama3-hehe-ta_and_ps Text Generation • 8B • Updated Aug 16 • 3
Harsh1729/R1-Distill-Llama-8B-SFT-cotroller_dataset-bespoke_52k_cotif-v6-mv2 Text Generation • 8B • Updated Aug 16 • 2
Harsh1729/R1-Distill-Llama-8B-SFT-cotroller_dataset-bespoke_52k_cotif-ood-v7 Text Generation • 8B • Updated Aug 16 • 1
Harsh1729/R1-Distill-Llama-8B-SFT-bespoke-52k_all_cotif-w_partial_sol-v6 Text Generation • 8B • Updated Jun 7