Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)
Hamish Ivison
hamishivi
AI & ML interests
NLP :)
Recent Activity
updated
a dataset
4 days ago
hamishivi/tulu_3_rewritten_400k_string_f1_only_v2_nocode_all_filtered_qwen2_5_openthoughts2
updated
a dataset
4 days ago
hamishivi/virtuoussy_multi_subject_rlvr
updated
a dataset
6 days ago
hamishivi/llama_nemotron_post_training_sft_science