mradermacher/SEOcrate-4B_grpo_new_01-GGUF • Reinforcement Learning • 4B • Updated 21 days ago • 2.41k • 1
unionai/pythia-1b-deduped-finetune-alpaca-cleaned • Text Generation • 1B • Updated Nov 8, 2023 • 14 • 1