SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 20 days ago • 587
Gemma 3n fully available in the open-source ecosystem! By ariG23498 and 7 others • Jun 26 • 112
How NuminaMath Won the 1st AIMO Progress Prize By yfleureau and 7 others • Jul 11, 2024 • 122
Preference Optimization for Vision Language Models By qgallouedec and 3 others • Jul 10, 2024 • 80
Patch Time Series Transformer in Hugging Face By namctin and 4 others • Feb 1, 2024 • 10
Preference Tuning LLMs with Direct Preference Optimization Methods By kashif and 4 others • Jan 18, 2024 • 68
Finetune Stable Diffusion Models with DDPO via TRL By metric-space and 3 others • Sep 29, 2023 • 16
Introducing Würstchen: Fast Diffusion for Image Generation By dome272 and 4 others • Sep 13, 2023 • 19
Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer) By elisim and 2 others • Jun 16, 2023 • 35
StackLLaMA: A hands-on guide to train LLaMA with RLHF By edbeeching and 6 others • Apr 5, 2023 • 41
Multivariate Probabilistic Time Series Forecasting with Informer By elisim and 2 others • Mar 10, 2023 • 21
Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU By edbeeching and 5 others • Mar 9, 2023 • 60
Probabilistic Time Series Forecasting with 🤗 Transformers By nielsr and 1 other • Dec 1, 2022 • 37