view article Article Fine-tuning Llama 2 70B using PyTorch FSDP By smangrul and 3 others β’ Sep 13, 2023 β’ 27
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA By ybelkada and 4 others β’ May 24, 2023 β’ 161
view article Article Introducing RWKV β An RNN with the advantages of a transformer By BlinkDL and 3 others β’ May 15, 2023 β’ 22
view article Article How π€ Accelerate runs very large models thanks to PyTorch By sgugger β’ Sep 27, 2022 β’ 12
view article Article Incredibly Fast BLOOM Inference with DeepSpeed and Accelerate By stas and 1 other β’ Sep 16, 2022 β’ 1
view article Article Accelerate Large Model Training using DeepSpeed By smangrul and 1 other β’ Jun 28, 2022 β’ 6
view article Article Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel By smangrul and 1 other β’ May 2, 2022 β’ 5