Article Enabling Long Context Training with Sequence Parallelism in Axolotl Apr 4, 2025 • 15
Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 Aug 8, 2025 • 90
Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate +2 Jun 13, 2024 • 61
Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models +1 Jun 24, 2024 • 205
phamkinhquoc2002/bge-base-financial-matryoshka_test Sentence Similarity • 0.1B • Updated Jun 10, 2024 • 5