Fast LoRA inference for Flux with Diffusers and PEFT By sayakpaul and 1 other • 5 days ago • 23
Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs By wenhuach and 8 others • Apr 29 • 37
Accelerating LLM Inference with TGI on Intel Gaudi By baptistecolle and 4 others • Mar 28 • 14
Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon By juliensimon and 8 others • May 9, 2024 • 12