Llms
Exponentially Faster Language Modelling Paper • 2311.10770 • Published Nov 15, 2023 • 119
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks Paper • 2311.07463 • Published Nov 13, 2023 • 15
Ops
Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers Paper • 2312.03694 • Published Dec 6, 2023 • 2
FaceStudio: Put Your Face Everywhere in Seconds Paper • 2312.02663 • Published Dec 5, 2023 • 33
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models Paper • 2402.07033 • Published Feb 10, 2024 • 17
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection Paper • 2403.03507 • Published Mar 6, 2024 • 189