view article Article Test-Driving the LLMD Inference Engine by ZML π By erikkaum β’ 12 days ago β’ 21
view article Article Automated Discovery of High-Performance GPU Kernels with OpenEvolve By codelion β’ Jun 27 β’ 21
H-Net Collection The family of hierarchical networks (H-Nets) from https://arxiv.org/abs/2507.07955 β’ 8 items β’ Updated 20 days ago β’ 18
OmniGEC Collection This is a collection of multilingual silver-standard datasets and models for the task of Grammatical Error Correction (GEC). β’ 8 items β’ Updated Apr 26 β’ 8
view article Article Boost Wav2Vec2 with n-gram LM in π€ Transformers By patrickvonplaten β’ Jan 12, 2022 β’ 12
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others β’ Mar 12 β’ 447
Gemma 3 Collection All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. β’ 50 items β’ Updated about 19 hours ago β’ 73
MT Quality Estimation Collection Models for reference-free quality estimation of machine translation β’ 10 items β’ Updated Jan 29 β’ 4
GTE models Collection General Text Embedding Models Released by Tongyi Lab of Alibaba Group β’ 21 items β’ Updated Jan 21 β’ 30
OWLS: Scaling Laws for Speech Recognition and Translation Collection π¦ A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. 16k sampling rate. β’ 8 items β’ Updated May 3 β’ 7
view article Article From Llasa to Llasagna π: Finetuning LLaSA to generates Italian speech and other languages By Steveeeeeeen and 1 other β’ Feb 11 β’ 31
NeMo Curator - Classifier Models Collection Classifier models that can be used in NeMo Curator for labelling/filtering datasets. β’ 11 items β’ Updated 9 days ago β’ 19
Ukrainian Text-to-Speech datasets Collection Five voices: Mykyta, Oleksa, Lada, Kateryna or Tetiana β’ 6 items β’ Updated Feb 26 β’ 4
Crimean Tatar Text-to-Speech datasets Collection Three voices: Abibullah, Sevil, or Arslan β’ 4 items β’ Updated May 27 β’ 2
Setting up the Data Printer with Improved English to Ukrainian Machine Translation Paper β’ 2404.15196 β’ Published Apr 23, 2024 β’ 1
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition Paper β’ 2310.06434 β’ Published Oct 10, 2023 β’ 4
AudioSR: Versatile Audio Super-resolution at Scale Paper β’ 2309.07314 β’ Published Sep 13, 2023 β’ 28