Models for Apple devices. See https://github.com/FluidInference/FluidAudio for usage details
This is a CoreML-optimized version of NVIDIA's Parakeet TDT 0.6B V2 model, designed for high-performance automatic speech recognition on Apple platforms.
The model has been converted to CoreML format for efficient on-device inference on Apple Silicon and iOS devices, enabling real-time speech recognition with a minimal memory footprint. Models will continue to evolve as we optimize performance and accuracy.
See the FluidAudio repository (https://github.com/FluidInference/FluidAudio) for usage instructions.
This model is released under the CC-BY-4.0 license. See the LICENSE file for details.
Acknowledgments
Based on NVIDIA's Parakeet TDT model. CoreML conversion and Swift integration by the FluidInference team.
Base model: nvidia/parakeet-tdt-0.6b-v2