YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
Meta Spirit LM: Interleaved Spoken and Written Language Model
This repository contains the model weights, inference code and evaluation scripts for the Spirit LM paper. You can find more generation samples on our demo page.
Spirit LM Model Overview

Installation Setup
Conda
conda env create -f env.yml
pip install -e '.[eval]'
Pip
pip install -e '.[eval]'
Dev
(Optionally, use only if you want to run the tests.)
pip install -e '.[dev]'
Checkpoints Setup
Quick Start
Speech Tokenization
See spiritlm/speech_tokenizer/README.md
Spirit LM Generation
Speech-Text Sentiment Preservation benchmark (STSP)
Model Card
More details of the model can be found in MODEL_CARD.md.
License
The present code is provided under the FAIR Noncommercial Research License found in LICENSE.
Citation
@misc{nguyen2024spiritlminterleavedspokenwritten,
title={SpiRit-LM: Interleaved Spoken and Written Language Model},
author={Tu Anh Nguyen and Benjamin Muller and Bokai Yu and Marta R. Costa-jussa and Maha Elbayad and Sravya Popuri and Paul-Ambroise Duquenne and Robin Algayres and Ruslan Mavlyutov and Itai Gat and Gabriel Synnaeve and Juan Pino and Benoit Sagot and Emmanuel Dupoux},
year={2024},
eprint={2402.05755},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2402.05755},
}
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support