Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Vikhrmodels
/
salt-qwen2.5-0.5b-asr
like
0
Follow
Vikhr models
441
Safetensors
openslr/librispeech_asr
amphion/Emilia-Dataset
English
qwen2
Model card
Files
Files and versions
xet
Community
Model Performance Overview
Our Solution
Resources
Model Performance Overview
Metrics
:
CER
: Character Error Rate (lower = better).
WER
: Word Error Rate (lower = better).
Model
CER
WER
SALT-asr
8.42
18.49
Our Solution
Method
: Extends a pre-trained LLM with audio tokens and fine-tunes on
ASR
task.
Audio tokenization
: SpeechTokenizer (semantic tokens only).
Resources
Code:
GitHub Repo
Downloads last month
4
Safetensors
Model size
495M params
Tensor type
F32
·
Chat template
Files info
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for
Vikhrmodels/salt-qwen2.5-0.5b-asr
Base model
Qwen/Qwen2.5-0.5B
Finetuned
(
359
)
this model
Datasets used to train
Vikhrmodels/salt-qwen2.5-0.5b-asr
amphion/Emilia-Dataset
Viewer
•
Updated
Feb 28
•
54.8M
•
64.9k
•
348
openslr/librispeech_asr
Viewer
•
Updated
5 days ago
•
585k
•
15.1k
•
159
Collection including
Vikhrmodels/salt-qwen2.5-0.5b-asr
SALT
Collection
3 items
•
Updated
25 days ago