ListenX Medium (GGUF) by Token AI

ListenX Medium is an advanced speech recognition model developed by Token AI, based on the Whisper Medium architecture.
It is optimized for accurate multilingual transcription and translation.

Model Overview

Model Name: ListenX Medium
Developer: Token AI
Format: GGUF
Architecture: Whisper Medium (modified and optimized by Token AI)
Primary Use: Speech-to-text and audio transcription
Supported Languages: English, Arabic, and multiple others
Release Year: 2025

This model was designed to achieve high transcription accuracy even in noisy environments, making it suitable for research, AI assistants, automated captioning, and call center systems.

Technical Details

Attribute	Description
Model Type	Encoder-decoder Transformer
Quantization	GGUF format for optimized CPU and GPU inference
Input	16 kHz mono audio waveform
Output	Transcribed or translated text
Training Data	Multilingual and domain-diverse speech datasets
Framework Compatibility	whisper.cpp, ctransformers, llama.cpp, and compatible backends
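
The input requirement in the table (16 kHz, mono) can be checked before a file is passed to the model. The following is a minimal sketch using only the Python standard library; the helper name `check_wav` is hypothetical and not part of any ListenX tooling, and it assumes the audio is an uncompressed PCM WAV file.

```python
import wave

# Values taken from the "Input" row of the table above.
EXPECTED_RATE_HZ = 16_000
EXPECTED_CHANNELS = 1  # mono


def check_wav(path):
    """Return a list of format problems; an empty list means the file matches.

    Hypothetical helper for illustration. It only reads the WAV header
    and does not resample or convert anything.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        channels = wav.getnchannels()
    if rate != EXPECTED_RATE_HZ:
        problems.append(f"sample rate is {rate} Hz, expected {EXPECTED_RATE_HZ} Hz")
    if channels != EXPECTED_CHANNELS:
        problems.append(f"file has {channels} channels, expected {EXPECTED_CHANNELS}")
    return problems
```

Calling `check_wav("samples/audio.wav")` returns an empty list when the file already matches; any reported problem means the audio should be resampled or downmixed with a tool of your choice first.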

Usage

1. Using `whisper.cpp`

Download the model file (model.gguf), place it in models/listenx-medium/, and run:

./main -m models/listenx-medium/model.gguf -f samples/audio.wav -otxt
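
For batch jobs, the same command can be driven from a script. This is a rough sketch, not an official wrapper: the function names `build_command` and `transcribe` are hypothetical, the binary name and paths are copied from the command above, and it assumes that `-otxt` writes the transcript next to the input file as `<audio file>.txt`, which is worth confirming against your whisper.cpp build.

```python
import subprocess
from pathlib import Path


def build_command(model_path, audio_path, binary="./main"):
    """Assemble the same argument list as the command shown above."""
    return [binary, "-m", str(model_path), "-f", str(audio_path), "-otxt"]


def transcribe(model_path, audio_path, binary="./main"):
    """Run whisper.cpp and return the transcript text.

    Hypothetical helper. Raises CalledProcessError if the binary exits
    with a non-zero status.
    """
    subprocess.run(build_command(model_path, audio_path, binary), check=True)
    # Assumption: the text output lands at "<audio_path>.txt".
    transcript_path = Path(str(audio_path) + ".txt")
    return transcript_path.read_text(encoding="utf-8")
```

A call such as `transcribe("models/listenx-medium/model.gguf", "samples/audio.wav")` would then return the transcript as a string, provided the binary and both files exist at those paths.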

Model Size: 0.8B params

Evaluation Results

Word error rate (WER) on multilingual-speech (self-reported): 5.200
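
For readers who want to reproduce a word error rate on their own transcripts, the sketch below shows the standard definition: word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. It is an illustration only. The source does not state how the reported figure was computed, whether it is a percentage, or what text normalization was applied, and the function name `word_error_rate` is hypothetical.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length.

    No normalization is applied here; casing and punctuation count as
    differences. That is an assumption of this sketch, not a statement
    about how the model was evaluated.
    """
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref_words)
```

As a worked case, comparing the reference "the cat sat on the mat" with the hypothesis "the cat sit on mat" gives one substitution and one deletion over six reference words, so the function returns 2/6.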