ListenX Logo

ListenX Medium (GGUF) by Token AI

ListenX Medium is an advanced speech recognition model developed by Token AI, based on the Whisper Medium architecture.
It is optimized for accurate multilingual transcription and translation, supporting high-quality performance across various speech recognition tasks.

Model Overview

  • Model Name: ListenX Medium
  • Developer: Token AI
  • Format: GGUF
  • Architecture: Whisper Medium (modified and optimized by Token AI)
  • Primary Use: Speech-to-text and audio transcription
  • Supported Languages: English, Arabic, and multiple others
  • Release Year: 2025

This model was designed to achieve high transcription accuracy even in noisy environments, making it suitable for research, AI assistants, automated captioning, and call center systems.

Technical Details

Attribute Description
Model Type Encoder-decoder Transformer
Quantization GGUF format for optimized CPU and GPU inference
Input 16kHz mono audio waveform
Output Transcribed or translated text
Training Data Multilingual and domain-diverse speech datasets
Framework Compatibility whisper.cpp, ctransformers, llama.cpp, and compatible backends

Usage

1. Using whisper.cpp

Download the model file (model.gguf) and run:

./main -m models/listenx-medium/model.gguf -f samples/audio.wav -otxt
Downloads last month
16
GGUF
Model size
0.8B params
Architecture
undefined
Hardware compatibility
Log In to view the estimation

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Evaluation results