Standalone ECAPA-TDNN x-vector speaker encoders extracted from Qwen3-TTS. 1024-dim (0.6B) and 2048-dim (1.7B).
-
marksverdhei/Qwen3-Voice-Embedding-12Hz-0.6B
Feature Extraction • Updated • 3.07k • 19 -
marksverdhei/Qwen3-Voice-Embedding-12Hz-1.7B
Feature Extraction • Updated • 9.87k • 23 -
marksverdhei/Qwen3-Voice-Embedding-12Hz-0.6B-onnx
Feature Extraction • Updated • 21 -
marksverdhei/Qwen3-Voice-Embedding-12Hz-1.7B-onnx
Feature Extraction • Updated • 3