Issues using model.safetensors
#59 opened 1 day ago
by
smartyshiv

[GZip] Bad GZip file error from using restore_from
#58 opened 8 days ago
by
arfang1121

close text postprocessing
#57 opened 14 days ago
by
Weiweier
Parakeet.js - Browser-based Parakeet TDT 0.6B Implementation
π
2
2
#56 opened 18 days ago
by
ysdede

Segmentation fault (core dumped)
2
#55 opened 21 days ago
by
zhouxinxin
Outputs words when silence is input to it. How to stop this?
#54 opened 23 days ago
by
polonuim210
Information on finetuning the model for new langauges!!
1
#53 opened 26 days ago
by
gude
Conversion to TensorRT?
#52 opened 28 days ago
by
shannu122
Word boosting for TDT models
π
1
#51 opened about 1 month ago
by
hoavu1234
how does the model handle timestamp decoding ?
π
1
#50 opened about 1 month ago
by
StephennFernandes

How can I align timesteps to text for Parakeet-tdt-0.6b-v2 output using KenLM?
3
#48 opened about 2 months ago
by
Nguyen667201

Poor WER when trying to fine-tune Parakeet v2 TDT to other dataset than English
2
#47 opened about 2 months ago
by
pronoobie

OutOfMemoryError: CUDA out of memory. on RTX A5000
1
#46 opened about 2 months ago
by
akskuchi

Recipes to Finetune to new Language example Hindi (Finally figured out)
β€οΈ
7
4
#45 opened about 2 months ago
by
pronoobie

how to load the model from local directory?
1
#44 opened 2 months ago
by
ALu7

Use this to fill web forms no typing (STT server)
β€οΈ
π₯
2
#43 opened 2 months ago
by
pronoobie

Real-Time Mic Transcription on free 2vCPU - using this model, check it out
β€οΈ
π
5
6
#41 opened 2 months ago
by
WJ88

How should I start word-level timestampοΌ
1
#40 opened 2 months ago
by
ppoudd
Speaker Diarization ??
π
2
#39 opened 2 months ago
by
vasanth5596
Word boosting / context biasing
8
#34 opened 2 months ago
by
hoavu1234
Why does using the same fastconformer_hybrid_tdt_ctc_bpe.yaml config to fine-tune pre-train model result in a "mismatch" error?
1
#33 opened 2 months ago
by
Nguyen667201

How can I get timestamps when using KenLM with the model?
1
#32 opened 2 months ago
by
Nguyen667201

parakeet as a local MCP server
β€οΈ
1
#31 opened 2 months ago
by
alexmnahas
Besides GPU, can any other edge accellerators run it. EX: (Hailo AI Hat for RPI)
4
#30 opened 2 months ago
by
Flyingcrabs
Is the model capable of splitting different speakers?
π
1
1
#29 opened 2 months ago
by
BigDeeper
How can I get the correct y_sequence format as I expect?
#27 opened 3 months ago
by
Nguyen667201

Model initialization
1
#22 opened 3 months ago
by
Homin
Bug report
π
1
2
#20 opened 3 months ago
by
JustinRocks
Is CUDA supported when running on Jetson Orin?
4
#19 opened 3 months ago
by
kikaitachi

Seeking a Clear Guide for Fine-Tuning NVIDIA NeMo Models on New English Audio Domains
1
#18 opened 3 months ago
by
jacktol

Only English is supported?
4
#17 opened 3 months ago
by
wangleineo
Does this model identifies speaker?
π
1
8
#16 opened 3 months ago
by
SouravAhmed

How can I transcribe an audio file thatβs longer than an hour when I have only 12β―GB of VRAM?
6
#15 opened 3 months ago
by
will1130
New Language Training
π
π₯
7
6
#11 opened 3 months ago
by
ali-amiri
What is the data format for training?
5
#10 opened 3 months ago
by
Nguyen667201

ONNX conversion
16
#9 opened 3 months ago
by
Berrisius

Ignores repeated words
2
#8 opened 3 months ago
by
HeadlessBandit

Finetuning With Custom Data Tutorial
2
#7 opened 3 months ago
by
SirCodesAlot
A German Version would be fantastic!
3
#6 opened 3 months ago
by
Buttermilk03
We clearly need a french version
π
2
1
#4 opened 3 months ago
by
Sarwg
Streaming?
β
8
12
#3 opened 3 months ago
by
pscar
Please do It for Japanese
π
1
4
#2 opened 3 months ago
by
riken12
Local Installation Video and Testing - Step by Step
π
3
1
#1 opened 3 months ago
by
fahdmirzac
