QuartzNet: Deep Automatic Speech Recognition with 1D Time-Channel Separable Convolutions Paper • 1910.10261 • Published Oct 22, 2019
Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition Paper • 2104.01721 • Published Apr 5, 2021
TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction Paper • 2104.08189 • Published Apr 16, 2021
Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition Paper • 2305.05084 • Published May 8, 2023
Efficient Sequence Transduction by Jointly Predicting Tokens and Durations Paper • 2304.06795 • Published Apr 13, 2023
TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context Paper • 2110.04410 • Published Oct 8, 2021
Stochastic Gradient Methods with Layer-wise Adaptive Moments for Training of Deep Networks Paper • 1905.11286 • Published May 27, 2019