EmergentTTS-Eval: Evaluating TTS Models on Complex Prosodic, Expressiveness, and Linguistic Challenges Using Model-as-a-Judge Paper • 2505.23009 • Published May 29 • 18
bosonai/higgs-audio-v2-generation-3B-base Text-to-Speech • 6B • Updated 1 day ago • 82.5k • 441
TokensGen: Harnessing Condensed Tokens for Long Video Generation Paper • 2507.15728 • Published 8 days ago • 6
view changelog Changelog Inference Providers now fully support OpenAI-compatible API 11 days ago • 67
Running on CPU Upgrade 96 96 Appoint Ready - MedGemma Demo 📋 Simulated Pre-visit Intake Demo built using MedGemma
VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild Paper • 2211.14758 • Published Nov 27, 2022 • 2
view article Article Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever. By MaziyarPanahi • 13 days ago • 124
PresentAgent: Multimodal Agent for Presentation Video Generation Paper • 2507.04036 • Published 24 days ago • 9