Generate web application code from descriptions
Generate audio for videos using captions and descriptions
Generate images from sketches
Start the camera and receive responses based on the video feed
Expressive zero-shot TTS
Transcribe audio to text in multiple languages