Generate images from text prompts
Convert text to natural-sounding speech audio
Generate speech from text using a reference voice