Generate edited video frames using text prompts
Transform images based on text instructions
Transcribe audio to text