Demo for MMaDA: Multimodal Large Diffusion Language Models
Demo for BAGEL
Generate text and speech responses from various inputs
4M: Massively Multimodal Masked Modeling