the most powerful vision-language model in the Qwen series to date. It is available in Dense and Mixture-of-Experts (MoE) architectures.