InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published Aug 25, 2025 • 211
InternVL3.5 Collection This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated Sep 28, 2025 • 104
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21, 2025 • 247
Running Featured 160 SmolVLM realtime WebGPU ⚡ 160 Start camera to get descriptions based on instructions
Running Featured 1.03k Can You Run It? LLM version 🚀 1.03k Determine GPU requirements for running large language models
Running Featured 365 Qwen2.5 Omni 7B Demo 🏆 365 Generate text and speech responses from text, audio, images, or video input
view article Article SmolVLM Grows Smaller – Introducing the 256M & 500M Models! +1 Jan 23, 2025 • 189
Runtime error Featured 53 Compare Siglip1 Siglip2 🚀 53 Compare SigLIP1 and SigLIP2 on zero shot classification