Models: 2,377

Active filters: multimodal

microsoft/Fara-7B

Image-Text-to-Text • 8B • Updated 6 days ago • 30.6k • 423

stepfun-ai/GELab-Zero-4B-preview

Image-to-Text • 4B • Updated 6 days ago • 648 • 89

jinaai/jina-vlm

Image-Text-to-Text • 2B • Updated 1 day ago • 1.03k • 14

Qwen/Qwen3-Omni-30B-A3B-Instruct

Any-to-Any • 35B • Updated Sep 22 • 278k • 743

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 3.44M • 1.38k

bytedance-research/Vidi-7B

9B • Updated 16 days ago • 405 • 8

ZJU-AI4H/Hulu-Med-4B

Image-Text-to-Text • 5B • Updated 10 days ago • 2.1k • 9

xuemduan/reevaluate-clip

0.4B • Updated 1 day ago • 107 • 6

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18 • 160k • 442

Kwai-Keye/Keye-VL-671B-A37B

Video-Text-to-Text • 672B • Updated 17 days ago • 124 • 17

jinaai/jina-clip-v2

Feature Extraction • 0.9B • Updated Apr 28 • 209k • 295

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • 4B • Updated Apr 6 • 7.96M • 566

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 137k • 1.83k

stepfun-ai/Step1X-Edit

Image-to-Image • Updated Jul 9 • 136 • 326

cpatonn/Qwen3-Omni-30B-A3B-Instruct-AWQ-4bit

Any-to-Any • 10B • Updated Sep 28 • 23.4k • 32

ZJU-AI4H/Hulu-Med-7B

Image-Text-to-Text • 8B • Updated 10 days ago • 7.21k • 46

ZJU-AI4H/Hulu-Med-14B

Image-Text-to-Text • 15B • Updated 10 days ago • 10.6k • 42

ByteDance/Dolphin-1.5

Image-Text-to-Text • 0.4B • Updated 25 days ago • 1.59k • 31

huihui-ai/Huihui-Fara-7B-abliterated

Image-Text-to-Text • 8B • Updated 11 days ago • 650 • 6

omlab/VLM-FO1_Qwen2.5-VL-3B-v01

Object Detection • 4B • Updated 9 days ago • 1.89k • 9

thesby/Qwen3-VL-8B-NSFW-Caption-V4.5

Image-to-Text • 9B • Updated 29 days ago • 14.7k • 40

Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Jan 12 • 2.02M • 472

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • 73B • Updated Jun 6 • 171k • 568

Mungert/Qwen2.5-VL-3B-Instruct-GGUF

Image-Text-to-Text • 3B • Updated Sep 24 • 17.5k • 25

Qwen/Qwen2.5-Omni-3B

Any-to-Any • 6B • Updated Apr 30 • 295k • 311

imageomics/bioclip-2

Zero-Shot Image Classification • Updated Oct 16 • 16.4k • 23

unsloth/Qwen2.5-Omni-7B-GGUF

Any-to-Any • 8B • Updated May 28 • 9.15k • 46

mispeech/midashenglm-7b-0804-fp32

Audio-Text-to-Text • 8B • Updated Oct 31 • 33.5k • 76

Qwen/Qwen3-Omni-30B-A3B-Captioner

Any-to-Any • 32B • Updated Sep 22 • 22.2k • 177

Qwen/Qwen3-Omni-30B-A3B-Thinking

Any-to-Any • 32B • Updated Sep 22 • 50.3k • 230