Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text, Transformers, Safetensors, English, qwen2_vl, image-to-text, multimodal, conversational, text-generation-inference
arxiv: 2409.12191
arxiv: 2308.12966
License: apache-2.0
a4b7c25
Qwen2-VL-7B-Instruct/requirements.txt
nbroad
f95af0f48139531e860db5b9a87e09e66e06c43c6909a2266bc6752b6807b211
c3a944b
verified
11 months ago
106 Bytes
qwen-vl-utils
git+https://github.com/huggingface/transformers.git@b99ca4d28b47fa7166e7882cb0695a5c0cc0d411