jojo-ai-mst/image-vision-cap
Image-to-Text
Transformers
Safetensors
vision-encoder-decoder
arxiv:1910.09700
Files and versions
image-vision-cap at revision 63499ef
957 MB, 1 contributor, History: 2 commits
Latest commit: Upload model (63499ef, verified), by jojo-ai-mst, over 1 year ago
File                    Size          Last commit     Updated
.gitattributes          1.52 kB       initial commit  over 1 year ago
README.md               5.17 kB       Upload model    over 1 year ago
config.json             4.91 kB       Upload model    over 1 year ago
generation_config.json  179 Bytes     Upload model    over 1 year ago
model.safetensors       957 MB (xet)  Upload model    over 1 year ago