nvidia/VILA-HD-8B-PS3-4K-SigLIP
Tags: Image-Text-to-Text, Safetensors, English, llava_topdown_llama, VLM, VILA-HD, PS3
arxiv: 2503.19903
arxiv: 2412.04468
License: cc-by-nc-sa-4.0
7575a20
VILA-HD-8B-PS3-4K-SigLIP/mm_projector (85.8 MB)
2 contributors
History: 1 commit
Latest commit: bfshi-nvidia, "Upload folder using huggingface_hub" (6a34849, verified, 6 months ago)
File | Size | Last commit | Updated
config.json | 259 Bytes | Upload folder using huggingface_hub | 6 months ago
model.safetensors | 85.8 MB (xet) | Upload folder using huggingface_hub | 6 months ago