GGUF Conversion & Quantization of OpenGVLab/InternVL3-2B (4-Bit Quantization)

This model was converted and quantized from OpenGVLab/InternVL3-2B using llama.cpp version 6217 (7a6e91ad).

All quants were made using the imatrix option with Bartowski's dataset.
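
The convert, imatrix, and quantize steps described above can be sketched roughly as follows. This is a minimal sketch, not the uploader's actual script: it only assembles command lines for llama.cpp's convert_hf_to_gguf.py, llama-imatrix, and llama-quantize tools and does not run them. The file names, the `out` directory, the `build_pipeline` helper, and the `Q4_K_M` quant type are assumptions for illustration; this card only states that the result is 4-bit.

```python
import shlex
from pathlib import Path


def build_pipeline(model_dir, calib_file, quant_type="Q4_K_M", work_dir="out"):
    """Return three llama.cpp commands as argument lists (nothing is executed).

    All paths and the default quant type are hypothetical placeholders.
    """
    work = Path(work_dir)
    f16 = (work / "model-f16.gguf").as_posix()               # unquantized GGUF
    imatrix = (work / "imatrix.dat").as_posix()              # importance matrix
    quant = (work / f"model-{quant_type}.gguf").as_posix()   # final quantized file
    return [
        # 1. Convert the Hugging Face checkpoint to an f16 GGUF file.
        ["python", "convert_hf_to_gguf.py", str(model_dir),
         "--outfile", f16, "--outtype", "f16"],
        # 2. Compute an importance matrix from a calibration text file.
        ["llama-imatrix", "-m", f16, "-f", str(calib_file), "-o", imatrix],
        # 3. Quantize the f16 file, guided by the importance matrix.
        ["llama-quantize", "--imatrix", imatrix, f16, quant, quant_type],
    ]


if __name__ == "__main__":
    # Print the commands for inspection instead of running them.
    for cmd in build_pipeline("InternVL3-2B", "calibration.txt"):
        print(shlex.join(cmd))
```

Printing the commands first makes it easy to check paths and the quant type before running anything; each list could then be passed to `subprocess.run(cmd, check=True)` from a llama.cpp checkout.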

Model Details

For more details about the model, see its original model card.

Format: GGUF
Model size: 2B params
Architecture: qwen2
Quantization: 4-bit


Model tree for Zoont/InternVL3-2B-4-Bit-GGUF-with-mmproj