llmware
/

gemma-3-4b-gguf

Model card Files Files and versions

gemma-3-4b-gguf

gemma-3-4b-gguf is an GGUF Q4_K quantized version of Google's Gemma-3-4B with Instruct Training (IT), providing a very fast, very small inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.

gemma-3-4b is a leading open source foundation model from Google.

Model Description

Developed by: Google
Quantized by: llmware
Model type: gemma-3
Parameters: 4 billion
Model Parent: google/gemma-4b-it
Language(s) (NLP): English
License: Apache 2.0
Uses: General purpose chat
RAG Benchmark Accuracy Score: NA
Quantization: int4

Model Card Contact

llmware on github

llmware website

Downloads last month: 77

GGUF

Model size

4B params

Architecture

gemma3

Hardware compatibility

Log In to view the estimation

4-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support