Phi 4

updated about 18 hours ago

Ampere's quantization formats (Q4_K_4 / Q8R16) require the Ampere-optimized build of llama.cpp, which is available as a Docker image here: https://hub.docker.com/r/amperecomputingai/llama.cpp
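
As a rough sketch of how the image behind that link might be fetched, the snippet below derives an image reference from the Docker Hub URL and builds the matching `docker pull` command. The `latest` tag and the helper names `image_from_hub_url` and `pull_command` are assumptions for illustration, not part of the collection page; check the Docker Hub page for the tags that are actually published.

```python
import shlex
from urllib.parse import urlparse

# URL taken from the note above.
DOCKER_HUB_URL = "https://hub.docker.com/r/amperecomputingai/llama.cpp"


def image_from_hub_url(url: str, tag: str = "latest") -> str:
    """Convert a Docker Hub repository URL (/r/<namespace>/<name>) to an image reference.

    The default tag "latest" is an assumption; the page does not name a tag.
    """
    parts = urlparse(url).path.strip("/").split("/")
    if len(parts) != 3 or parts[0] != "r":
        raise ValueError(f"not a Docker Hub repository URL: {url}")
    return f"{parts[1]}/{parts[2]}:{tag}"


def pull_command(url: str, tag: str = "latest") -> list[str]:
    """Build the argv for `docker pull` without running it."""
    return ["docker", "pull", image_from_hub_url(url, tag)]


if __name__ == "__main__":
    # Print the command so it can be reviewed before being run by hand.
    print(shlex.join(pull_command(DOCKER_HUB_URL)))
```

The command is only printed, not executed, so the tag can be confirmed before anything is pulled.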

AmpereComputing/phi-4-mini-instruct-gguf

4B • Updated 3 days ago • 19

AmpereComputing/phi-4-gguf

15B • Updated 3 days ago • 4