Active filters: torchao
gurro/llama-3.1-8B-torchao-int4wo-256
Text Generation
• Updated • 3
jerryzh168/llama3-8b-autoquant
Text Generation
• Updated • 3
medmekk/Llama-3.1-8B-Instruct-torchao-int8_weight_only
medmekk/Llama-3.1-8B-Instruct-torchao-int8wo
medmekk/Llama-3.1-8B-Instruct-torchao-int8da8w
medmekk/Llama-3.2-3B-Instruct-torchao-int8wo
medmekk/Llama-3.2-1B-torchao-int8wo
medmekk/Llama-3.2-1B-torchao-int8da8w
medmekk/Llama-3.2-3B-Instruct-torchao-int8da8w
medmekk/Llama-3.1-70B-Instruct-torchao-int8da8w
jerryzh168/Meta-Llama-3-8B-torchao-int8_weight_only
jerryzh168/Meta-Llama-3-8B-torchao-int4_weight_only-gs_128
jerryzh168/Meta-Llama-3-8B-torchao-int4_weight_only-gs_64
HF-Quantization/Llama-3.2-1B-TORCHAO-W8
HF-Quantization/Llama-3.2-1B-TORCHAO-W8A8
HF-Quantization/Llama-3.2-1B-TORCHAO-W4
HF-Quantization/Llama-3.3-70B-Instruct-TORCHAO-W4
jpablomch/Meta-Llama-3-8B-Instruct-torchao
Text Generation
• Updated • 1
jerryzh168/llama3-8b-int4wo-128
Text Generation
• Updated • 3
jerryzh168/llama3-8b-int8wo
Text Generation
• Updated • 17
alpindale/Meta-Llama-3-8B-torchao-int8_weight_only
Text Generation
• Updated • 2
drisspg/float8_dynamic_act_float8_weight-opt-125m
Text Generation
• Updated • 33
marksaroufim/Meta-Llama-3-8B-torchao-int8_weight_only
Text Generation
• Updated • 2
jerryzh168/gemma3-4b-it-float8dq
Image-Text-to-Text
• Updated • 7
vymenets/yv-llama-quantized
Text Generation
• Updated • 3