Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

RedHatAI
/

quantization

Model card Files Files and versions

1.09 GB

2 contributors

History: 31 commits

danieldk's picture

danieldk HF Staff

Build

32721c3 6 months ago

build
Build 6 months ago
compressed_tensors
Sync with vLLM 9 months ago
core
Sync with vLLM 9 months ago
cutlass_extensions
Sync with vLLM 9 months ago
cutlass_w8a8
Sync with vLLM 9 months ago
fp8
Sync with vLLM 9 months ago
gptq_marlin
Sync with vLLM 9 months ago
marlin
Add full Marlin support and tests for Marlin/CUTLASS 10 months ago
tests
Add full Marlin support and tests for Marlin/CUTLASS 10 months ago
torch-ext
Add support for ROCm 6 months ago
.gitattributes

1.56 kB

Build 10 months ago
LICENSE

11.4 kB

Add cutlass_w8a8 10 months ago
README.md

195 Bytes

Update README.md (#1) 7 months ago
build.toml

3.14 kB

Add support for ROCm 6 months ago
dispatch_utils.h

1.49 kB

Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm` 10 months ago
flake.lock

3.03 kB

Add support for ROCm 6 months ago
flake.nix

335 Bytes

Add support for ROCm 6 months ago
vectorization.cuh

778 Bytes

Sync with vLLM 9 months ago