Commit History

vulkan: Add bfloat16 support (llama/12554)
b21f8a1

jeffbolznv commited on

vulkan: workaround for AMD Windows driver 16 bit unpack8 bug (llama/12472)
417a5d6

Eve commited on

vulkan: matmul dequantization improvements (llama/12015)
ffdf466

Eve commited on

vulkan: initial support for IQ1_S and IQ1_M quantizations (llama/11528)
0d2e888

Rémy O commited on

vulkan: initial support for IQ4_XS quantization (llama/11501)
ed46ad5

Rémy O commited on

vulkan: implement initial support for IQ2 and IQ3 quantizations (llama/11360)
bd93c1b

Rémy Oudompheng jeffbolznv commited on

vulkan: small mul_mat_vec optimizations (llama/10665)
ec98109

Eve commited on

vulkan: further optimize mul_mat_vec using larger loads (llama/10387)
50a2978

jeffbolznv commited on

ggml : build backends as libraries (llama/10256)
3dc93f3

Diego Devesa ggerganov R0CKSTAR commited on