whisper.cpp / ggml-sycl.cpp

Commit History

ggml : group all experts in a single ggml_mul_mat_id (llama/6505)
f0b5c67

slaren ggerganov commited on

fix mul_mat_id() for new input, make the ut pass (llama/6682)
6d1ba81

Neo Zhang Jianyu commited on

fix memcpy() crash, add missed cmd in guide, fix softmax (llama/6622)
6901743

Neo Zhang Jianyu commited on

remove row=1 cond (llama/6532)
8499e3f
unverified

Abhilash Majumder commited on

support/fix OPs GGML_TYPE_IQ4_NL, GGML_TYPE_IQ4_XS, GGML_TYPE_IQ3_XXS, GGML_TYPE_IQ3_S, GGML_TYPE_IQ2_XXS, GGML_TYPE_IQ2_XS, GGML_TYPE_IQ2_S, GGML_TYPE_IQ1_S, GGML_TYPE_IQ1_M (llama/6521)
873102e
unverified

Neo Zhang Jianyu commited on

Fixed minor bug when enabling FP16 for non intel targets (llama/6464)
f84edd5
unverified

Ouadie EL FAROUKI AidanBeltonS commited on

Disable iqx on windows as WA (llama/6435)
7a97623
unverified

hengyu commited on

fix set main gpu crash (llama/6339)
3bdb5e6
unverified

Neo Zhang Jianyu commited on

sync : ggml (#2001)
cbbfa9e
unverified

ggerganov commited on

llama : add pipeline parallelism support (llama/6017)
b5bb3f3
unverified

slaren compilade ggerganov commited on

Update get version (llama/6025)
9a4e508
unverified

AidanBeltonS commited on

ggml : reuse quantum structs across backends (llama/5943)
bb0625f
unverified

ggerganov commited on

sycl : update IQ1_S kernels (WIP - not working!) (llama/5995)
16dc72c
unverified

ggerganov commited on

Add q3_s and q1_s (llama/5886)
2957823
unverified

Abhilash Majumder commited on

ggml : add ggml-common.h to deduplicate shared code (llama/5940)
0a37735
unverified

ggerganov commited on

Revert "[SYCL] fix error when set main gpu to non-zero (llama/5901)" (llama/5918)
d7e8525
unverified

Neo Zhang Jianyu commited on

fix error when set main gpu to non-zero (llama/5901)
829c347
unverified

Neo Zhang Jianyu commited on

add wait() to make code stable (llama/5895)
41c3c12
unverified

Neo Zhang Jianyu commited on

ggml : introduce ggml_status (ggml/750)
151c676
unverified

Michael Podvitskiy slaren ggerganov commited on

Support multiple GPUs (split mode) on SYCL backend (llama/5806)
b1865d2
unverified

Neo Zhang Jianyu commited on

Use batched mul_mat pathway (llama/5591)
4a30367
unverified

AidanBeltonS Abhilash Majumder commited on

Add support for soft_max ALiBi (llama/5639)
86d6a5e
unverified

AidanBeltonS Abhilash Majumder commited on

code : normalize enum names (llama/5697)
93e0830
unverified

ggerganov commited on

Introduce backend GUIDs (ggml/743)
a7eb9f6
unverified

UEXTM.com slaren commited on

conext add name (llama/5624)
3c39d4b
unverified

hengyu commited on

Update ggml_sycl_op_mul_mat_vec_q (llama/5502)
963ffd5
unverified

AidanBeltonS Abhilash Majumder commited on

ggml-sycl: Replace 3d ops with macro (llama/5458)
12970f1
unverified

Abhilash Majumder commited on

src : relocate new backend sources
44cd2d4
unverified

ggerganov commited on