Commits · natasa365/whisper.cpp

ggml : group all experts in a single ggml_mul_mat_id (llama/6505)

f0b5c67

slaren

ggerganov commited on Apr 18, 2024

fix mul_mat_id() for new input, make the ut pass (llama/6682)

6d1ba81

Neo Zhang Jianyu commited on Apr 15, 2024

fix memcpy() crash, add missed cmd in guide, fix softmax (llama/6622)

6901743

Neo Zhang Jianyu commited on Apr 14, 2024

remove row=1 cond (llama/6532)

8499e3f
unverified

Abhilash Majumder commited on Apr 8, 2024

support/fix OPs GGML_TYPE_IQ4_NL, GGML_TYPE_IQ4_XS, GGML_TYPE_IQ3_XXS, GGML_TYPE_IQ3_S, GGML_TYPE_IQ2_XXS, GGML_TYPE_IQ2_XS, GGML_TYPE_IQ2_S, GGML_TYPE_IQ1_S, GGML_TYPE_IQ1_M (llama/6521)

873102e
unverified

Neo Zhang Jianyu commited on Apr 7, 2024

Fixed minor bug when enabling FP16 for non intel targets (llama/6464)

f84edd5
unverified

Ouadie EL FAROUKI AidanBeltonS commited on Apr 5, 2024

Disable iqx on windows as WA (llama/6435)

7a97623
unverified

hengyu commited on Apr 3, 2024

fix set main gpu crash (llama/6339)

3bdb5e6
unverified

Neo Zhang Jianyu commited on Mar 28, 2024

sync : ggml (#2001)

cbbfa9e
unverified

ggerganov commited on Mar 27, 2024

llama : add pipeline parallelism support (llama/6017)

b5bb3f3
unverified

slaren

compilade

ggerganov commited on Mar 13, 2024

Update get version (llama/6025)

9a4e508
unverified

AidanBeltonS commited on Mar 13, 2024

ggml : reuse quantum structs across backends (llama/5943)

bb0625f
unverified

ggerganov commited on Mar 12, 2024

sycl : update IQ1_S kernels (WIP - not working!) (llama/5995)

16dc72c
unverified

ggerganov commited on Mar 12, 2024

Add q3_s and q1_s (llama/5886)

2957823
unverified

Abhilash Majumder commited on Mar 11, 2024

ggml : add ggml-common.h to deduplicate shared code (llama/5940)

0a37735
unverified

ggerganov commited on Mar 9, 2024

Revert "[SYCL] fix error when set main gpu to non-zero (llama/5901)" (llama/5918)

d7e8525
unverified

Neo Zhang Jianyu commited on Mar 7, 2024

fix error when set main gpu to non-zero (llama/5901)

829c347
unverified

Neo Zhang Jianyu commited on Mar 7, 2024

add wait() to make code stable (llama/5895)

41c3c12
unverified

Neo Zhang Jianyu commited on Mar 6, 2024

fix mul_mat fault in CI/unit-test (llama/5862)

91bb65e
unverified

Neo Zhang Jianyu

jinliangtao compilade

Cebtenzzre Xuan Son Nguyen

ggerganov Kawrakow

ikawrakow

Cebtenzzre Michael Podvitskiy

phymbert github-actions[bot] Nindaleth Black_Fox

iamlemec slaren

dranger003

leejet Minsoo Cheong Dane Madsen hutli

emozilla commited on Mar 5, 2024

ggml : introduce ggml_status (ggml/750)

151c676
unverified

Michael Podvitskiy slaren

ggerganov commited on Mar 4, 2024

Support multiple GPUs (split mode) on SYCL backend (llama/5806)

b1865d2
unverified

Neo Zhang Jianyu commited on Mar 2, 2024

Use batched mul_mat pathway (llama/5591)

4a30367
unverified

AidanBeltonS Abhilash Majumder commited on Mar 1, 2024

Add support for soft_max ALiBi (llama/5639)

86d6a5e
unverified

AidanBeltonS Abhilash Majumder commited on Feb 26, 2024

code : normalize enum names (llama/5697)

93e0830
unverified

ggerganov commited on Feb 25, 2024

Introduce backend GUIDs (ggml/743)

a7eb9f6
unverified

UEXTM.com slaren commited on Feb 24, 2024

conext add name (llama/5624)

3c39d4b
unverified

hengyu commited on Feb 21, 2024

Update ggml_sycl_op_mul_mat_vec_q (llama/5502)

963ffd5
unverified

AidanBeltonS Abhilash Majumder commited on Feb 20, 2024

ggml-sycl: Replace 3d ops with macro (llama/5458)

12970f1
unverified

Abhilash Majumder commited on Feb 12, 2024

src : relocate new backend sources

44cd2d4
unverified

ggerganov commited on Feb 10, 2024

Spaces:

natasa365
/

whisper.cpp

Running

Commit History

ggml : group all experts in a single ggml_mul_mat_id (llama/6505)

f0b5c67

fix mul_mat_id() for new input, make the ut pass (llama/6682)

6d1ba81

fix memcpy() crash, add missed cmd in guide, fix softmax (llama/6622)

6901743

remove row=1 cond (llama/6532)

8499e3f
unverified

support/fix OPs GGML_TYPE_IQ4_NL, GGML_TYPE_IQ4_XS, GGML_TYPE_IQ3_XXS, GGML_TYPE_IQ3_S, GGML_TYPE_IQ2_XXS, GGML_TYPE_IQ2_XS, GGML_TYPE_IQ2_S, GGML_TYPE_IQ1_S, GGML_TYPE_IQ1_M (llama/6521)

873102e
unverified

Fixed minor bug when enabling FP16 for non intel targets (llama/6464)

f84edd5
unverified

Disable iqx on windows as WA (llama/6435)

7a97623
unverified

fix set main gpu crash (llama/6339)

3bdb5e6
unverified

sync : ggml (#2001)

cbbfa9e
unverified

llama : add pipeline parallelism support (llama/6017)

b5bb3f3
unverified

Update get version (llama/6025)

9a4e508
unverified

ggml : reuse quantum structs across backends (llama/5943)

bb0625f
unverified

sycl : update IQ1_S kernels (WIP - not working!) (llama/5995)

16dc72c
unverified

Add q3_s and q1_s (llama/5886)

2957823
unverified

ggml : add ggml-common.h to deduplicate shared code (llama/5940)

0a37735
unverified

Revert "[SYCL] fix error when set main gpu to non-zero (llama/5901)" (llama/5918)

d7e8525
unverified

fix error when set main gpu to non-zero (llama/5901)

829c347
unverified

add wait() to make code stable (llama/5895)

41c3c12
unverified

fix mul_mat fault in CI/unit-test (llama/5862)

91bb65e
unverified

ggml : introduce ggml_status (ggml/750)

151c676
unverified

Support multiple GPUs (split mode) on SYCL backend (llama/5806)

b1865d2
unverified

Use batched mul_mat pathway (llama/5591)

4a30367
unverified

Add support for soft_max ALiBi (llama/5639)

86d6a5e
unverified

code : normalize enum names (llama/5697)

93e0830
unverified

Introduce backend GUIDs (ggml/743)

a7eb9f6
unverified

conext add name (llama/5624)

3c39d4b
unverified

Update ggml_sycl_op_mul_mat_vec_q (llama/5502)

963ffd5
unverified

ggml-sycl: Replace 3d ops with macro (llama/5458)

12970f1
unverified

src : relocate new backend sources

44cd2d4
unverified

Commit History

ggml : group all experts in a single ggml_mul_mat_id (llama/6505) f0b5c67

fix mul_mat_id() for new input, make the ut pass (llama/6682) 6d1ba81

fix memcpy() crash, add missed cmd in guide, fix softmax (llama/6622) 6901743

remove row=1 cond (llama/6532) 8499e3f unverified

support/fix OPs GGML_TYPE_IQ4_NL, GGML_TYPE_IQ4_XS, GGML_TYPE_IQ3_XXS, GGML_TYPE_IQ3_S, GGML_TYPE_IQ2_XXS, GGML_TYPE_IQ2_XS, GGML_TYPE_IQ2_S, GGML_TYPE_IQ1_S, GGML_TYPE_IQ1_M (llama/6521) 873102e unverified

Fixed minor bug when enabling FP16 for non intel targets (llama/6464) f84edd5 unverified

Disable iqx on windows as WA (llama/6435) 7a97623 unverified

fix set main gpu crash (llama/6339) 3bdb5e6 unverified

sync : ggml (#2001) cbbfa9e unverified

llama : add pipeline parallelism support (llama/6017) b5bb3f3 unverified

Update get version (llama/6025) 9a4e508 unverified

ggml : reuse quantum structs across backends (llama/5943) bb0625f unverified

sycl : update IQ1_S kernels (WIP - not working!) (llama/5995) 16dc72c unverified

Add q3_s and q1_s (llama/5886) 2957823 unverified

ggml : add ggml-common.h to deduplicate shared code (llama/5940) 0a37735 unverified

Revert "[SYCL] fix error when set main gpu to non-zero (llama/5901)" (llama/5918) d7e8525 unverified

fix error when set main gpu to non-zero (llama/5901) 829c347 unverified

add wait() to make code stable (llama/5895) 41c3c12 unverified

fix mul_mat fault in CI/unit-test (llama/5862) 91bb65e unverified

ggml : introduce ggml_status (ggml/750) 151c676 unverified

Support multiple GPUs (split mode) on SYCL backend (llama/5806) b1865d2 unverified

Use batched mul_mat pathway (llama/5591) 4a30367 unverified

Add support for soft_max ALiBi (llama/5639) 86d6a5e unverified

code : normalize enum names (llama/5697) 93e0830 unverified

Introduce backend GUIDs (ggml/743) a7eb9f6 unverified

conext add name (llama/5624) 3c39d4b unverified

Update ggml_sycl_op_mul_mat_vec_q (llama/5502) 963ffd5 unverified

ggml-sycl: Replace 3d ops with macro (llama/5458) 12970f1 unverified

src : relocate new backend sources 44cd2d4 unverified

ggml : group all experts in a single ggml_mul_mat_id (llama/6505)

f0b5c67

fix mul_mat_id() for new input, make the ut pass (llama/6682)

6d1ba81

fix memcpy() crash, add missed cmd in guide, fix softmax (llama/6622)

6901743

remove row=1 cond (llama/6532)

8499e3f
unverified

support/fix OPs GGML_TYPE_IQ4_NL, GGML_TYPE_IQ4_XS, GGML_TYPE_IQ3_XXS, GGML_TYPE_IQ3_S, GGML_TYPE_IQ2_XXS, GGML_TYPE_IQ2_XS, GGML_TYPE_IQ2_S, GGML_TYPE_IQ1_S, GGML_TYPE_IQ1_M (llama/6521)

873102e
unverified

Fixed minor bug when enabling FP16 for non intel targets (llama/6464)

f84edd5
unverified

Disable iqx on windows as WA (llama/6435)

7a97623
unverified

fix set main gpu crash (llama/6339)

3bdb5e6
unverified

sync : ggml (#2001)

cbbfa9e
unverified

llama : add pipeline parallelism support (llama/6017)

b5bb3f3
unverified

Update get version (llama/6025)

9a4e508
unverified

ggml : reuse quantum structs across backends (llama/5943)

bb0625f
unverified

sycl : update IQ1_S kernels (WIP - not working!) (llama/5995)

16dc72c
unverified

Add q3_s and q1_s (llama/5886)

2957823
unverified

ggml : add ggml-common.h to deduplicate shared code (llama/5940)

0a37735
unverified

Revert "[SYCL] fix error when set main gpu to non-zero (llama/5901)" (llama/5918)

d7e8525
unverified

fix error when set main gpu to non-zero (llama/5901)

829c347
unverified

add wait() to make code stable (llama/5895)

41c3c12
unverified

fix mul_mat fault in CI/unit-test (llama/5862)

91bb65e
unverified

ggml : introduce ggml_status (ggml/750)

151c676
unverified

Support multiple GPUs (split mode) on SYCL backend (llama/5806)

b1865d2
unverified

Use batched mul_mat pathway (llama/5591)

4a30367
unverified

Add support for soft_max ALiBi (llama/5639)

86d6a5e
unverified

code : normalize enum names (llama/5697)

93e0830
unverified

Introduce backend GUIDs (ggml/743)

a7eb9f6
unverified

conext add name (llama/5624)

3c39d4b
unverified

Update ggml_sycl_op_mul_mat_vec_q (llama/5502)

963ffd5
unverified

ggml-sycl: Replace 3d ops with macro (llama/5458)

12970f1
unverified

src : relocate new backend sources

44cd2d4
unverified