Commit History
whisper : support GGML_BACKEND_DL (#2843)
2e6437e
common : separate whisper sources (#2846)
0447b9d
common : fix build min/max (#2845)
07533a2
examples : use miniaudio for direct decoding flac, mp3, ogg and wav (#2759)
7a280a4
Dmitry Atamanov
stream : stop on ^C when no audio is received (#2822)
45399ad
Petter Reinholdtsen
sync : ggml
7926873
Support pure float16 add/sub/mul/div operations in the CUDA (and CPU) backend (ggml/1121)
2b94a24
cmdr2
opencl: fix for small models (llama/11950)
4532dc6
lhez
Shawn Gu
Skyler Szot
Optimize mul_mat for Q4_0 on Intel GPU (llama/12035)
14fd317
Neo Zhang Jianyu
arthw
SYCL: Fix GGML_SYCL_DEBUG macro (llama/11995)
310a36c
ggml-cpu: Support s390x SIMD Instruction Set (llama/12019)
4aa54ec
Aaron Teo
Jinyang He
junchao-zhao
CUDA: app option to compile without FlashAttention (llama/12025)
fbc5f16
CUDA: optimize FA for GQA + large batches (llama/12014)
6662d54
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (llama/12000)
6cb8158
CUDA: correct the lowest Maxwell supported by CUDA 12 (llama/11984)
6641178
MUSA: support ARM64 and enable dp4a .etc (llama/11843)
ab96dac
Bodhi Hu
ggml-cpu: Add CPU backend support for KleidiAI library (llama/11390)
9de6d81
Charles Xu
ggml: aarch64: implement SVE kernels for q3_K_q8_K vector dot (llama/11917)
1a1acd2
CUDA: use async data loading for FlashAttention (llama/11894)
5b9980d
vulkan: implement several ops relevant for ggml_opt (llama/11769)
3c2171d
Rémy O
vulkan: support multi/vision rope, and noncontiguous rope (llama/11902)
1c7a669
metal : fix the crash caused by the lack of residency set support on Intel Macs. (llama/11904)
afbd891
Hale Chan
metal : optimize dequant q6_K kernel (llama/11892)
376cbe6
Adrian Kretz
repo : update links to new url (llama/11886)
9705bb5
vulkan: initial support for IQ1_S and IQ1_M quantizations (llama/11528)
0d2e888
Rémy O
opencl: Fix rope and softmax (llama/11833)
bf3b6f8
lhez
cuda : add ampere to the list of default architectures (llama/11870)
1d19dec
Diego Devesa
ggml: optimize some vec dot functions for LoongArch ASX (llama/11842)
e3acbfc
Jinyang He
vulkan: linux builds + small subgroup size fixes (llama/11767)
e3f0e78
Eve
llamafile: use member variable instead of constant for iq4nlt (llama/11780)
0cb2d04
musa: bump MUSA SDK version to rc3.1.1 (llama/11822)
ff2d3eb
R0CKSTAR
ggml-cpu : add chunking support to mul_mat_id (llama/11666)
e59d9a7
Diego Devesa
ggml : x2 speed for WASM by optimizing SIMD (llama/11453)
464a186
Xuan-Son Nguyen
camel-cdr
HIP: Remove GCN from list of devices that avoid MMQ (llama/11831)
78aed55
uvos
HIP: Switch to std::vector in rocblas version check (llama/11820)
e144c94
uvos
cleanup: fix compile warnings associated with gnu_printf (llama/11811)
ef6a968
bandoti
ggml : fix multi-threaded clamp_f32 (llama/11824)
1b1d6a8
Richard
ggml-cpu: Fix duplicate MATMUL_INT8 (llama/11817)
05b9e78
CUDA: fix CUDART_VERSION checks (llama/11821)
04f123a
Fix #11802: Compile bug - RegQueryValueExA changed to RegQueryValueEx (llama/11803)
86969ac
Sheldon Robinson
CUDA: use arch list for compatibility check (llama/11775)
b88e163
fix: typos in documentation files (llama/11791)
5c6d350
Maxim Evtush
vulkan: Make Vulkan optional at runtime (ggml/11493). (llama/11494)
762f497
vulkan: add environment variable GGML_VK_PREFER_HOST_MEMORY to avoid VRAM allocation (llama/11592)
f9fd130
Wagner Bruna
vulkan: account for lookup tables when checking shared memory size (llama/11502)
758970f
ggml: Fix data race in ggml threadpool (llama/11736)
5554d5f
Karol Kontny