Commits · natasa365/whisper.cpp

CUDA/HIP: optimize mmv paths taken for HIP devices (llama/14324)

1a9d2d3

uvos, JohannesGaessler committed on Jun 23, 2025

CUDA: mul_mat_v support for batch sizes > 1 (llama/14262)

2d1e6e7

JohannesGaessler committed on Jun 23, 2025

HIP: enable vec fattn on RDNA4 (llama/14323)

b6dc6a1

uvos committed on Jun 22, 2025

CUDA: add mean operation (llama/14313)

7cee55b

am17an committed on Jun 22, 2025

Add support for VK_EXT_debug_utils to add labels to Vulkan objects. (llama/13792)

2c3741a

Markus Tavenrath committed on Jun 21, 2025

metal : fix thread-safety (llama/14300)

2bd85b6

ggerganov committed on Jun 21, 2025

ggml-cpu : "align corners" for bilinear upscale/downscale (ggml/1285)

88e7829

Acly committed on Jul 1, 2025

ggml-quants : rename best_mad to best_error (ggml/1283)

cd9270f

danbev committed on Jun 24, 2025

ci : use selective copy for musa image (#3296)

e11fac2
unverified

danbev committed on Jun 27, 2025

ci: set fail-fast to false in docker.yml (#3294)

fc20738
unverified

danbev committed on Jun 27, 2025

ruby : add Whisper::VERSION (#3292)

6468892
unverified

KitaitiMakoto committed on Jun 27, 2025

whisper : add version function (#3289)

0b952d7
unverified

danbev committed on Jun 26, 2025

ci : add should_release variable (#3288)

7272681
unverified

danbev committed on Jun 26, 2025

docs : add cmake "-j" flag in README.md (#3284)

5fe3033
unverified

toboil-features committed on Jun 26, 2025

ci : add support for tag-based releases (#3287)

f21cf37
unverified

danbev committed on Jun 25, 2025

release : v1.7.6

5cade6e
unverified

ggerganov committed on Jun 25, 2025

bench : update benches

d4f72cd
unverified

ggerganov committed on Jun 25, 2025

bench : print system info before ctx check

835c3e8
unverified

ggerganov committed on Jun 25, 2025

stream : add nullptr check of whisper_context (#3283)

9f0c009
unverified

danbev committed on Jun 25, 2025

ci : enable main-cuda build (#3282)

9bee7f3
unverified

danbev committed on Jun 25, 2025

bindings.java : update java example (#3281)

f001158
unverified

Joas Dev committed on Jun 25, 2025

coreml : backport CoreML features to macos < 14 (#3255)

dc0917f
unverified

glaszig committed on Jun 24, 2025

ci : reduce musa image size (#3277)

a45c78b
unverified

danbev committed on Jun 24, 2025

whisper : add .gitignore entries for OpenVINO support (#3276)

ca0545e
unverified

Yukimasa Funaoka committed on Jun 24, 2025

command: output commands to text file (#3273)

a482bd7
unverified

Aaron Ang committed on Jun 24, 2025

ci : add apt-get clean to musa Dockerfile (#3275)

32a61ec
unverified

danbev committed on Jun 23, 2025

ruby : specify Apple frameworks explicitly on build (#3270)

728defc
unverified

KitaitiMakoto committed on Jun 23, 2025

talk-llama : sync llama.cpp

ade9bc3

ggerganov committed on Jun 20, 2025

sync : ggml

48a7292

ggerganov committed on Jun 20, 2025

CUDA: add conv_2d_transpose (llama/14287)

a728b83

am17an committed on Jun 20, 2025

sycl: add usage of enqueue_functions extension (llama/14244)

2e59a96

Nicolò Scipione committed on Jun 20, 2025

Implement GGML_CPU_ALL_VARIANTS for PowerPC (llama/14286)

0bcd751

Christian Kastner, Diego Devesa committed on Jun 20, 2025

cuda : synchronize graph capture and cublas handle destruction (llama/14288)

39c4fa5

Diego Devesa committed on Jun 20, 2025

ggml : fix repack work size for mul_mat_id (llama/14292)

4b0d2de

ggerganov committed on Jun 20, 2025

ggml: Update KleidiAI to v1.9.0 (llama/14277)

90ccf35

Charles Xu committed on Jun 20, 2025

CUDA: add conv_2d_dw (llama/14265)

5cca3ec

am17an committed on Jun 20, 2025

ggml-cpu : remove unnecesary arm feature detection (llama/14281)

62cf694

Diego Devesa committed on Jun 19, 2025

build : suppress gcc15 compile warnings (llama/14261)

0454008

fanyang committed on Jun 19, 2025

sycl: Cleanup codepaths in Get Rows in sycl backend (llama/14215)

feee739

Anton Mitkov committed on Jun 19, 2025

llamafile : support s390x SIMD instruction set (llama/14273)

26bafb6

taronaeo committed on Jun 19, 2025

Vulkan: Set device max size for host memory to avoid OOM warning and fallback to CPU buffer (llama/14249)

08debcd

OccamRazor committed on Jun 19, 2025

metal : add mean kernel (llama/14267)

a726ecc

ggerganov committed on Jun 19, 2025

ggml-cpu: reduce asm calls for hsum (llama/14037)

17c0dfa

taronaeo committed on Jun 18, 2025

ggml-cpu: fix uncaught underscore terminators (llama/14023)

c005248

taronaeo committed on Jun 18, 2025

ggml: Add Apple support for GGML_CPU_ALL_VARIANTS (llama/14258)

9d1d21b

Charles Xu committed on Jun 18, 2025

Add `ggml_roll` (ggml/1274)

71923e5

Acly committed on Jun 18, 2025

android : update CMakeLists.txt to use FetchContent for ggml (#3268)

e5d47d0
unverified

danbev committed on Jun 19, 2025

cmake : fix android build (#3265)

e70bf99
unverified

ggerganov, danbev committed on Jun 19, 2025

examples : add stereo to mono conversion in read_audio_data (#3266)

5451562
unverified

danbev committed on Jun 18, 2025

talk-llama : sync llama.cpp

fc04dc0

ggerganov committed on Jun 18, 2025
