Commit History
ggml: fix compile error for RISC-V (llama/8623) 4eec44b
Mark Zhuang committed on
CUDA: MMQ code deduplication + iquant support (llama/8495) 6d14124
gguf : handle null name during init (llama/8587) 2f95156
ggml : fix quant dot product with odd number of blocks (llama/8549) 0083f96
ggml : add friendlier error message to fopen errors (llama/8575) ab5b4e0
CUDA: fix partial offloading for ne0 % 256 != 0 (llama/8572) afc137c
Add Ascend NPU backend (llama/6035) 3175a17
make/cmake: add missing force MMQ/cuBLAS for HIP (llama/8515) 5096c91
Refactor lora adapter support (llama/8332) 76bcfc6
add concat through dim 1/2 (llama/8483) acf23d9
Vulkan MMQ Fix (llama/8479) e2989d0
vulkan : cmake integration (llama/8119) a094e22
bandoti committed on
metal : template-ify some of the kernels (llama/8447) 3c3094f
ggml : minor naming changes (llama/8433) e0c6dff
ggml : add NVPL BLAS support (ggml/8329) (llama/8425) 4816a87
cuda : suppress 'noreturn' warn in no_device_code (llama/8414) 13c1163
CUDA: optimize and refactor MMQ (llama/8416) a3fe534
Use multi_ptr to clean up deprecated warnings (llama/8256) 6dbe297
AidanBeltonS committed on
ggml : move sgemm sources to llamafile subfolder (llama/8394) 1554348
ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (llama/5780) 9509586
Dibakar Gope committed on
sycl : Reenabled mmvq path for the SYCL Nvidia Backend (llama/8372) b969571
Alberto Cabrera Pérez committed on
sycl : fix powf call in device code (llama/8368) 011fbfd
Alberto Cabrera Pérez committed on
ggml : loop tiling optimizations for scalar path (ggml/898) 1c4b0ca
Mahesh Madhav committed on
ggml: add support for float16 input tensors in pooling operations (ggml/895) 8248d8e
Ivan Filipov (vanaka11) committed on
vulkan : initialize vk_buffer_struct members to VK_NULL_HANDLE (ggml/893) 8c409e3
Tony Wasserka committed on
whisper : use vulkan as gpu backend when available (#2302) 0755fa0
Matt Stephenson committed on
ggml : sync sycl (skip) (#0) bf6ccee
ggml : remove unnecessary UNUSED macro call (ggml/880) ab9a7d0
cmake : add GGML_BUILD and GGML_SHARED macro definitions (llama/8281) a8f9bda
Enabled more data types for oneMKL gemm_batch (llama/8236) 08501f8
Ouadie EL FAROUKI committed on
CUDA: MMQ support for iq4_nl, iq4_xs (llama/8278) 8411e3c
CUDA: revert part of the RDNA1 optimizations (llama/8309) fcd0c52
Daniele committed on
CUDA: fix MMQ stream-k rounding if ne00 % 128 != 0 (llama/8311) 04d4209
Fix WARP_SIZE=16 bug of Intel GPU (llama/8266) 1ce11e2
rm get_work_group_size() by local cache for performance (llama/8286) 08fd758
Neo Zhang Jianyu (arthw) committed on
Define and optimize RDNA1 (llama/8085) 6aa5a89
Daniele committed on
fix typo (llama/8267) 0c9c7c8
Judd committed on
Removes multiple newlines at the end of files that is breaking the editorconfig step of CI. (llama/8258) cc49462
cuda : update supports_op for matrix multiplication (llama/8245) 2314334
slaren committed on
Fix win build conflict of math library (llama/8230) 5a33963
Fix the sub group size of Intel (llama/8106) 2dd429e
CUDA: refactor and optimize IQ MMVQ (llama/8215) afa1447
Update SYCL-Rope op and Refactor (llama/8157) 06acee2
CUDA: fix MMQ stream-k for --split-mode row (llama/8167) ef3d018
feat: cuda implementation for `ggml_conv_transpose_1d` (ggml/854) 025493b
John Balis, slaren committed on
ggml : add GGML_CUDA_USE_GRAPHS option, restore GGML_CUDA_FORCE_CUBLAS (cmake) (llama/8140) e83fdad
slaren committed on