sycl: fixed semantics of block offset calculation (llama/14814) d3d52a4 Alberto Cabrera Pérez committed on Jul 24, 2025
metal : fix fusion across different encoders (llama/14849) 17d67da ggerganov HF Staff committed on Jul 24, 2025
sycl: fix undefined variable in work group size check (llama/14843) bcbbf47 Donghyeon Jeong committed on Jul 24, 2025
CUDA: fix overflow in FA, tune performance (llama/14840) 10ac92f JohannesGaessler committed on Jul 23, 2025
CUDA: fix compilation with GGML_CUDA_F16 (llama/14837) 2746afd JohannesGaessler committed on Jul 23, 2025
CUDA: fix quantized KV cache + multiple sequences (llama/14822) 88864af JohannesGaessler ggerganov HF Staff committed on Jul 23, 2025
ggml: fix loongarch quantize_row_q8_1 error (llama/14827) 0bd2be3 lixing-star committed on Jul 23, 2025
vulkan: fix rms_norm_mul to handle broadcasting dim0 (llama/14817) 0c16b60 jeffbolznv committed on Jul 22, 2025
cuda : implement bf16 cpy ops and enable bf16 cont (llama/14763) b54b644 Sigbjørn Skjæret committed on Jul 22, 2025
ggml: adds CONV_2D op and direct GEMM Vulkan implementation (llama/14316) 5885084 etasnadi committed on Jul 19, 2025
vulkan: Add logging for bf16 features to ggml_vk_print_gpu_info (#13274) (llama/14707) 0855a18 Peter0x44 committed on Jul 19, 2025
Vulkan: Fix fprintf format-security warning (llama/14770) 77a1c11 OccamRazor committed on Jul 19, 2025
Support static xcframework packaging in build-xcframework.sh (#3322) 78de49d Rich Waters danbev committed on Jul 26, 2025
examples : add note about WHISPER_WASM_SINGLE_FILE [no ci] (#3332) 4a1f367 danbev committed on Jul 24, 2025
server : hide language probabilities option behind flag (#3328) 606bf70 sachaarbonel committed on Jul 21, 2025
go: fix Mac OS X builds (#3310) 2fd8067 BVK Chaitanya Chaitanya Bayapuneni committed on Jul 21, 2025
cuda : Fix Gemma3n not executed as CUDA_GRAPH on NVGPUs (llama/14741) bb523fb Oliver Simons committed on Jul 18, 2025
use max work group size for device to replace the magic number (llama/14732) e5e9b79 Neo Zhang Jianyu committed on Jul 18, 2025
llama : add high-throughput mode (llama/14363) b2d73a2 ggerganov HF Staff JohannesGaessler committed on Jul 16, 2025
vulkan: fix noncontig check for mat_mul_id splitting (llama/14683) 4d0d8b8 jeffbolznv committed on Jul 15, 2025
vulkan: add RTE variants for glu/add/sub/mul/div (llama/14653) bac21a7 jeffbolznv committed on Jul 15, 2025
cuda: fix build warnings in set-rows.cu (unused variable) (llama/14687) 1e145c7 yeahdongcn committed on Jul 15, 2025
sycl: Batched mulmat rework for oneDNN dispatch (llama/14617) 2722bea Anton Mitkov committed on Jul 14, 2025
ggml : add build-time message to remind about ggml_set_rows (llama/14661) 0f5d4ba ggerganov HF Staff committed on Jul 13, 2025
metal : Add missing unary ops Metal support (llama/14660) 2ed022e Yavor Ivanov committed on Jul 13, 2025