Commit History

CUDA/HIP: optimize mmv paths taken for HIP devices (llama/14324)
1a9d2d3

uvos JohannesGaessler commited on

CUDA: mul_mat_v support for batch sizes > 1 (llama/14262)
2d1e6e7

JohannesGaessler commited on

HIP: enable vec fattn on RDNA4 (llama/14323)
b6dc6a1

uvos commited on

CUDA: add mean operation (llama/14313)
7cee55b

am17an commited on

Add support for VK_EXT_debug_utils to add labels to Vulkan objects. (llama/13792)
2c3741a

Markus Tavenrath commited on

metal : fix thread-safety (llama/14300)
2bd85b6

ggerganov commited on

ggml-cpu : "align corners" for bilinear upscale/downscale (ggml/1285)
88e7829

Acly commited on

ggml-quants : rename best_mad to best_error (ggml/1283)
cd9270f

danbev commited on

ci : use selective copy for musa image (#3296)
e11fac2
unverified

danbev commited on

ci: set fail-fast to false in docker.yml (#3294)
fc20738
unverified

danbev commited on

ruby : add Whisper::VERSION (#3292)
6468892
unverified

KitaitiMakoto commited on

whisper : add version function (#3289)
0b952d7
unverified

danbev commited on

ci : add should_release variable (#3288)
7272681
unverified

danbev commited on

docs : add cmake "-j" flag in README.md (#3284)
5fe3033
unverified

toboil-features commited on

ci : add support for tag-based releases (#3287)
f21cf37
unverified

danbev commited on

release : v1.7.6
5cade6e
unverified

ggerganov commited on

bench : update benches
d4f72cd
unverified

ggerganov commited on

bench : print system info before ctx check
835c3e8
unverified

ggerganov commited on

stream : add nullptr check of whisper_context (#3283)
9f0c009
unverified

danbev commited on

ci : enable main-cuda build (#3282)
9bee7f3
unverified

danbev commited on

bindings.java : update java example (#3281)
f001158
unverified

Joas Dev commited on

coreml : backport CoreML features to macos < 14 (#3255)
dc0917f
unverified

glaszig commited on

ci : reduce musa image size (#3277)
a45c78b
unverified

danbev commited on

whisper : add .gitignore entries for OpenVINO support (#3276)
ca0545e
unverified

Yukimasa Funaoka commited on

command: output commands to text file (#3273)
a482bd7
unverified

Aaron Ang commited on

ci : add apt-get clean to musa Dockerfile (#3275)
32a61ec
unverified

danbev commited on

ruby : specify Apple frameworks explicitly on build (#3270)
728defc
unverified

KitaitiMakoto commited on

talk-llama : sync llama.cpp
ade9bc3

ggerganov commited on

sync : ggml
48a7292

ggerganov commited on

CUDA: add conv_2d_transpose (llama/14287)
a728b83

am17an commited on

sycl: add usage of enqueue_functions extension (llama/14244)
2e59a96

Nicolò Scipione commited on

Implement GGML_CPU_ALL_VARIANTS for PowerPC (llama/14286)
0bcd751

Christian Kastner Diego Devesa commited on

cuda : synchronize graph capture and cublas handle destruction (llama/14288)
39c4fa5

Diego Devesa commited on

ggml : fix repack work size for mul_mat_id (llama/14292)
4b0d2de

ggerganov commited on

ggml: Update KleidiAI to v1.9.0 (llama/14277)
90ccf35

Charles Xu commited on

CUDA: add conv_2d_dw (llama/14265)
5cca3ec

am17an commited on

ggml-cpu : remove unnecesary arm feature detection (llama/14281)
62cf694

Diego Devesa commited on

build : suppress gcc15 compile warnings (llama/14261)
0454008

fanyang commited on

sycl: Cleanup codepaths in Get Rows in sycl backend (llama/14215)
feee739

Anton Mitkov commited on

llamafile : support s390x SIMD instruction set (llama/14273)
26bafb6

taronaeo commited on

Vulkan: Set device max size for host memory to avoid OOM warning and fallback to CPU buffer (llama/14249)
08debcd

OccamRazor commited on

metal : add mean kernel (llama/14267)
a726ecc

ggerganov commited on

ggml-cpu: reduce asm calls for hsum (llama/14037)
17c0dfa

taronaeo commited on

ggml-cpu: fix uncaught underscore terminators (llama/14023)
c005248

taronaeo commited on

ggml: Add Apple support for GGML_CPU_ALL_VARIANTS (llama/14258)
9d1d21b

Charles Xu commited on

Add `ggml_roll` (ggml/1274)
71923e5

Acly commited on

android : update CMakeLists.txt to use FetchContent for ggml (#3268)
e5d47d0
unverified

danbev commited on

cmake : fix android build (#3265)
e70bf99
unverified

ggerganov danbev commited on

examples : add stereo to mono conversion in read_audio_data (#3266)
5451562
unverified

danbev commited on

talk-llama : sync llama.cpp
fc04dc0

ggerganov commited on