Spaces:
Running
Running
Commit History
ggml : fix compile warnings (llama/0) 80d6ec0
llamafile : fix include path (llama/0) e443f89
vulkan: Optimize some mat-vec mul quant shaders (llama/10296) dc0e685
ggml : optimize Q4_0 into Q4_0_X_Y repack (llama/10324) abf6f22
Dan Johansson commited on
Make updates to fix issues with clang-cl builds while using AVX512 flags (llama/10314) 2868c2b
Srihari-mcw commited on
ggml: new optimization interface (ggml/988) dd33ace
ggml : remove duplicated sources from the last sync (ggml/1017) 026d20b
ggml : fix some build issues c5ba1d1
slaren commited on
sync : leftovers (ggml/0) 0f6c498
cmake : restore CMakeLists.txt (llama/10256) 51a70ff
AVX BF16 and single scale quant optimizations (llama/10212) e6ffed3
Eve commited on
sycl: Use syclcompat::dp4a (llama/10267) ce0dc30
Romain Biessy commited on
backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (llama/9921) 3541ee8
Charles Xu Diego Devesa commited on
ggml : build backends as libraries (llama/10256) 3dc93f3
scripts : update sync 1741306
release : v1.7.2 414329d unverified
sycl: fix example build (#2570) a0dcffc unverified
Stefan Sydow commited on
ci : use local ggml in Android build (#2567) 72b7501 unverified
ggml : tmp workaround for whisper.cpp (skip) (#2565) ef26f48 unverified
update : readme d1fa03c unverified
scripts : fix sync path 9a2f912 unverified
whisper.swiftui : switch Mac dest to Mac (Designed for iPad) (#2562) 13f2beb unverified
cmake : fix ppc64 check (#0) f3c3fca
whisper : include ggml-cpu.h (#0) cb35171
build : fixes 11d19cb
talk-llama : sync llama.cpp 6bb34fb
whisper : fix build (#0) dfd316d
sync : ggml 9e83be6
sycl : Fixes to broken builds and test-backend-ops (llama/10257) 9cfb13b
Alberto Cabrera Pérez commited on
vulkan: Optimize contiguous copies (llama/10254) 9974bd6
vulkan: Throttle the number of shader compiles during the build step. (llama/10222) 9677a2f
metal : more precise Q*K in FA vec kernel (llama/10247) 9160e8f
vulkan: Fix newly added tests for permuted mul_mat and 1D im2col (llama/10226) 76b8073
metal : reorder write loop in mul mat kernel + style (llama/10231) 661360d
metal : fix build and some more comments (llama/10229) 93fc215
metal : fix F32 accumulation in FA vec kernel (llama/10232) 228e0b2
metal : hide debug messages from normal log efefcbb
ggml: fix zero division in ‘dne’ calculation in CUDA COUNT_EQUAL operator when ‘ne’ is small (#10213) 0ecc4d6
ggml : optimize llamafile cpu matrix multiplication for ppc64le (llama/10156) 18bdb35
amritahs-ibm commited on
metal : opt-in compile flag for BF16 (llama/10218) 5f667d1
metal : improve clarity (minor) (llama/10171) d68ae7c
metal : optimize FA kernels (llama/10171) 44ff932
ggml : add ggml-cpu.h to the public headers (llama/10204) 936a35f
Diego Devesa commited on
fix q4_0_8_8 format for corrupted tokens issue (llama/10198) 4700b48
snadampal EC2 Default User commited on
metal : add BF16 support (llama/8439) 847669b
metal : fix from ptr buffer name (llama/10189) c4d59b9
Diego Devesa commited on