JohannesGaessler's picture
CUDA: app option to compile without FlashAttention (llama/12025)
fbc5f16