This PR adds support for running on CPU (without flash attention).
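
As a rough illustration of the kind of fallback described here, the sketch below picks an attention path based on the device and on whether the optional `flash_attn` package is importable. It is not the code from this PR: the helper names `flash_attn_installed` and `select_attention_backend`, and the backend labels `"flash"` and `"standard"`, are hypothetical and chosen only for this example.

```python
import importlib.util


def flash_attn_installed() -> bool:
    """Return True if the optional `flash_attn` package can be found."""
    # find_spec only locates the package; it does not import it.
    return importlib.util.find_spec("flash_attn") is not None


def select_attention_backend(device_type: str, flash_available: bool) -> str:
    """Pick an attention implementation label (hypothetical helper).

    Flash attention is only selected on CUDA devices when the package is
    present; every other case falls back to a plain attention path.
    """
    if device_type == "cuda" and flash_available:
        return "flash"
    return "standard"


if __name__ == "__main__":
    # On a CPU-only machine this selects the plain attention path.
    print(select_attention_backend("cpu", flash_attn_installed()))
```

Keeping the decision in one small function means the rest of the model code does not need to know whether flash attention is installed.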

michael-guenther changed pull request status to open
michael-guenther changed pull request status to merged

lgtm
