ollama37/CMakeLists.txt
Michael Yang b42aba40ed cuda: enable flash attention
ggml added an option to disable flash attention, so explicitly enable it
2025-02-28 19:40:34 +00:00

4.8 KiB