llama.cpp

mirror of https://github.com/RYDE-WORK/llama.cpp.git synced 2026-01-20 13:43:26 +08:00

History

* fix group_norm ut

* split softmax

* fix softmax

* add concat support condition

* revert debug code

* move QK_WARP_SIZE to presets.hpp

2024-07-05 13:06:13 +08:00

2024-06-26 18:33:02 +03:00

2024-07-02 12:18:10 -04:00

2024-07-05 13:06:13 +08:00

CMakeLists.txt

2024-06-26 21:34:14 +02:00

ggml_vk_generate_shaders.py

2024-06-26 18:33:02 +03:00