llama.cpp

mirror of https://github.com/RYDE-WORK/llama.cpp.git synced 2026-01-19 21:23:26 +08:00

History

* ggml-backend : fix async copy from CPU

* cuda : more reliable async copy, fix stream used when the devices are the same

2024-08-07 13:29:02 +02:00

2024-06-26 18:33:02 +03:00

2024-08-06 10:26:46 +03:00

2024-08-07 13:29:02 +02:00

.gitignore

2024-07-13 18:12:39 +02:00

CMakeLists.txt

cann: update cmake (#8765 )

2024-07-30 12:37:35 +02:00