mirror of https://github.com/RYDE-WORK/llama.cpp.git (synced 2026-01-19 21:23:26 +08:00)
* ggml-cpu : add chunking support to mul_mat_id

* allocate chunk counter in wdata
  parallelize src1 quantization by column to allow parallelization even when there is only one row

* disable for arm

* cleanup

* better way to disable for arm

* fix uninitialized counter when using 1 thread only

* revert test-backend-ops changes
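
For readers unfamiliar with the chunking scheme, here is a minimal sketch of how threads can claim chunks of rows through a shared atomic counter. It is not the ggml code: `chunk_state`, `chunk_state_init`, `chunk_worker` and the `row_owner` array are hypothetical names made up for illustration, and the convention that each thread begins with the chunk equal to its own index is an assumption of this sketch. It also shows why the counter has to be initialized even when only one thread runs.

```c
#include <assert.h>
#include <stdatomic.h>

// Hypothetical shared state for this sketch (the commit keeps its counter in wdata).
typedef struct {
    atomic_int next_chunk;  // index of the next unclaimed chunk
    int        n_chunks;    // total number of chunks
    int        chunk_size;  // rows per chunk
    int        n_rows;      // total number of rows to process
    int       *row_owner;   // row_owner[r] = index of the thread that processed row r
} chunk_state;

// Must run once before any worker starts, including the single-thread case:
// a worker reads the counter as soon as it finishes its first chunk.
static void chunk_state_init(chunk_state *st, int n_rows, int chunk_size,
                             int n_threads, int *row_owner) {
    assert(n_rows >= 0 && chunk_size > 0 && n_threads > 0);
    st->n_rows     = n_rows;
    st->chunk_size = chunk_size;
    st->n_chunks   = (n_rows + chunk_size - 1) / chunk_size;  // round up
    st->row_owner  = row_owner;
    // Assumption of this sketch: thread i starts with chunk i, so the
    // first unclaimed chunk is n_threads.
    atomic_init(&st->next_chunk, n_threads);
}

// Body executed by thread `ith`. Returns the number of rows it processed.
static int chunk_worker(chunk_state *st, int ith) {
    int processed = 0;
    int chunk = ith;  // first chunk is implied by the thread index
    while (chunk < st->n_chunks) {
        int first = chunk * st->chunk_size;
        int last  = first + st->chunk_size;
        if (last > st->n_rows) {
            last = st->n_rows;  // the final chunk may be shorter
        }
        for (int r = first; r < last; r++) {
            st->row_owner[r] = ith;  // stand-in for the real per-row work
            processed++;
        }
        // Claim the next unprocessed chunk; atomic_fetch_add returns the old value.
        chunk = atomic_fetch_add(&st->next_chunk, 1);
    }
    return processed;
}
```

In real use each `chunk_worker` call would run on its own thread after a single `chunk_state_init`. Because faster threads claim more chunks, the load balances itself when rows take uneven time, which a fixed per-thread split cannot do.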