mirror of
https://github.com/RYDE-WORK/llama.cpp.git
synced 2026-01-24 08:33:17 +08:00
* convert_hf : faster lazy safetensors This makes '--dry-run' much, much faster. * convert_hf : fix memory leak in lazy MoE conversion The '_lazy' queue was sometimes self-referential, which caused reference cycles of objects old enough to avoid garbage collection until potential memory exhaustion.