llama.cpp

mirror of https://github.com/RYDE-WORK/llama.cpp.git synced 2026-01-20 05:33:37 +08:00

History

Georgi Gerganov f5a77a629b

* Major refactoring - introduce C-style API

* Clean up

* Add <cassert>

* Add <iterator>

* Add <algorithm> ....

* Fix timing reporting and accumulation

* Measure eval time only for single-token calls

* Change llama_tokenize return meaning

2023-03-22 07:32:36 +02:00

ggml-vocab.bin

Introduce C-style API (#370 )

2023-03-22 07:32:36 +02:00