llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-06-27 23:50:20 -05:00

History

Georgi Gerganov 11ac9800af

llama : improve infill support and special token detection (#9798 )

* llama : improve infill support

ggml-ci

* llama : add more FIM token strings

ggml-ci

* server : update prompt on slot restore (#9800)

* gguf : deprecate old FIM token KVs

2024-10-12 08:21:51 +03:00

llama.h

llama : improve infill support and special token detection (#9798 )

2024-10-12 08:21:51 +03:00