llama.cpp/tools at 40d5358d3c730b81729ba81cd5c44ed596d02510 - llama.cpp - Jared's Git Server

jdelony/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-06-27 23:50:20 -05:00

History

ScrewTSW b65bb4baae

server: expose prompt token counts in /slots endpoint (#23454 )

Add n_prompt_tokens, n_prompt_tokens_processed, and n_prompt_tokens_cache
to the /slots JSON response. These fields are already tracked internally
but were not exposed, making it impossible for clients to monitor prompt
evaluation progress during processing.

2026-05-21 13:29:13 +02:00

..

app : add batched-bench, fit-params, quantize & perplexity (#23459 )

2026-05-21 10:29:44 +03:00

mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision (#23329 )

2026-05-21 00:35:37 +02:00

mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision (#23329 )

2026-05-21 00:35:37 +02:00

cvector-generator

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

app : add batched-bench, fit-params, quantize & perplexity (#23459 )

2026-05-21 10:29:44 +03:00

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

app : introduce the llama unified executable (#23296 )

2026-05-20 13:22:22 +02:00

mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision (#23329 )

2026-05-21 00:35:37 +02:00

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

app : add batched-bench, fit-params, quantize & perplexity (#23459 )

2026-05-21 10:29:44 +03:00

app : add batched-bench, fit-params, quantize & perplexity (#23459 )

2026-05-21 10:29:44 +03:00

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

fix: rpc-server cache may not work in Windows environments (#22394 )

2026-04-27 17:25:09 +03:00

server: expose prompt token counts in /slots endpoint (#23454 )

2026-05-21 13:29:13 +02:00

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

logs : reduce (#23021 )

2026-05-14 13:05:52 +03:00

ui: Improve Git Hooks for UI development (#23403 )

2026-05-21 08:27:50 +02:00

CMakeLists.txt

ui: Restructure repo to use tools/ui folder and ui / UI / llama-ui / LLAMA_UI naming (#23064 )

2026-05-16 02:02:40 +02:00