ik_llama.cpp/common at 6c665f38fd8d2178c4ae4c190fd374f3a87c057d - ik_llama.cpp - Jared's Git Server

jdelony/ik_llama.cpp

mirror of https://github.com/ikawrakow/ik_llama.cpp.git synced 2026-06-28 04:30:15 -05:00

History

Nexes the Elder 6c665f38fd

sweep-bench: add -minilog argument to reduce verbose logging (#1468 )

Purpose:
Add --minilog flag to llama-sweep-bench that filters log output to show only essential GPU/layer distribution information while suppressing verbose model metadata and per-layer device assignment messages.

Changes:
- Add llama_selective_log_callback with blacklist approach (sweep-bench.cpp)

Blacklisted patterns (hidden):
- Per-layer device assignments ('Setting default device in layer')
- KV metadata dump header and entries
- Tensor type counts
- Model validation messages
- EOG/special token cache info
- Metadata printout (llm_load_print_meta, print_info)
- Layer sizes table
- Tensor loading info (llm_load_tensors)
- Separator lines
- Most common cases of incomplete/continuation lines are also hidden

All other log output is shown, including:
- GPU VRAM info
- Split/buffer distribution per device
- Graph split estimates
- Final benchmark table and timings

2026-03-20 09:40:56 +01:00

..

Merge mainline llama.cpp (#3 )

2024-07-27 07:55:01 +02:00

common : introduce composable PEG parser combinators for chat parsing and new jinja template engine (#1369 )

2026-03-09 11:03:33 +01:00

base64.hpp

llava : expose as a shared library for downstream projects (#3613 )

2023-11-07 00:36:23 +03:00

build-info.cpp.in

build : link against build info instead of compiling against it (#3879 )

2023-11-02 08:50:16 +02:00

chat-parser-xml-toolcall.cpp

common : introduce composable PEG parser combinators for chat parsing and new jinja template engine (#1369 )

2026-03-09 11:03:33 +01:00

chat-parser-xml-toolcall.h

Allow arbitrary arguments order for Q3C, Q3CN, and Qwen3.5 (#1352 )

2026-03-03 15:39:16 +01:00

chat-parser.cpp

common : introduce composable PEG parser combinators for chat parsing and new jinja template engine (#1369 )

2026-03-09 11:03:33 +01:00

chat-parser.h

Refactor chat and server file (#1062 )

2025-12-15 08:27:20 +01:00

chat-peg-parser.cpp

common : introduce composable PEG parser combinators for chat parsing and new jinja template engine (#1369 )

2026-03-09 11:03:33 +01:00

chat-peg-parser.h

common : introduce composable PEG parser combinators for chat parsing and new jinja template engine (#1369 )

2026-03-09 11:03:33 +01:00

chat.cpp

common : introduce composable PEG parser combinators for chat parsing and new jinja template engine (#1369 )

2026-03-09 11:03:33 +01:00

chat.h

common : introduce composable PEG parser combinators for chat parsing and new jinja template engine (#1369 )

2026-03-09 11:03:33 +01:00

CMakeLists.txt

common : introduce composable PEG parser combinators for chat parsing and new jinja template engine (#1369 )

2026-03-09 11:03:33 +01:00

common.cpp

sweep-bench: add -minilog argument to reduce verbose logging (#1468 )

2026-03-20 09:40:56 +01:00

common.h

sweep-bench: add -minilog argument to reduce verbose logging (#1468 )

2026-03-20 09:40:56 +01:00

console.cpp

check C++ code with -Wmissing-declarations (#3184 )

2023-09-15 15:38:27 -04:00

console.h

gguf : new file format with flexible meta data (beta) (#2398 )

2023-08-21 23:07:43 +03:00

json-partial.cpp

common : introduce composable PEG parser combinators for chat parsing and new jinja template engine (#1369 )

2026-03-09 11:03:33 +01:00

json-partial.h

Move minja and nlohmann/json to vendor (#802 )

2025-09-27 09:12:35 +02:00

json-schema-to-grammar.cpp

common : introduce composable PEG parser combinators for chat parsing and new jinja template engine (#1369 )

2026-03-09 11:03:33 +01:00

json-schema-to-grammar.h

common : introduce composable PEG parser combinators for chat parsing and new jinja template engine (#1369 )

2026-03-09 11:03:33 +01:00

llguidance.cpp

Tool calls support from mainline (#723 )

2025-09-01 08:38:49 +03:00

log.cpp

Refactor chat and server file (#1062 )

2025-12-15 08:27:20 +01:00

log.h

Server: refactor and rename functions (#1151 )

2026-01-18 08:16:57 +02:00

ngram-cache.cpp

spec : add self speculative decoding, ngram and refactor (#1261 )

2026-02-13 19:04:55 +01:00

ngram-cache.h

spec : add self speculative decoding, ngram and refactor (#1261 )

2026-02-13 19:04:55 +01:00

ngram-map.cpp

spec : add self speculative decoding, ngram and refactor (#1261 )

2026-02-13 19:04:55 +01:00

ngram-map.h

spec : add self speculative decoding, ngram and refactor (#1261 )

2026-02-13 19:04:55 +01:00

ngram-mod.cpp

spec : add self speculative decoding, ngram and refactor (#1261 )

2026-02-13 19:04:55 +01:00

ngram-mod.h

spec : add self speculative decoding, ngram and refactor (#1261 )

2026-02-13 19:04:55 +01:00

peg-parser.cpp

common : introduce composable PEG parser combinators for chat parsing and new jinja template engine (#1369 )

2026-03-09 11:03:33 +01:00

peg-parser.h

common : introduce composable PEG parser combinators for chat parsing and new jinja template engine (#1369 )

2026-03-09 11:03:33 +01:00

regex-partial.cpp

server : support multi-modal context checkpoints and prompt caching (#1398 )

2026-03-13 08:07:57 +01:00

regex-partial.h

Tool calls support from mainline (#723 )

2025-09-01 08:38:49 +03:00

sampling.cpp

Adaptive P sampler: update review logic, delete old code comments, put prep stage after logit bias (#1386 )

2026-03-14 12:34:12 +01:00

sampling.h

Adaptive P sampler: update review logic, delete old code comments, put prep stage after logit bias (#1386 )

2026-03-14 12:34:12 +01:00

speculative.cpp

Add MTP decoding support for GLM-4.x MoE (#1270 )

2026-02-22 18:14:39 +01:00

speculative.h

Add MTP decoding support for GLM-4.x MoE (#1270 )

2026-02-22 18:14:39 +01:00

train.cpp

Server: refactor and rename functions (#1151 )

2026-01-18 08:16:57 +02:00

train.h

sync : ggml (backend v2) (#3912 )

2023-11-13 14:16:23 +02:00

unicode.cpp

common : introduce composable PEG parser combinators for chat parsing and new jinja template engine (#1369 )

2026-03-09 11:03:33 +01:00

unicode.h

common : introduce composable PEG parser combinators for chat parsing and new jinja template engine (#1369 )

2026-03-09 11:03:33 +01:00