ymcki
a0ed91a442
models : kda chunk size = 16 ( #19827 )
...
* models : add llm_build_delta_net_base
* cont : keep qwen35 and qwen35moe graphs intact
* cont : add comments [no ci]
* add kimi linear to delta-net-base
* removed unnecessary ggml_cont from g_exp_t
* removed ggml_cont from g_diff_exp_t. moved ggml_cont for o to kimi-linear.cpp
* removed unnecessary diag mask
* cont : simplify
* cont : avoid graph splits
* scale q after mul instead of beginning
* scale q after mul instead of beginning
* identical ppl
* cont : fix scale and decay mask
* minor : remove TODO
* block implementation for kda
* remove space at the end of line 101
* concat+pad
* pad+binary row concat
* chunk size 16 for kda
* removed minor differences to master
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2026-03-05 17:01:23 +02:00
..
2026-01-05 09:14:04 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2026-01-01 18:38:51 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2026-01-02 19:01:56 +02:00
2026-01-05 09:14:04 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2026-03-05 08:50:21 +01:00
2025-10-31 23:40:23 +01:00
2026-03-05 17:01:23 +02:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-11-10 22:55:30 +01:00
2026-02-26 12:14:09 +01:00
2025-10-31 23:40:23 +01:00
2026-01-13 23:28:38 +01:00
2025-10-31 23:40:23 +01:00
2026-02-16 14:35:04 +02:00
2025-10-31 23:40:23 +01:00
2026-01-05 09:14:04 +01:00
2026-01-02 19:01:56 +02:00
2026-01-23 18:22:34 +02:00
2026-01-02 19:01:56 +02:00
2025-10-31 23:40:23 +01:00
2025-12-16 11:25:26 +01:00
2026-02-18 17:51:40 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2026-02-16 14:35:04 +02:00
2025-10-31 23:40:23 +01:00
2025-11-04 12:29:15 +01:00
2025-10-31 23:40:23 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2026-02-19 13:30:17 +01:00
2025-11-04 12:29:15 +01:00
2026-02-16 14:35:04 +02:00
2026-02-25 00:01:13 +02:00
2026-02-19 09:54:48 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2026-01-05 09:14:04 +01:00
2025-12-24 14:02:36 +01:00
2026-01-02 20:11:59 +01:00
2026-02-16 14:35:04 +02:00
2026-02-16 14:35:04 +02:00
2025-12-24 23:07:08 +01:00
2026-01-22 22:09:01 +02:00
2025-10-31 23:40:23 +01:00
2025-12-01 12:26:52 +01:00
2026-03-05 08:50:21 +01:00
2026-02-19 08:52:21 +01:00
2025-10-31 23:40:23 +01:00
2026-02-16 14:35:04 +02:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2026-01-05 09:14:04 +01:00
2026-02-03 14:20:57 +01:00
2025-11-04 12:29:15 +01:00
2026-02-19 17:05:25 +01:00
2025-11-05 10:28:58 +01:00
2025-10-31 23:40:23 +01:00
2025-11-04 12:29:15 +01:00
2026-02-16 14:35:04 +02:00
2025-12-28 17:28:31 +01:00
2025-11-04 12:29:15 +01:00
2026-01-22 22:09:01 +02:00
2025-12-15 18:51:43 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2026-02-26 21:01:08 +08:00
2026-01-23 18:22:34 +02:00
2026-01-23 18:22:34 +02:00
2026-02-25 00:01:13 +02:00
2026-02-26 21:01:08 +08:00
2025-10-31 23:40:23 +01:00
2025-11-04 12:29:15 +01:00
2025-11-24 14:16:56 +08:00
2026-02-16 14:35:04 +02:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2026-02-16 14:35:04 +02:00
2025-10-31 23:40:23 +01:00
2025-11-04 12:29:15 +01:00
2026-01-05 09:14:04 +01:00
2025-11-04 12:29:15 +01:00
2025-10-31 23:40:23 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2026-02-06 21:06:14 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00