Xuan-Son Nguyen
506bb6e010
model: try to improve Qwen3 Next (#18683)
* qwen3next: simplify qkvz projection
* use ggml_swiglu_split
* revert swiglu_split, but remove redundant repeat()
* fix missing reshape
* rm 2 redundant transposes
* move mul_mat(k,q) to outside of chunking
* rm redundant cont
* improve g_cs_chunk
* add comments about no cont
* use std::pair instead of ggml_concat
* vectorize key_gdiff calculation
* rm unused tensor
* avoid ggml_concat inside loop
* bring back ggml_concat as it may not work on other backend
* nits
2026-01-11 12:53:33 +01:00