YiChen Lv
d789527482
spec : Support Step3.5/3.7 flash mtp3 (#24340)
* add mtp_layer_offset + include nextn flags in graph reuse
* add llama_set_mtp_layer_offset + llama_model_n_nextn_layer API
* offset head select + require all MTP blocks
* speculative multi-head process()
* speculative multi-head draft()
* gather outputs via inp_out_ids
* cleanup
* fix core
* minor cleanup
* merged draft_multi_head into draft()
* mtp rename nextn
* Apply suggestions from code review
Co-authored-by: Aman Gupta <amangupta052@gmail.com>
* clean-up comments
* fix for multi seq
* apply suggestions && chain-heads comment
* add a reference for chain_heads discussion
---------
Co-authored-by: Aman Gupta <amangupta052@gmail.com>
2026-06-21 11:33:18 +03:00
..
2026-06-14 15:07:31 +02:00
2026-06-20 19:45:27 +02:00
2026-05-29 16:30:55 +02:00
2023-11-07 00:36:23 +03:00
2026-04-17 11:11:46 +03:00
2026-04-17 11:11:46 +03:00
2026-06-15 15:37:04 +02:00
2026-05-25 08:56:18 +03:00
2026-04-03 09:07:59 +03:00
2026-06-15 15:37:04 +02:00
2026-06-15 15:37:04 +02:00
2026-06-15 22:10:09 +02:00
2026-06-15 15:37:04 +02:00
2026-06-15 17:33:54 +02:00
2026-05-25 08:56:18 +03:00
2026-06-04 17:45:40 +02:00
2026-06-19 22:28:38 +02:00
2026-06-20 01:02:26 +02:00
2026-04-06 20:54:06 +02:00
2026-03-05 10:47:28 +01:00
2026-04-27 08:06:39 +03:00
2026-04-27 08:06:39 +03:00
2026-06-18 12:45:23 +02:00
2026-06-18 12:45:23 +02:00
2026-06-13 08:09:52 +03:00
2026-06-13 08:09:52 +03:00
2026-06-17 18:04:58 +02:00
2026-06-17 18:04:58 +02:00
2026-03-09 17:47:54 +01:00
2026-06-04 17:45:40 +02:00
2026-06-04 17:45:40 +02:00
2025-11-18 18:54:15 +01:00
2026-01-20 18:23:25 +01:00
2026-06-20 17:43:04 -05:00
2025-12-16 04:05:23 -06:00
2026-01-04 22:22:16 +02:00
2026-06-17 09:19:11 +03:00
2026-05-14 13:05:52 +03:00
2026-01-28 19:42:42 +02:00
2026-01-28 19:42:42 +02:00
2026-05-19 15:32:58 +03:00
2026-03-31 13:50:51 +02:00
2026-05-29 09:21:37 +03:00
2026-01-30 21:27:27 +02:00
2026-06-20 21:15:06 -05:00
2026-06-20 21:15:06 -05:00
2026-06-18 12:45:23 +02:00
2026-06-18 12:45:23 +02:00
2026-06-01 11:37:11 +02:00
2026-06-01 11:37:11 +02:00
2026-03-16 08:50:38 +02:00
2025-05-14 19:50:57 +01:00
2026-06-18 12:49:14 +02:00
2026-06-07 22:48:11 +02:00
2026-06-21 11:33:18 +03:00
2026-06-19 13:08:50 +03:00
2026-03-11 10:26:12 +01:00
2026-03-11 10:26:12 +01:00