Aman Gupta
b68d75165a
llama: Add option to merge gate and exp weights (#19139)
* llama: Add option to merge gate and exp weights
* Update convert_hf_to_gguf.py
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
* Update convert_hf_to_gguf.py
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
* update constants.py
* add gate_up for the all MoE models
* convert: simplify merge tensor condition
* update constants.py
* reduce number of models, add create_tensor_gate_up helper
---------
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
2026-02-26 21:01:08 +08:00