Kawrakow 3e573cfea6
MTP: option to use re-quantized output tensor for better TG performance (#1809)
* Option to use re-quantized output tensor for MTP

* Remove quantize extra output option

* Handle interleaved types
2026-05-16 14:40:18 +03:00
..
2024-07-27 07:55:01 +02:00
2025-12-15 08:27:20 +01:00
2026-05-06 09:25:38 +03:00
2023-11-13 14:16:23 +02:00