mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced 2026-06-28 04:30:15 -05:00
* Option to use re-quantized output tensor for MTP * Remove quantize extra output option * Handle interleaved types
* Option to use re-quantized output tensor for MTP * Remove quantize extra output option * Handle interleaved types