This website requires JavaScript.
Explore
Help
Register
Sign In
jdelony
/
ik_llama.cpp
Watch
1
Star
0
Fork
0
You've already forked ik_llama.cpp
mirror of
https://github.com/ikawrakow/ik_llama.cpp.git
synced
2026-06-28 04:30:15 -05:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
ik_llama.cpp
/
ggml
History
Joel Farthing
f43a9f1cf6
Add per-byte CUDA MoE offload threshold (
#1813
)
...
Co-authored-by: Joel Farthing <262452229+joelfarthing@users.noreply.github.com>
2026-05-19 08:35:05 +03:00
..
cmake
Merge mainline llama.cpp (
#3
)
2024-07-27 07:55:01 +02:00
include
MTP: faster recurrent state restore (
#1791
)
2026-05-13 11:00:24 +03:00
src
Add per-byte CUDA MoE offload threshold (
#1813
)
2026-05-19 08:35:05 +03:00
.gitignore
Merge mainline llama.cpp (
#3
)
2024-07-27 07:55:01 +02:00
CMakeLists.txt
ggml : default GGML_WIN_VER to 0x0A00 (Windows 10) (
#1755
)
2026-05-08 13:23:04 +03:00