This website requires JavaScript.
Explore
Help
Register
Sign In
jdelony
/
llama.cpp
Watch
1
Star
0
Fork
0
You've already forked llama.cpp
mirror of
https://github.com/ggml-org/llama.cpp.git
synced
2026-06-27 23:50:20 -05:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
llama.cpp
/
docs
/
ops
History
Neo Zhang
a51142497a
[SYCL] Support Q4_1, Q5_0, Q5_1 in Flash-attention (
#23812
)
...
* support Q4_1, Q5_0, Q5_1 * update ut case
2026-06-01 09:53:53 +03:00
..
BLAS.csv
…
CANN.csv
…
CPU.csv
…
CUDA.csv
…
Metal.csv
…
OpenCL.csv
…
SYCL.csv
[SYCL] Support Q4_1, Q5_0, Q5_1 in Flash-attention (
#23812
)
2026-06-01 09:53:53 +03:00
Vulkan.csv
…
WebGPU.csv
ggml-webgpu: Enables running gpt-oss-20b (
#22906
)
2026-05-12 07:27:40 -07:00
zDNN.csv
…
ZenDNN.csv
…