llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-06-27 23:50:20 -05:00

History

CUDA: add set rows for f32 and f16 (#14551 )

* CUDA: add set rows for f32 and f16

* Review: change kernel params, use strides from host

* Use 1-d kernel

* Review: use int64_t for blockDim.x, rename nb->s for clarity

2025-07-12 16:31:38 +03:00

cmake

ggml-cpu : rework weak alias on apple targets (#14146 )

2025-06-16 13:54:15 +08:00

include

ggml : add ggml_scale_bias (#14417 )

2025-07-09 18:16:12 +02:00

src

CUDA: add set rows for f32 and f16 (#14551 )

2025-07-12 16:31:38 +03:00

.gitignore

vulkan : cmake integration (#8119 )

2024-07-13 18:12:39 +02:00

CMakeLists.txt

ggml : remove kompute backend (#14501 )

2025-07-03 07:48:32 +03:00