Add Turing and Ampere (A100) GGML to docker build file (#1691)

* Add Turing and Ampere (A100) GGML to docker build file At the moment, the docker file for image builds do not build for CUDA architectures below 8.6, and ik_llama.cpp specifies support for architectures Turing and above, this PR sets the CUDA architecture list to include the architecture for Turing (7.5) and A100 (8.0) * Remove 80 because few ppl have A100s and it does seem like many cuda arches cause issues for build * switch to 86-real and 89-real with 75, 80, 90 using virtual ptx jit * nvm, even adding 90-virtual causes linker error --------- Co-authored-by: Codex <codex@local>
2026-06-28 04:30:15 -05:00 · 2026-05-07 02:58:58 -07:00 · 2026-05-07 02:58:58 -07:00 · 9ddb510787
commit 9ddb510787
parent 75f0ab300e
1 changed files with 2 additions and 2 deletions
--- a/docker/ik_llama-cuda.Containerfile
+++ b/docker/ik_llama-cuda.Containerfile
@ -7,7 +7,7 @@ ARG BASE_CUDA_RUN_CONTAINER=docker.io/nvidia/cuda:${CUDA_VERSION}-runtime-ubuntu
 FROM ${BASE_CUDA_DEV_CONTAINER} AS build

 # Build arguments
-ARG CUDA_DOCKER_ARCH="86;90"
+ARG CUDA_DOCKER_ARCH="75-virtual;80-virtual;86-real;89-real"
 ARG GGML_NATIVE=ON
 ARG USE_CCACHE=true