From 181f71ca11583e251ca4607e5e099ea47e8a7327 Mon Sep 17 00:00:00 2001 From: pdscomp Date: Sun, 1 Mar 2026 12:37:08 -0500 Subject: [PATCH] .github,docker: add cuda13 architecture support (#551) Add `cuda13` as a supported build architecture, targeting the `ghcr.io/ggml-org/llama.cpp:server-cuda13` upstream base image. The `server-cuda13` image ships with CUDA 13 libraries, providing improved performance on recent NVIDIA hardware compared to the existing `server-cuda` (CUDA 12) image. Users with newer GPUs (e.g., RTX 50-series) benefit from reduced model load latency and higher token throughput. - Add `cuda13` to the allowed architectures list in `docker/build-container.sh` - Add `cuda13` to the CI matrix in `.github/workflows/containers.yml` so the container is built and pushed automatically --- .github/workflows/containers.yml | 2 +- docker/build-container.sh | 2 +- 2 files changed, 2 insertions(+), 2 deletions(-) diff --git a/.github/workflows/containers.yml b/.github/workflows/containers.yml index 77b7764..a67c510 100644 --- a/.github/workflows/containers.yml +++ b/.github/workflows/containers.yml @@ -29,7 +29,7 @@ jobs: runs-on: ubuntu-latest strategy: matrix: - platform: [intel, cuda, vulkan, cpu, musa, rocm] + platform: [intel, cuda, cuda13, vulkan, cpu, musa, rocm] fail-fast: false steps: - name: Checkout code diff --git a/docker/build-container.sh b/docker/build-container.sh index b1f3c18..d3296d3 100755 --- a/docker/build-container.sh +++ b/docker/build-container.sh @@ -27,7 +27,7 @@ ARCH=$1 PUSH_IMAGES=${2:-false} # List of allowed architectures -ALLOWED_ARCHS=("intel" "vulkan" "musa" "cuda" "cpu" "rocm") +ALLOWED_ARCHS=("intel" "vulkan" "musa" "cuda" "cuda13" "cpu" "rocm") # Check if ARCH is in the allowed list if [[ ! " ${ALLOWED_ARCHS[@]} " =~ " ${ARCH} " ]]; then