fix: force MMQ kernels to bypass rocBLAS TensileLibrary on gfx1103

ROCm 7.2 rocBLAS has no TensileLibrary for gfx1103 (RDNA3 iGPU), and the gfx1102 library kernels crash due to register file differences.

Force MMQ (matrix multiply quantized) kernels, which are compiled by hipcc for the actual target arch, bypassing rocBLAS entirely. This matches how Ollama successfully runs on AMD 780M / gfx1103.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-03 01:44:34 +02:00
parent cec946de76
commit 94127d7b33
1 changed file with 1 addition and 0 deletions
@@ -38,6 +38,7 @@ RUN HIPCXX="$(hipconfig -l)/clang" HIP_PATH="$(hipconfig -R)" \
    cmake -S . -B build \
        -DGGML_HIP=ON \
        -DGGML_HIP_ROCWMMA_FATTN=OFF \
+        -DGGML_CUDA_FORCE_MMQ=ON \
        -DAMDGPU_TARGETS="$ROCM_DOCKER_ARCH" \
        -DGGML_BACKEND_DL=ON -DGGML_CPU_ALL_VARIANTS=ON \
        -DCMAKE_BUILD_TYPE=Release -DLLAMA_BUILD_TESTS=OFF \
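
One way to confirm that the new flag actually reached the configure step is to look for it in the CMake cache after the `cmake -S . -B build` invocation above. The following is a minimal sketch, not part of this commit: the helper name `mmq_forced` is hypothetical, and the default path assumes the build directory is `build`, as in the `-B` argument in the hunk.

```shell
# Hypothetical helper (not part of the commit): succeeds if the given
# CMake cache file records GGML_CUDA_FORCE_MMQ as ON.
# Assumption: the build directory is "build", matching "cmake -S . -B build".
mmq_forced() {
    cache="${1:-build/CMakeCache.txt}"
    # Cache entries have the form NAME:TYPE=VALUE; accept any TYPE.
    grep -Eq '^GGML_CUDA_FORCE_MMQ:[A-Z]+=ON$' "$cache"
}

# Example call, to be run after the configure step:
#   mmq_forced build/CMakeCache.txt && echo "MMQ forced"
```

This only checks that the option was set at configure time. It does not show which kernels run on the GPU at inference time.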