fix: disable WMMA flash attention for gfx1103 iGPU compatibility
EditorConfig Checker / editorconfig (push) Has been cancelled
EditorConfig Checker / editorconfig (push) Has been cancelled
rocWMMA does not support RDNA3 iGPU (gfx1103). Flash attention still works via the vec codepath. This allows native gfx1103 compilation without HSA_OVERRIDE_GFX_VERSION spoofing. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This commit is contained in:
@@ -37,7 +37,7 @@ COPY . .
|
||||
RUN HIPCXX="$(hipconfig -l)/clang" HIP_PATH="$(hipconfig -R)" \
|
||||
cmake -S . -B build \
|
||||
-DGGML_HIP=ON \
|
||||
-DGGML_HIP_ROCWMMA_FATTN=ON \
|
||||
-DGGML_HIP_ROCWMMA_FATTN=OFF \
|
||||
-DAMDGPU_TARGETS="$ROCM_DOCKER_ARCH" \
|
||||
-DGGML_BACKEND_DL=ON -DGGML_CPU_ALL_VARIANTS=ON \
|
||||
-DCMAKE_BUILD_TYPE=Release -DLLAMA_BUILD_TESTS=OFF \
|
||||
|
||||
Reference in New Issue
Block a user