Skip to content

Commit 31863ab

Browse files
Eliasj42Elias Joseph
andauthored
Move sharktank and regression-test MI300 jobs to the ossci cluster. (#20359)
Progress on nod-ai/amd-shark-ai#793. This moves the sharktank and regression-test workflow jobs using MI300 runners to the ossci cluster. ci-exactly: build_packages,regression_test,test_sharktank --------- Signed-off-by: Elias Joseph <eljoseph@amd.com> Co-authored-by: Elias Joseph <eljoseph@amd.com>
1 parent 3fa6355 commit 31863ab

File tree

2 files changed

+14
-2
lines changed

2 files changed

+14
-2
lines changed

.github/workflows/pkgci_regression_test.yml

Lines changed: 8 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -48,7 +48,7 @@ jobs:
4848
rocm-chip: gfx942
4949
backend: rocm
5050
sku: mi300
51-
runs-on: nodai-amdgpu-mi300-x86-64
51+
runs-on: linux-mi300-1gpu-ossci-iree-org
5252
- name: amdgpu_rocm_mi308_gfx942
5353
rocm-chip: gfx942
5454
backend: rocm
@@ -91,6 +91,13 @@ jobs:
9191
# -rA -s -m "plat_host_cpu and presubmit" \
9292
# experimental/regression_suite
9393

94+
- name: "Set IREE_TEST_FILES variable"
95+
run: |
96+
case "${{ matrix.name }}" in
97+
"amdgpu_rocm_mi300_gfx942") echo IREE_TEST_FILES="/shark-cache/data/iree-regression-cache" >> $GITHUB_ENV ;;
98+
*) echo "No cache directory assigned for ${{ matrix.name }}" ;;
99+
esac
100+
94101
- name: "Running SDXL special model tests"
95102
if: "!cancelled()"
96103
run: |

.github/workflows/pkgci_test_sharktank.yml

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -30,11 +30,16 @@ jobs:
3030
gpu: none
3131
runs-on: ubuntu-24.04
3232

33-
- name: rocm_hip
33+
- name: rocm_hip_w7900
3434
target: target_hip
3535
gpu: gfx1100
3636
runs-on: nodai-amdgpu-w7900-x86-64
3737

38+
- name: rocm_hip_mi300
39+
target: target_hip
40+
gpu: gfx942
41+
runs-on: linux-mi300-2gpu-ossci-iree-org
42+
3843
env:
3944
VENV_DIR: ${{ github.workspace }}/venv
4045
GH_TOKEN: ${{ github.token }}

0 commit comments

Comments
 (0)