Skip to content

Commit 785ae21

Browse files
snnntarekziade
authored andcommitted
Move Linux GPU CI pipeline to A10 (microsoft#23235)
Move Linux GPU CI pipeline to A10 machines which are more advanced. Retire onnxruntime-Linux-GPU-T4 machine pool. Disable run_lean_attention test because the new machines do not have enough shared memory. ``` skip loading trt attention kernel fmha_mhca_fp16_128_256_sm86_kernel because no enough shared memory [E:onnxruntime:, sequential_executor.cc:505 ExecuteKernel] Non-zero status code returned while running MultiHeadAttention node. Name:'MultiHeadAttention_0' Status Message: CUDA error cudaErrorInvalidValue:invalid argument ```
1 parent 0431778 commit 785ae21

File tree

3 files changed

+3
-3
lines changed

3 files changed

+3
-3
lines changed

onnxruntime/test/python/transformers/test_mha.py

+1-1
Original file line numberDiff line numberDiff line change
@@ -892,7 +892,7 @@ def test_all(self):
892892
# Run tests sequentially to avoid out of memory issue.
893893
self.run_mha_cpu()
894894
self.run_mha_cuda()
895-
self.run_lean_attention()
895+
# self.run_lean_attention()
896896
self.run_mha_cuda_multi_threading_default()
897897
self.run_mha_cuda_multi_threading_cudnn()
898898
self.run_mha_cuda_multi_threading_efficient()

tools/ci_build/github/azure-pipelines/linux-gpu-ci-pipeline.yml

+1-1
Original file line numberDiff line numberDiff line change
@@ -137,7 +137,7 @@ stages:
137137
skipComponentGovernanceDetection: true
138138
workspace:
139139
clean: all
140-
pool: onnxruntime-Linux-GPU-T4
140+
pool: Onnxruntime-Linux-A10-24G
141141
steps:
142142
- checkout: self
143143
clean: true

tools/ci_build/github/linux/build_cuda_ci.sh

+1-1
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ BUILD_ARGS=('--config'
2121
"--enable_pybind"
2222
"--build_java"
2323
"--cmake_extra_defines"
24-
"CMAKE_CUDA_ARCHITECTURES=75"
24+
"CMAKE_CUDA_ARCHITECTURES=86"
2525
"onnxruntime_BUILD_UNIT_TESTS=ON"
2626
"onnxruntime_ENABLE_CUDA_EP_INTERNAL_TESTS=ON")
2727
if [ -x "$(command -v ninja)" ]; then

0 commit comments

Comments
 (0)