-
Notifications
You must be signed in to change notification settings - Fork 3.5k
CI: Use self-hosted Azure GPU runners #14632
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 67 commits
Commits
Show all changes
68 commits
Select commit
Hold shift + click to select a range
69df775
move config
Borda ab7ef13
dir
Borda cb14d43
update
Borda f2276bc
rev
Borda 44ec3fa
all
Borda ff819f5
Empty-Commit
Borda b070450
Empty-Commit
Borda c2218df
Merge branch 'master' into ci/azure-runner
Borda 41b8181
export
Borda ae53c9f
devices
Borda de871ab
env
Borda f9a3619
env
Borda 77b1dde
env
Borda 519de03
env
Borda 0690053
hard
Borda 7c00074
0,1,2,3,4,5,6,7
Borda dec98bc
all
Borda 9865044
other
Borda 20f9e79
var
Borda 989a69d
var
Borda 6ed7c2e
var
Borda 08882a9
echo
Borda 73f9a0b
[]
Borda 045d79f
{}
Borda 4c59f32
()
Borda 60039a1
var_val
Borda f696493
var_val
Borda dc76d49
var_val
Borda 662fcdd
[]
Borda 41c81d6
[]
Borda f7e89c5
var
Borda 30199cd
var_val
Borda 9fd24a8
val
Borda 130fa24
()
Borda daa0982
env
Borda c26d3ef
env
Borda 3179036
env
Borda e165df8
env
Borda 92b19cd
env
Borda 4084256
env
Borda aa0e79c
readme
Borda 7d54bf8
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] b24dd96
Merge branch 'master' into ci/azure-runner
Borda decb292
Fix test blocklist. Skip nvprof test on CUDA>8.0
carmocca bbdf63f
Fix nvprof skip
carmocca 97ea70e
REVERT ME - DEBUGGING
carmocca c722e68
DEBUG - Is it Bagua?
carmocca cc428d3
skip bagua
Borda 7eab43f
Add print when trap triggers
carmocca a11efba
Skip Bagua installation
carmocca 1ea606b
Merge branch 'ci/azure-runner' of https://github.com/PyTorchLightning…
Borda d81d422
skip bagua
Borda 68c4316
Merge branch 'master' into ci/azure-runner
Borda 09328dc
runif
Borda 313a5b7
dockers
Borda 073ff8e
Apply suggestions from code review
carmocca 8adf5db
Skip Bagua async test
carmocca b1e3d0e
Fix installation
carmocca 98aef0f
DEBUG - skip to standalone
carmocca 905dc62
Revert "DEBUG - skip to standalone"
carmocca 0df21cb
Undo change
carmocca b0c04e6
Fix env var
carmocca be7ec9e
Merge branch 'master' into ci/azure-runner
akihironitta 4ad1014
Merge branch 'master' into ci/azure-runner
carmocca 078b2a3
Merge branch 'master' into ci/azure-runner
otaj f1eed82
Merge branch 'master' into ci/azure-runner
Borda 0445742
args
Borda 08abf66
Apply suggestions from code review
carmocca File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,46 @@ | ||
# Creation GPU self-hosted agent pool | ||
|
||
## Prepare the machine | ||
|
||
This is a slightly modified version of the script from | ||
https://docs.microsoft.com/en-us/azure/devops/pipelines/agents/docker | ||
|
||
```bash | ||
apt-get update | ||
apt-get install -y --no-install-recommends \ | ||
ca-certificates \ | ||
curl \ | ||
jq \ | ||
git \ | ||
iputils-ping \ | ||
libcurl4 \ | ||
libunwind8 \ | ||
netcat \ | ||
libssl1.0 | ||
|
||
curl -sL https://aka.ms/InstallAzureCLIDeb | sudo bash | ||
mkdir /azp | ||
``` | ||
|
||
## Stating the agents | ||
|
||
```bash | ||
export TARGETARCH=linux-x64 | ||
export AZP_URL="https://dev.azure.com/Lightning-AI" | ||
export AZP_TOKEN="xxxxxxxxxxxxxxxxxxxxxxxxxx" | ||
export AZP_POOL="lit-rtx-3090" | ||
|
||
for i in {0..7..2} | ||
do | ||
nohup bash .azure/start.sh \ | ||
"AZP_AGENT_NAME=litGPU-YX_$i,$((i+1))" \ | ||
"CUDA_VISIBLE_DEVICES=$i,$((i+1))" \ | ||
> "agent-$i.log" & | ||
done | ||
``` | ||
|
||
## Check running agents | ||
|
||
```bash | ||
ps aux | grep start.sh | ||
``` |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file was deleted.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.