Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GPU CI & kernels tests fix #1578

Merged
merged 40 commits into from
Jan 17, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
40 commits
Select commit Hold shift + click to select a range
d416c27
fix: minor build bug
chenzhuofu Nov 1, 2024
a8ce685
fix: kernels test building errors
chenzhuofu Dec 16, 2024
cb05058
chore: minor
chenzhuofu Dec 17, 2024
e6e785e
Add example runs-on GPU CI workflow
lockshaw Dec 18, 2024
ca025a8
Try to get the runner to actually run
lockshaw Dec 18, 2024
43c4ddf
chore: add nixGL dep
chenzhuofu Dec 26, 2024
e394222
chore: update flake.lock
chenzhuofu Dec 27, 2024
dfae289
feat: add post-build process to modify GPU test commands
chenzhuofu Dec 27, 2024
0615a91
fix: nixGL flake env
chenzhuofu Jan 14, 2025
257d42e
chore: eliminate `nix run` during test
chenzhuofu Jan 14, 2025
0defe26
Merge branch 'from/lockshaw/runs-on' of github.com:flexflow/flexflow-…
chenzhuofu Jan 14, 2025
26d3252
chore: try to run ci
chenzhuofu Jan 15, 2025
a925155
chore: use base image foe limited memory space
chenzhuofu Jan 15, 2025
42c8af2
chore: minor
chenzhuofu Jan 15, 2025
4bb5efe
chore: minor
chenzhuofu Jan 15, 2025
7feca72
chore: minor
chenzhuofu Jan 15, 2025
9e3c9b9
chore: minor
chenzhuofu Jan 15, 2025
b44268d
chore: minor
chenzhuofu Jan 15, 2025
ae466dc
chore: use small parallelism
chenzhuofu Jan 15, 2025
03e2b49
feat: remove unnecessary env for kernel-tests
chenzhuofu Jan 15, 2025
38e92d2
chore: minor
chenzhuofu Jan 15, 2025
e4ae782
chore: check gpu driver
chenzhuofu Jan 15, 2025
422b982
chore: try to use gpu image
chenzhuofu Jan 15, 2025
8421336
chore: using ubuntu base image for runs-on
chenzhuofu Jan 15, 2025
366d83c
chore: minor
chenzhuofu Jan 15, 2025
3cbee87
chore: minor
chenzhuofu Jan 15, 2025
9808c3d
chore: minor
chenzhuofu Jan 15, 2025
6a45d77
chore: minor
chenzhuofu Jan 15, 2025
13c29af
chore: minor
chenzhuofu Jan 15, 2025
d7d59a3
chore: minor
chenzhuofu Jan 16, 2025
d77866a
Merge branch 'master' of github.com:flexflow/flexflow-train into repo…
chenzhuofu Jan 16, 2025
798f1ec
style: format the code
chenzhuofu Jan 16, 2025
1225f15
Move CI over to proj entirely
lockshaw Jan 17, 2025
f07a906
Merge branch 'master' into repo-refactor-fix-kernels-tests
lockshaw Jan 17, 2025
d60d86a
Minor fixes
lockshaw Jan 17, 2025
1a32bd3
Merge remote-tracking branch 'refs/remotes/flexflow/repo-refactor-fix…
lockshaw Jan 17, 2025
2da9c1b
Move GPU ci into same workflow as cpu ci
lockshaw Jan 17, 2025
fb7e11c
Fix RC_PARAMS
lockshaw Jan 17, 2025
b01339d
Update with merged proj commit
lockshaw Jan 17, 2025
373baf8
Remove now-unused helpers
lockshaw Jan 17, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
9 changes: 0 additions & 9 deletions .github/workflows/helpers/build_target.sh

This file was deleted.

29 changes: 0 additions & 29 deletions .github/workflows/helpers/cmake_cuda.sh

This file was deleted.

8 changes: 8 additions & 0 deletions .github/workflows/helpers/free_space_on_runner_gpu.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
#!/bin/bash
set -euo pipefail
set -x

sudo rm -rf /usr/share/dotnet
sudo rm -rf /usr/local/lib/android
sudo rm -rf /opt/ghc
sudo rm -rf "/usr/local/share/boost"
85 changes: 0 additions & 85 deletions .github/workflows/helpers/gpu_ci_helper.py

This file was deleted.

56 changes: 0 additions & 56 deletions .github/workflows/helpers/install_cudnn.sh

This file was deleted.

31 changes: 0 additions & 31 deletions .github/workflows/helpers/install_dependencies.sh

This file was deleted.

14 changes: 0 additions & 14 deletions .github/workflows/helpers/test_target.sh

This file was deleted.

158 changes: 0 additions & 158 deletions .github/workflows/per-lib-check.yml

This file was deleted.

Loading
Loading