-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Failing tests on AMPERE80 with gcc-13 and cuda-12.6 #73
Comments
As of heFFTe 2.4.1, the GPU aware command should be The GPU vector test is passing, which means cuda
heffte/src/heffte_backend_cuda.cu Line 138 in c61c772
|
Thank you for the feedback. Most of the tests are failing. Additionally I added a simple I attach a full log-file of the tests run with the above configuration. |
Try adding the following to the CMake options:
CMake demands that On another note, by default, heFFTe launches all kernels on the default GPU 0. This will not result in a crash, as multiple MPI ranks can use the GPU at the same time. The one GPU being hammered with work may end up running out of memory, but other than that, everything is fine. The benchmark has the |
This should fix the issue when using sufficiently new CMake. Please confirm that this is indeed the fix for you. Otherwise we'll keep digging. |
I tried to update the compiler toolchain used to build HeFFTe (c61c772).
This leads to failing tests with the CUDA backend on an Nvidia A100.
The fftw CPU backend seems to work fine with these compilers.
I use the following toolchain:
Container Image:
https://mpcdf.pages.mpcdf.de/ci-module-image/latest.html
gitlab-registry.mpcdf.mpg.de/mpcdf/ci-module-image/gcc-cuda:latest
The code was compiled using cmake/3.30 with the following preset
and the tests fail with the output
I removed some of the output to improve readability.
I am happy for any help. Thank you in advance.
The text was updated successfully, but these errors were encountered: