Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use TFT with GPU #37

Open
vidarsumo opened this issue Aug 27, 2022 · 3 comments
Open

Use TFT with GPU #37

vidarsumo opened this issue Aug 27, 2022 · 3 comments
Labels
bug Something isn't working

Comments

@vidarsumo
Copy link

I tested TFT on a Azure DSVM which has CUDA and cuDNN installed (https://docs.microsoft.com/en-us/azure/machine-learning/data-science-virtual-machine/tools-included) but it did not use the GPU (V100).
Do I have to do something so TFT uses the GPU?

@cregouby
Copy link
Collaborator

Hello @vidarsumo
Looking for device = in the code gives you the clue that the code seems pretty complete in sending all tensor to device = "gpu" everywhere. But we may have miss something.
Currently the CI/CD workflow of the package is not run on GPU.
But on cloud, there is always a chance that the GPU may not be detected depending on virtualization stack. Can you confirm

  1. you get the expected result from torch::cuda_is_available() and torch::cuda_device_count()
  2. you configured an explicit device = "cuda" in the tft config parameter ?
    Hope it helps

@vidarsumo
Copy link
Author

torch::cuda_is_available() is FALSE so I need to check on that. But I did not find anything in the code where I could set device = "gpu".

@cregouby
Copy link
Collaborator

Hello @vidarsumo
Sorry, my mistake, device = "cuda" is not available on the user API, and is poorly used and configured in the code.
We definitively need to improve on that.

@cregouby cregouby added the bug Something isn't working label Sep 20, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants