
Feature Request: Add ROCm Support for AMD GPUs or OpenCL Support for Integrated Graphics Acceleration #107

Open
ShadowLoveElysia opened this issue Dec 7, 2024 · 15 comments

Comments

@ShadowLoveElysia

ShadowLoveElysia commented Dec 7, 2024

First of all, thanks for creating such a useful tool. It's been really helpful for a lot of us!

I wanted to bring up something that would make the software even better for users like me who rely on AMD hardware. Currently, the software supports CUDA for GPU acceleration, which is great for NVIDIA users. However, it would be fantastic if we could also have support for ROCm or OpenCL to take advantage of AMD GPUs or integrated graphics.

What I'm suggesting:
ROCM Support: Adding support for ROCm, AMD’s open-source platform for GPU computing, would allow AMD GPU owners to benefit from GPU acceleration within the software.
OpenCL for Integrated Graphics: Similar to how some other tools handle it (like UVR), supporting OpenCL would enable the use of integrated graphics for acceleration, which is particularly beneficial for users with AMD APUs.
Why this would be great:
Improved Performance: Leveraging AMD GPUs or integrated graphics could lead to faster processing times.
Broader Compatibility: This change would cater to a wider range of hardware setups, making the tool more accessible.
I understand that adding new features takes time and effort, but I believe these additions could significantly enhance the user experience for those using AMD hardware. I hope this feature can be considered in future updates.

@ZFTurbo
Owner

ZFTurbo commented Dec 7, 2024

This project uses the torch library, so I think you can use ROCm if you install torch with ROCm support. Check here:
https://pytorch.org/

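For reference, installing a ROCm build of PyTorch is typically a one-liner via the ROCm wheel index. The version tag below is only an example; the current tag is listed on the pytorch.org install selector:

```shell
# Example only -- the rocm6.2 tag is illustrative; use whichever tag pytorch.org currently lists
pip3 install torch torchaudio --index-url https://download.pytorch.org/whl/rocm6.2
```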

@ShadowLoveElysia
Author

> This project uses the torch library, so I think you can use ROCm if you install torch with ROCm support. Check here: https://pytorch.org/

Do you think it would be feasible to use OpenCL for acceleration with integrated graphics? Intel GPUs currently do not support ROCm or CUDA. Adding OpenCL support would enable acceleration using both AMD and Intel integrated graphics, as well as Intel dedicated GPUs. This is how UVR handles it.

@ZFTurbo
Owner

ZFTurbo commented Dec 7, 2024

I think OpenCL is also possible with this pytorch fork:
https://github.com/artyom-beilis/pytorch_dlprim

Some changes to this repository's code may be needed around the 'cuda' keyword, but I personally can't test it.
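The 'cuda' keyword changes mentioned above could be centralized in one helper. Here's a hypothetical sketch of a backend-agnostic device picker that tries CUDA, then DirectML, then falls back to CPU (a pytorch_dlprim/OpenCL branch would need to be added similarly; the function name is an illustration, not code from this repo):

```python
def pick_device():
    """Pick the best available backend: CUDA, then DirectML, then CPU.

    Sketch only -- an OpenCL backend such as pytorch_dlprim would need
    its own branch here.
    """
    try:
        import torch
        if torch.cuda.is_available():
            return torch.device('cuda:0')
    except ImportError:
        pass  # torch not installed at all
    try:
        import torch_directml  # optional dependency
        if torch_directml.is_available():
            return torch_directml.device()
    except ImportError:
        pass  # torch-directml not installed
    return 'cpu'
```

The try/except guards mean the function degrades gracefully on machines where the optional packages are missing.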

@deton24

deton24 commented Dec 24, 2024

Actually, UVR always used DirectML rather than OpenCL. It was a naming mistake by Anjok, corrected in the newer beta Roformer patches.

@ZFTurbo
Owner

ZFTurbo commented Dec 24, 2024

I think it can be used with this repo with minimal changes:
https://learn.microsoft.com/en-us/windows/ai/directml/pytorch-windows

@aqst

aqst commented Dec 28, 2024

Hi, I tried following the pytorch-windows guide and changed a few lines in inference.py like this:

import torch_directml
...
def proc_folder(args):
    ...
    if torch_directml.is_available():
        print('DirectML is available, use --force_cpu to disable it.')
        if isinstance(args.device_ids, list):
            device = torch_directml.device(args.device_ids[0])
            device_name = torch_directml.device_name(args.device_ids[0])
        else:
            device = torch_directml.device(args.device_ids)
            device_name = torch_directml.device_name(args.device_ids)

but I got this error:

DirectML is available, use --force_cpu to disable it.
Using device:  AMD Radeon RX 6800
Start from checkpoint: E:\Music-Source-Separation-Training-main\checkpoints\MelBandRoformer.ckpt
Instruments: ['vocals', 'other']
Model load time: 1.70 sec
Total files found: 1. Using sample rate: 44100
Processing track: E:\input\test.flac

Processing audio chunks:   0%|          | 0/8037372 [00:00<?, ?it/s]
[F1228 19:15:36.000000000 dml_util.cc:118] Invalid or unsupported data type ComplexFloat.
Process failed with return code 3221226505

I came across this page that says complex tensors aren't supported in DirectML. Any ideas how it might be possible to work around this?

@ZFTurbo
Owner

ZFTurbo commented Dec 28, 2024

Maybe it's possible, but for that we would need to change the STFT conversion of data inside the MelRoformer model to avoid complex numbers. I'm not sure how easy that would be.

@KitsuneX07
Contributor

> Maybe it's possible, but for that we would need to change the STFT conversion of data inside the MelRoformer model to avoid complex numbers. I'm not sure how easy that would be.

Maybe this repository would help, but I currently have no time to look into it.

@deton24

deton24 commented Dec 29, 2024 via email

@ShadowLoveElysia
Author

I am trying to modify inference.py to adapt it to DirectML for inference, but I do not understand how DirectML works. There may be other areas beyond inference.py that need modification, so I might need some time for research and trial and error, and I may not be able to produce a decent version. If anyone has ideas, feel free to try modifying it; no need to wait for me.

@ShadowLoveElysia
Author

I also noticed that the developers are trying to add DirectML support, and Anjok's approach is worth discussing. As the saying goes, "one generation does the hard work, and the next generation benefits from it." I will try to research this together with my friends.
(Apologies, I accidentally sent my previous two comments before finishing them, which may have caused duplicate messages.)

@jarredou
Contributor

If needed, UVR's beta with Roformers and DirectML code is available here: https://github.com/Anjok07/ultimatevocalremovergui/tree/v5.6.0_roformer_add%2Bdirectml

@deton24

deton24 commented Dec 30, 2024 via email

@ZFTurbo
Owner

ZFTurbo commented Jan 1, 2025

@aqst

In the roformer code you can try to change these lines:

stft_repr = torch.stft(raw_audio, **self.stft_kwargs, window=stft_window, return_complex=True)
stft_repr = torch.view_as_real(stft_repr)

to this:

stft_repr = torch.stft(raw_audio, **self.stft_kwargs, window=stft_window, return_complex=False)

The two are equivalent, and you avoid the complex64 tensor type.
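To illustrate why the two forms carry the same information, here's a small pure-Python sketch (a toy DFT standing in for torch.stft, so it runs without torch): the complex spectrum and its stacked (real, imag) pairs are interconvertible without loss, which is exactly what torch.view_as_real / return_complex=False rely on.

```python
import cmath

def dft(x):
    # Naive DFT, standing in for torch.stft's complex output.
    N = len(x)
    return [sum(x[n] * cmath.exp(-2j * cmath.pi * k * n / N) for n in range(N))
            for k in range(N)]

signal = [0.0, 1.0, 0.0, -1.0]
spec_complex = dft(signal)                             # like return_complex=True
spec_pairs = [(c.real, c.imag) for c in spec_complex]  # like torch.view_as_real
# Rebuilding the complex spectrum from the real/imag pairs loses nothing:
rebuilt = [complex(r, i) for r, i in spec_pairs]
assert all(abs(a - b) < 1e-9 for a, b in zip(rebuilt, spec_complex))
```

The real-pair form avoids complex dtypes entirely, which is why it sidesteps DirectML's missing ComplexFloat support.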

@aqst

aqst commented Jan 2, 2025

I tried changing that in mel_band_roformer.py, but unfortunately I got the same error.

I also saw line 487 of bs_roformer.py and tried something similar to run that part on the CPU:

stft_repr = torch.stft(raw_audio.cpu(), **self.stft_kwargs, window=stft_window.cpu(), return_complex=True)
stft_repr = torch.view_as_real(stft_repr).to(device)

but then I got this error:

  File "E:\Music-Source-Separation-Training-main\models\bs_roformer\mel_band_roformer.py", line 531, in forward
    x = stft_repr[batch_arange, self.freq_indices]
        ~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
IndexError: shape mismatch: indexing tensors could not be broadcast together with shapes [4, 1], [3958]

I'm not sure what that means, but maybe there's some kind of issue in DirectML.

Development

No branches or pull requests

6 participants