
[SYCLcompat] Add support for device attribute MAX_PITCH and ASYNC_ENGINE_COUNT #16533

Open: the-slow-one wants to merge 3 commits into sycl
Conversation

@the-slow-one commented Jan 7, 2025

CU_DEVICE_ATTRIBUTE_ASYNC_ENGINE_COUNT is associated with concurrent data transfer between device and host alongside kernel execution. This concept does not map to the L0 backend, hence we return 0.

CU_DEVICE_ATTRIBUTE_MAX_PITCH returns INT_MAX, indicating there is no limit; Intel GPUs show the same behavior.

Signed-off-by: Deepak Raj H R
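
A minimal sketch of what the two syclcompat queries from this PR might look like (member names follow the attribute names in the title; the exact class shape and signatures are assumptions, not the PR's actual diff):

```cpp
// Hypothetical sketch only; the real change lives in
// sycl/include/syclcompat/device.hpp and may differ in shape.
#include <limits>

namespace syclcompat {

class device_ext {
public:
  // Maximum pitch in bytes allowed by memory copies. Mirrors
  // CU_DEVICE_ATTRIBUTE_MAX_PITCH returning INT_MAX ("no limit"),
  // which Intel GPUs match.
  int get_max_pitch() const { return std::numeric_limits<int>::max(); }

  // Number of asynchronous copy engines. The concept does not map to
  // the Level Zero backend, so 0 is returned (per this PR).
  int get_async_engine_count() const { return 0; }
};

} // namespace syclcompat
```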

@the-slow-one the-slow-one requested a review from a team as a code owner January 7, 2025 06:30
@GeorgeWeb (Contributor) left a comment

Minor comment, otherwise syclcompat looks okay.

(Review comment on sycl/include/syclcompat/device.hpp; outdated, resolved.)
@JackAKirk (Contributor) commented Jan 7, 2025

> CU_DEVICE_ATTRIBUTE_ASYNC_ENGINE_COUNT is associated with the value 2 for bidirectional data transfer between host and device while executing a kernel, with 1 in the case of unidirectional data transfer, and with 0 when there is no support for concurrent data transfer and kernel execution.

I may be misremembering, but I don't think the above is correct. IIRC, querying CU_DEVICE_ATTRIBUTE_ASYNC_ENGINE_COUNT can return values greater than 2 depending on the NVLink capabilities.

In any case, I don't understand why you are always returning 0 from get_async_engine_count if you want to map it to the CU_DEVICE_ATTRIBUTE_ASYNC_ENGINE_COUNT device query.
The correct, future-proof way of handling such device queries for all existing and future devices is to add a query to urDeviceGetInfo that calls cuDeviceGetAttribute with CU_DEVICE_ATTRIBUTE_ASYNC_ENGINE_COUNT (https://github.com/oneapi-src/unified-runtime/blob/main/source/adapters/cuda/device.cpp#L41), and then to call this UR API from the syclcompat entry point via a mapped CU_DEVICE_ATTRIBUTE_ASYNC_ENGINE_COUNT query.

It may also have different mappings to other backends (e.g. AMD), and the call to urDeviceGetInfo allows such backends to return an appropriate value.
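
For illustration, a sketch of what such a case in the CUDA adapter's urDeviceGetInfo might look like (cuDeviceGetAttribute and CU_DEVICE_ATTRIBUTE_ASYNC_ENGINE_COUNT are real CUDA driver API names; the UR_DEVICE_INFO_ASYNC_ENGINE_COUNT enumerator is a hypothetical addition):

```cpp
// Hypothetical new case inside urDeviceGetInfo in
// source/adapters/cuda/device.cpp; the UR enumerator is an assumption.
case UR_DEVICE_INFO_ASYNC_ENGINE_COUNT: {
  int Count = 0;
  // Real CUDA driver attribute: number of asynchronous copy engines.
  UR_CHECK_ERROR(cuDeviceGetAttribute(
      &Count, CU_DEVICE_ATTRIBUTE_ASYNC_ENGINE_COUNT, hDevice->get()));
  return ReturnValue(Count);
}
```

A non-CUDA adapter (e.g. HIP for AMD) would then implement the same enumerator with its own backend query, which is exactly what routing through urDeviceGetInfo buys.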

@GeorgeWeb (Contributor) left a comment

Apart from the minor comment I left earlier: looking at this again, @JackAKirk is right that it seems a little strange to return 0 for all backends from get_async_engine_count, when this should be queried via device::get_info, which calls into the respective backend query. I am still okay with this if the above is not needed, or if there is a plan to follow up with it in case it is. Thanks!

@JackAKirk (Contributor) commented Jan 14, 2025

> Apart from the minor comment I left earlier: looking at this again, @JackAKirk is right that it seems a little strange to return 0 for all backends from get_async_engine_count, when this should be queried via device::get_info, which calls into the respective backend query. I am still okay with this if the above is not needed, or if there is a plan to follow up with it in case it is. Thanks!

Apparently getting the L0 path merged first is the priority, so I guess it is OK as long as an error message is returned when the backend is not L0?

Saying something like "This query is only currently supported on the L0 backend".
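
A minimal sketch of such a guard in the syclcompat entry point (the error type follows SYCL 2020's sycl::exception; the surrounding member function is an assumption):

```cpp
// Hypothetical guard; sycl::backend::ext_oneapi_level_zero and
// sycl::errc::feature_not_supported are real SYCL 2020 names.
int get_async_engine_count() const {
  if (get_backend() != sycl::backend::ext_oneapi_level_zero)
    throw sycl::exception(
        sycl::make_error_code(sycl::errc::feature_not_supported),
        "This query is only currently supported on the L0 backend");
  return 0;
}
```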

@GeorgeWeb (Contributor) commented Jan 14, 2025

> > Apart from the minor comment I left earlier: looking at this again, @JackAKirk is right that it seems a little strange to return 0 for all backends from get_async_engine_count, when this should be queried via device::get_info, which calls into the respective backend query. I am still okay with this if the above is not needed, or if there is a plan to follow up with it in case it is. Thanks!
>
> Apparently getting the L0 path merged first is the priority, so I guess it is OK as long as an error message is returned when the backend is not L0?
>
> Saying something like "This query is only currently supported on the L0 backend".

Sure, okay. As long as the associated commit message is also updated, I am okay with it. @the-slow-one
Also, I'll approve after the INT_MAX -> std::numeric_limits<int>::max() feedback comment is addressed, unless there is an issue with this that I am not seeing, in which case do let me know. Thanks!
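
The requested change, sketched (the function name is illustrative):

```cpp
#include <limits> // replaces the <climits> dependency of INT_MAX

int get_max_pitch() {
  // was: return INT_MAX;
  return std::numeric_limits<int>::max();
}
```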

@ProGTX (Contributor) commented Jan 15, 2025

> Saying something like "This query is only currently supported on the L0 backend".

@the-slow-one would it be possible to add something like that? Or maybe create an issue to track the creation of the UR query?

@the-slow-one (Author) commented Jan 16, 2025

> > Saying something like "This query is only currently supported on the L0 backend".
>
> @the-slow-one would it be possible to add something like that? Or maybe create an issue to track the creation of the UR query?

@ProGTX I have filed issue #16663. I hope the details are sufficient. Please let me know.

@ProGTX (Contributor) commented Jan 16, 2025

> > > Saying something like "This query is only currently supported on the L0 backend".
> >
> > @the-slow-one would it be possible to add something like that? Or maybe create an issue to track the creation of the UR query?
>
> @ProGTX I have filed issue #16663. I hope the details are sufficient. Please let me know.

Sounds good, thank you!
