-
Notifications
You must be signed in to change notification settings - Fork 751
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SYCLcompat] Add support for device attribute MAX_PITCH and ASYNC_ENGINE_COUNT #16533
base: sycl
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Minor comment, otherwise syclcompat
looks okay.
I may be misremembering, but I don't think the above is correct. IIRC querying CU_DEVICE_ATTRIBUTE_ASYNC_ENGINE_COUNT can return greater than 2 depending upon the NVLINK capabilities. In any case I don't understand why you are always returning 0 for It may also have different mappings to other backends (e.g. AMD), and the call to |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Apart from the minor comment I left earlier, looking at this again, @JackAKirk is right that it seems a little strange to return 0 for all backends from get_async_engine_count
when this should be correctly queried via device::get_info
which calls into the respective backend query. I am still okay with this if the above stated may not be needed or if there's a plan to follow up with it in case it is needed. Thanks!
Apparently getting l0 path merged first is the priority, so I guess it is OK so long as there is an error message added that is returned if the backend in not l0? Saying something like "This query is only currently supported on the L0 backend" |
Sure okay, as long the associated commit message is also updated, I am okay with it. @the-slow-one |
@the-slow-one would it be possible to add something like that? Or maybe create an issue to track the creation of the UR query? |
@ProGTX I have added an issue #16663. I hope the details are sufficient. Please let me know. |
Sounds good, thank you! |
CU_DEVICE_ATTRIBUTE_AYSNC_ENGINE_COUNT is associated with data transfer between device and host and kernel execution. This doesn't make sense in L0 backend and hence we return 0.
CU_DEVICE_ATTRIBUTE_MAX_PITCH return INT_MAX, indicating there is no limits! Intel GPU shows the same behavior too!
Signed-off-by: Deepak Raj H R