[Codegen][GPU] Add kernel config for LLVMGPUTileAndFuse #17791

qedawkins · 2024-07-02T14:46:51Z

This adds kernel configuration logic for targeting simple thread distribution of linalg-based dispatches on LLVMGPU. The configuration logic is primarily copied from the SPIR-V side, whose heuristics are already well tested across the varied target descriptions found there.

Currently this is locked behind a flag `iree-codegen-llvmgpu-use-tile-and-fuse`. Future patches will add specialized logic for matmul.
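
As a rough illustration of what "simple thread distribution" can mean, the sketch below splits a fixed thread budget across the parallel loop dimensions of a dispatch, innermost dimension first. It is not the code in this patch and not the SPIR-V heuristic itself: `distributeThreads`, its parameters, and the power-of-two policy are all assumptions made for the example. Non-positive bounds stand in for dynamic sizes and are left at one thread.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not an IREE API): assigns a thread count to each
// parallel loop dimension, starting from the innermost one. Each count is a
// power of two that evenly divides the loop bound, and the product of all
// counts never exceeds `numThreads`.
std::vector<int64_t> distributeThreads(const std::vector<int64_t> &bounds,
                                       int64_t numThreads) {
  assert(numThreads > 0 && "expected a positive thread budget");
  std::vector<int64_t> threadCounts(bounds.size(), 1);
  int64_t remaining = numThreads;
  for (std::size_t i = bounds.size(); i-- > 0 && remaining > 1;) {
    // Assumption: a non-positive bound models a dynamic size, which this
    // sketch does not distribute.
    if (bounds[i] <= 0)
      continue;
    // Largest power of two that fits in the remaining budget.
    int64_t candidate = 1;
    while (candidate * 2 <= remaining)
      candidate *= 2;
    // Shrink until the candidate evenly divides the loop bound.
    while (candidate > 1 && bounds[i] % candidate != 0)
      candidate /= 2;
    threadCounts[i] = candidate;
    remaining /= candidate;
  }
  return threadCounts;
}
```

With loop bounds `{8, 6, 16}` and a budget of 64 threads, this sketch assigns 16 threads to the innermost dimension, then 2 and 2 to the outer ones. Real heuristics also weigh vector widths and target limits, which are omitted here.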

compiler/src/iree/compiler/Codegen/Dialect/GPU/TargetUtils/ConfigUtils.cpp

Max191

I did some testing with this, and there are some compiler failures and correctness issues with some dispatches. We should debug these issues before we land this change.

compiler/src/iree/compiler/Codegen/LLVMGPU/KernelConfig.cpp

compiler/src/iree/compiler/Codegen/Utils/Utils.cpp

qedawkins · 2024-08-15T14:12:53Z

> I did some testing with this, and there are some compiler failures and correctness issues with some dispatches. We should debug these issues before we land this change.

We talked offline; the correctness issues looked to be a floating-point precision ghost. I verified end-to-end correctness of this patch on SDXL int8 by generating an image.

Max191

I think it is fine to land. There is a similar issue with VectorDistribute fusions, which may be causing minor precision differences. I talked with @MaheshRavishankar and we still want to track this precision difference. Ideally it should be possible to turn these precision differences off when necessary, so that results are bitwise exact regardless of pipeline. This is not blocking progress, since the numerics are accurate overall, but we should track it.
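
For readers unfamiliar with why two compilation pipelines can disagree bitwise while both being "accurate", here is a minimal sketch of one common cause: floating-point addition is not associative, so changing the order of operations can change the rounded result. This is an illustration only, not a diagnosis of the difference seen in this PR. The function names are hypothetical, and the example assumes IEEE-754 binary32 `float`.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Computes (a + b) + c. The volatile temporary forces the intermediate sum
// to be rounded to float before the second addition.
float sumLeftFirst(float a, float b, float c) {
  volatile float partial = a + b;
  return partial + c;
}

// Computes a + (b + c), i.e. the same sum with a different grouping.
float sumRightFirst(float a, float b, float c) {
  volatile float partial = b + c;
  return a + partial;
}

// Compares the raw bit patterns of two floats.
bool bitwiseEqual(float x, float y) {
  std::uint32_t xBits, yBits;
  std::memcpy(&xBits, &x, sizeof xBits);
  std::memcpy(&yBits, &y, sizeof yBits);
  return xBits == yBits;
}

// Self-check: near 1e8 adjacent floats are 8 apart, so -1e8 + 1 rounds back
// to -1e8 and the right-first grouping loses the 1.
void checkReassociationExample() {
  const float a = 1.0e8f, b = -1.0e8f, c = 1.0f;
  assert(sumLeftFirst(a, b, c) == 1.0f);
  assert(sumRightFirst(a, b, c) == 0.0f);
  assert(!bitwiseEqual(sumLeftFirst(a, b, c), sumRightFirst(a, b, c)));
}
```

The inputs here are chosen to exaggerate the effect. In practice the differences are usually in the last few bits, which is consistent with an end-to-end result that still looks correct.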

qedawkins force-pushed the simt_kernel_config branch 2 times, most recently from 55a8f7e to b180ec1 Compare August 8, 2024 20:20

qedawkins marked this pull request as ready for review August 9, 2024 17:47

qedawkins requested review from antiagainst, MaheshRavishankar, kuhar and Groverkss as code owners August 9, 2024 17:47

qedawkins requested a review from Max191 August 9, 2024 17:47

kuhar reviewed Aug 10, 2024

Max191 requested changes Aug 12, 2024

Max191 reviewed Aug 12, 2024

compiler/src/iree/compiler/Codegen/LLVMGPU/KernelConfig.cpp

qedawkins force-pushed the simt_kernel_config branch from b180ec1 to 055bfc4 Compare August 13, 2024 20:36

nirvedhmeshram reviewed Aug 13, 2024

compiler/src/iree/compiler/Codegen/Utils/Utils.cpp

qedawkins force-pushed the simt_kernel_config branch from 055bfc4 to 3d5f4ea Compare August 15, 2024 14:05

qedawkins requested review from Max191 and kuhar August 15, 2024 14:43

Max191 approved these changes Aug 16, 2024

qedawkins added 4 commits August 17, 2024 11:18

Address comments

093dde1

restrict config to static scf.forall loops

0dc6308

Make the flags more sensible

5909a05

qedawkins force-pushed the simt_kernel_config branch from 2a42789 to 5909a05 Compare August 17, 2024 17:09

Add 4xdword todo comment

6c9ae7c

qedawkins merged commit 10ba28d into iree-org:main Aug 17, 2024
35 of 36 checks passed

qedawkins deleted the simt_kernel_config branch August 17, 2024 18:08
