Fixing executable linking when other targets are present. (#19035)
The existing linking code was all kinds of wrong when multiple executables with disjoint entry points were present. Linking needs to be reworked in general, but this incremental change ensures that targets only link executables that contain variants that use them. There are definitely still corner cases that don't work.

This is a small step towards heterogeneous devices. The following example now compiles and runs:

```mlir
#executable_target_embedded_elf_x86_64_ = #hal.executable.target<"llvm-cpu", "embedded-elf-x86_64", {target_triple = "x86_64-none-elf", native_vector_size = 4 : index}>
#executable_target_embedded_elf_x86_64_whatever = #hal.executable.target<"llvm-cpu", "embedded-elf-x86_64", {target_triple = "x86_64-none-elf", native_vector_size = 16 : index}>
#executable_target_vmvx_bytecode_fb = #hal.executable.target<"vmvx", "vmvx-bytecode-fb">
util.global private @device_a = #hal.device.target<"local", {ordinal = 0 : index}, [
  #executable_target_embedded_elf_x86_64_,
  #executable_target_embedded_elf_x86_64_whatever
]> : !hal.device
util.global private @device_b = #hal.device.target<"local", {ordinal = 1 : index}, [
  #executable_target_vmvx_bytecode_fb
]> : !hal.device
func.func public @mutli_device_mul_add(
  // Input argument is resident on device_a (tooling defaults to first device).
  %input_a: tensor<4xf32> {iree.abi.affinity = #hal.device.affinity<@device_a>}
) -> (
  // Output result is expected to be on device_a (though not required).
  tensor<4xf32> {iree.abi.affinity = #hal.device.affinity<@device_a>}
) {
  // Compute on device_a (input is there).
  %constant_a = arith.constant dense<[0.0, 1.0, 2.0, 3.0]> : tensor<4xf32>
  %transient_a = arith.mulf %input_a, %constant_a : tensor<4xf32>
  // Transfer the result from device_a -> device_b.
  %transient_b = flow.tensor.transfer %transient_a : tensor<4xf32> to #hal.device.affinity<@device_b>
  // Compute on device_b.
  %constant_b = arith.constant dense<[4.0, 5.0, 6.0, 7.0]> : tensor<4xf32>
  %result_b = arith.mulf %transient_b, %constant_b : tensor<4xf32>
  // Transfer the result from device_b -> device_a.
  %result_a = flow.tensor.transfer %result_b : tensor<4xf32> to #hal.device.affinity<@device_a>
  // More compute on device_a - should produce into the result buffer.
  %result_a2 = arith.addf %result_a, %constant_a : tensor<4xf32>
  // Return the result on device_a (as required by ABI attr).
  func.return %result_a2 : tensor<4xf32>
}
```

```sh
$ iree-compile --iree-execution-model=async-external iree-run-module-multi.mlir -o module.vmfb
$ iree-run-module \
    --module=module.vmfb --function=mutli_device_mul_add --input=4xf32=10,11,12,13 \
    --device=local-task --device=local-task --task_topology_group_count=1
```

(testing this in-tree is hard right now due to ergonomics issues - this is all experimental anyway)
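For checking the output of the run above by hand, here is a minimal plain-Python sketch of the arithmetic the MLIR function performs. It is only a reference model: the helper name `multi_device_mul_add_reference` is hypothetical, and the device transfers are treated as no-ops because they do not change the values.

```python
# Hypothetical reference model of the example function's arithmetic.
# Device transfers (flow.tensor.transfer) move data but do not alter it,
# so they are omitted here.

CONSTANT_A = [0.0, 1.0, 2.0, 3.0]  # %constant_a in the MLIR example
CONSTANT_B = [4.0, 5.0, 6.0, 7.0]  # %constant_b in the MLIR example


def multi_device_mul_add_reference(input_a):
    # device_a: elementwise multiply by constant_a.
    transient = [x * a for x, a in zip(input_a, CONSTANT_A)]
    # device_b: elementwise multiply by constant_b.
    result_b = [t * b for t, b in zip(transient, CONSTANT_B)]
    # device_a: elementwise add of constant_a.
    return [r + a for r, a in zip(result_b, CONSTANT_A)]


if __name__ == "__main__":
    # Same input as --input=4xf32=10,11,12,13 in the command above.
    print(multi_device_mul_add_reference([10.0, 11.0, 12.0, 13.0]))
    # prints [0.0, 56.0, 146.0, 276.0]
```

If the two-device module is linked and dispatched correctly, `iree-run-module` should report the same four values for that input.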
Showing 7 changed files with 70 additions and 56 deletions.