Enable ukernels on remaining aarch64 targets #19901

bjacob · 2025-02-04T17:36:42Z

As newer aarch64 targets increasingly support SVE and SME, this clause was preventing ukernels from being used in cases where they do speed things up. The reason why this logic was out of place here is that what it controls here is the enablement of ukernels, which are a detail of lowering an already tiled workload. If we wanted to use SVE with a variable vector length, or with a fixed vector length different from NEON's 128bit, that decision needed to be made earlier; conversely, if the workload at this point already has the right shaped to be matched to a NEON ukernel, then SVE is not relevant to it anymore.

FYI @ziereis , this results in substantially faster code in your test case from #19873.

Signed-off-by: Benoit Jacob <jacob.benoit.1@gmail.com>

hanhanW

cc @banach-space you're able to disable the ukernels with --iree-llvmcpu-enable-ukernels=none flag. The decision of whether using data-tiling is in

iree/compiler/src/iree/compiler/Codegen/ExternalInterfaces/CPUEncodingExternalModels.cpp

Lines 389 to 394 in 82255c7

    
           static SmallVector<TileMxNxK> enumerateMatmulTileArm64(TypeRange elementTypes, 
        
                                                                  DictionaryAttr config) { 
        
             // Data-tiling for SVE is not implemented yet. 
        
             if (hasFeature(config, "+sve") || hasFeature(config, "+sve2")) { 
        
               return {}; 
        
             }

banach-space · 2025-02-04T19:55:34Z

cc @banach-space you're able to disable the ukernels with --iree-llvmcpu-enable-ukernels=none flag. The decision of whether using data-tiling is in

iree/compiler/src/iree/compiler/Codegen/ExternalInterfaces/CPUEncodingExternalModels.cpp

Lines 389 to 394 in 82255c7

static SmallVector<TileMxNxK> enumerateMatmulTileArm64(TypeRange elementTypes,

DictionaryAttr config) {

// Data-tiling for SVE is not implemented yet.

if (hasFeature(config, "+sve") || hasFeature(config, "+sve2")) {

return {};

}

Thanks for the ping 🙏🏻

enable ukernels on all aarch64

2a937b4

Signed-off-by: Benoit Jacob <jacob.benoit.1@gmail.com>

bjacob requested review from hanhanW and Max191 February 4, 2025 17:37

bjacob marked this pull request as ready for review February 4, 2025 17:37

bjacob requested a review from MaheshRavishankar as a code owner February 4, 2025 17:37

hanhanW approved these changes Feb 4, 2025

View reviewed changes

bjacob enabled auto-merge (squash) February 4, 2025 17:55

bjacob merged commit eb19497 into iree-org:main Feb 4, 2025
42 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable ukernels on remaining aarch64 targets #19901

Enable ukernels on remaining aarch64 targets #19901

bjacob commented Feb 4, 2025 •

edited

Loading

hanhanW left a comment

banach-space commented Feb 4, 2025

	static SmallVector<TileMxNxK> enumerateMatmulTileArm64(TypeRange elementTypes,
	DictionaryAttr config) {
	// Data-tiling for SVE is not implemented yet.
	if (hasFeature(config, "+sve") \|\| hasFeature(config, "+sve2")) {
	return {};
	}

Enable ukernels on remaining aarch64 targets #19901

Enable ukernels on remaining aarch64 targets #19901

Conversation

bjacob commented Feb 4, 2025 • edited Loading

hanhanW left a comment

Choose a reason for hiding this comment

banach-space commented Feb 4, 2025

bjacob commented Feb 4, 2025 •

edited

Loading