CpuMath Enhancement: Optimize codegen of inlining in hardware intrinsics APIs #833

Open
briancylui opened this issue Sep 5, 2018 · 0 comments
Labels
P2 Priority of the issue for triage purpose: Needs to be fixed at some point.

Comments

@briancylui
Contributor

Style changes needed to solve part of #823

Details

  • (from Intel partners) In the original code of src\Microsoft.ML.CpuMath\SseIntrinsics.cs, even though VectorSum is inlined, the generated code is suboptimal because of register spills and reloads. The JIT appears to have an optimization opportunity here. Would you mind opening an issue on the CoreCLR GitHub repo to discuss it?
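For readers who have not seen the pattern being discussed, here is a minimal sketch of an inlined horizontal-sum helper called after a vectorized loop. It is not the ML.NET source: the class name `SumSketch`, the method `Sum`, and the body of `VectorSum` are assumptions made for illustration, and whether the JIT actually spills registers around the inlined call depends on the runtime version. Comparing the disassembly of `Sum` with a version where the helper body is pasted in by hand is one way to check for the spill and reload described above.

```csharp
using System;
using System.Runtime.CompilerServices;
using System.Runtime.Intrinsics;
using System.Runtime.Intrinsics.X86;

public static class SumSketch
{
    // Hypothetical stand-in for the VectorSum helper; not copied from SseIntrinsics.cs.
    [MethodImpl(MethodImplOptions.AggressiveInlining)]
    private static Vector128<float> VectorSum(Vector128<float> v)
    {
        // (a, b, c, d) + (c, d, c, d) gives (a+c, b+d, 2c, 2d).
        Vector128<float> tmp = Sse.Add(v, Sse.MoveHighToLow(v, v));
        // 0xB1 swaps adjacent lanes, so lane 0 becomes (a+c) + (b+d).
        return Sse.Add(tmp, Sse.Shuffle(tmp, tmp, 0xB1));
    }

    public static float Sum(float[] src)
    {
        float total = 0f;
        int i = 0;

        if (Sse.IsSupported)
        {
            Vector128<float> acc = Vector128<float>.Zero;

            // Accumulate four floats per iteration. Vector128.Create keeps the
            // sketch free of unsafe code; production code would typically load
            // through a pointer instead.
            for (; i + 4 <= src.Length; i += 4)
            {
                acc = Sse.Add(acc, Vector128.Create(src[i], src[i + 1], src[i + 2], src[i + 3]));
            }

            // The inlined helper is expected to operate on acc while it is
            // still in a register, with no store and reload through the stack.
            total = VectorSum(acc).ToScalar();
        }

        // Scalar tail, and the full fallback when SSE is unavailable.
        for (; i < src.Length; i++)
        {
            total += src[i];
        }

        return total;
    }
}
```

Calling `SumSketch.Sum(new float[] { 1, 2, 3, 4, 5, 6, 7, 8, 9 })` exercises both the vector loop and the scalar tail.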

cc: @tannergooding since you might already have addressed this issue.

@briancylui briancylui changed the title Optimize codegen of inlining in hardware intrinsics APIs CpuMath Enhancement: Optimize codegen of inlining in hardware intrinsics APIs Sep 6, 2018
@ganik ganik added the P2 Priority of the issue for triage purpose: Needs to be fixed at some point. label Jan 10, 2020