Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Current TODO items #2

Open
7 of 18 tasks
oowekyala opened this issue Jun 18, 2024 · 0 comments
Open
7 of 18 tasks

Current TODO items #2

oowekyala opened this issue Jun 18, 2024 · 0 comments

Comments

@oowekyala
Copy link
Collaborator

oowekyala commented Jun 18, 2024

High priority

Cost model (deadline 04.08)

Currently we have cinm.compute with attributes for workgroup shape and DPU memory size.
We assume this specification is correct, that is, the lowering pass cannot change them.
They should be obtained through the cost model.

  • Implement a simple Samsung dialect
  • Implement a pass that annotates Samsung and Upmem kernels with their time estimation
    • Implement the upmem cost estimator in C++

Lower priority

  • Add verifier for shape of scatter map in UPMEM
  • Fix the GPU lowering, was probably broken by recent changes to CNM

Optimization

  • Hoist buffer alloc and free outside of loops
  • Malloc avoidance
    • Avoid tensor reshapes that do a copy (Especially for VA that's a problem)
    • Unify buffers across loop iterations
  • Affine map simplification with dimension sizes
@oowekyala oowekyala pinned this issue Jun 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant