Skip to content

Current TODO items #2

@oowekyala

Description

@oowekyala

High priority

Cost model (deadline 04.08)

Currently we have cinm.compute with attributes for workgroup shape and DPU memory size.
We assume this specification is correct, that is, the lowering pass cannot change them.
They should be obtained through the cost model.

  • Implement a simple Samsung dialect
  • Implement a pass that annotates Samsung and Upmem kernels with their time estimation
    • Implement the upmem cost estimator in C++

Lower priority

  • Add verifier for shape of scatter map in UPMEM
  • Fix the GPU lowering, was probably broken by recent changes to CNM

Optimization

  • Hoist buffer alloc and free outside of loops
  • Malloc avoidance
    • Avoid tensor reshapes that do a copy (Especially for VA that's a problem)
    • Unify buffers across loop iterations
  • Affine map simplification with dimension sizes

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions