
Conversation

@anadeem2 (Contributor) commented Jul 7, 2022

Description

Matmul lowering for the native trace of BERT-like Transformer models. The matmul op is currently only around 60% complete (sufficient for now). It does not natively support conversion of higher-dimensional (4D+) matrices, so we fall back to CPU to ensure 100% coverage. The problem is that we cannot use view, since it modifies the same memory location and the values have not yet materialized in aten_raf_type. Potential fixes are:

  1. Implement matrix folding (similar to PyTorch); a minimal sketch of this idea is shown after the list.
  2. Lower to and utilize einsum.
  3. In raf_node_lowering, try reshape + transpositions to convert higher-dimensional matrices.
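A minimal sketch of the matrix-folding idea in fix (1), assuming PyTorch semantics and identical (non-broadcast) batch dimensions on both inputs; the function name and shapes are illustrative, not the PR's actual lowering:

```python
import torch

def folded_matmul(a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
    """Fold all leading batch dims into one, run a 3D bmm, then unfold."""
    batch_shape = a.shape[:-2]            # e.g. (B1, B2) for a 4D input
    m, k = a.shape[-2:]
    k2, n = b.shape[-2:]
    assert k == k2, "inner dimensions must match"
    a3 = a.reshape(-1, m, k)              # (B1*B2, m, k)
    b3 = b.reshape(-1, k, n)              # (B1*B2, k, n)
    out = torch.bmm(a3, b3)               # 3D bmm is already supported
    return out.reshape(*batch_shape, m, n)

x = torch.randn(2, 4, 5, 6)
y = torch.randn(2, 4, 6, 3)
assert torch.allclose(folded_matmul(x, y), x @ y, atol=1e-5)
```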

Checklist

  • PR's title starts with a category (e.g. [BUGFIX], [MODEL], [TUTORIAL], [FEATURE], [DOC], etc)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage
  • Code is well-documented

cc @awslabs/raf-reviewer

@anadeem2 requested a review from zachzzc on Jul 7, 2022 18:34
verify_step(Model(), [x])


@pytest.mark.parametrize("shape", [(3, 3, 3)])
Contributor (review comment on the diff above):

The 2-dimensional case should work, right? Can we add a test here?
And can you also add an fp16 dtype test?
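For illustration, a hedged sketch of what such a parametrization could look like, reusing the verify_step and Model helpers visible in this diff; the exact test body and torch usage here are assumptions:

```python
# Illustrative sketch only: extend the existing parametrization with a 2D shape
# and an fp16 dtype. Assumes pytest, torch, verify_step, and Model are already
# available in this test module.
@pytest.mark.parametrize("shape", [(3, 3), (3, 3, 3)])
@pytest.mark.parametrize("dtype", [torch.float32, torch.float16])
def test_matmul(shape, dtype):
    x = torch.randn(*shape).to(dtype)
    verify_step(Model(), [x])
```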

Contributor Author:

The test case permutes over the shapes, so it effectively covers (1x1, 1x2, 1x3, 2x1, 2x2, ..., 3x3). I tried running fp16, but bmm does not support it, so it fails.
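Concretely (illustrative only), these are the (rows, cols) pairs the parametrization walks through:

```python
# Illustrative: the (rows, cols) combinations the parametrized test covers.
import itertools
pairs = list(itertools.product(range(1, 4), range(1, 4)))
# [(1, 1), (1, 2), (1, 3), (2, 1), (2, 2), (2, 3), (3, 1), (3, 2), (3, 3)]
```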

@zachzzc (Contributor) commented Jul 7, 2022

I checked the XLA implementation: https://github.com/pytorch/xla/blob/cc19c3abcbb3f702d5f468ee08549edd926ef549/torch_xla/csrc/xla_lower_util.cpp#L386
We can revise ours in the future to support dim >= 4, referring to this.

@anadeem2 (Contributor Author)
I did reference aten::matmul; I'm not sure whether we can implement matrix folding the way they do, but here it is for reference: https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/native/LinearAlgebra.cpp

@comaniac (Contributor)
Should this PR be closed due to #38?

@anadeem2 (Contributor Author) commented Aug 6, 2022

We can close this PR now. This was the implementation I used to get the full graph initially; I had to add the autograd piece, which is in my debug_branch. However, we now have a better implementation, so this is no longer needed.

@anadeem2 closed this Aug 6, 2022