Conversation

@thanay-sisir (Contributor)

⚡ Optimization Summary: compute_synchronisation

1. Technical Mechanism

  • Vectorization: Refactors the operation from a dense $O(N^2)$ outer product to a sparse, index-driven Hadamard product using torch.triu_indices (see the sketch after this summary).
  • Memory Efficiency: Eliminates the intermediate $(B, N, N)$ tensor allocation, reducing auxiliary memory complexity from $O(B \cdot N^2)$ to $O(1)$.
  • Device Awareness: Enforces explicit device placement for indices to prevent implicit host-to-device transfer overhead.

2. Stability & Scalability

  • Prevents "Quadratic Trap": Removes the bottleneck where memory usage scales quadratically with $d_{model}$, avoiding OOM errors on larger model configurations.
  • Eliminates Compute Waste: Bypasses the calculation of the symmetric lower triangle, saving $\approx 50\%$ of the FLOPs and memory writes previously spent on discarded data.
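A minimal before/after sketch of the change is below. The function names, the $(B, N)$ input shape, and the assumption that the upper triangle (including the diagonal) is what gets retained are illustrative only; the actual signature of compute_synchronisation in this repo may differ.

```python
import torch

def sync_dense(x: torch.Tensor) -> torch.Tensor:
    """Old formulation (sketch): dense (B, N, N) outer product,
    of which only the upper triangle is used downstream."""
    outer = x.unsqueeze(2) * x.unsqueeze(1)          # (B, N, N) intermediate
    i, j = torch.triu_indices(x.size(1), x.size(1))
    return outer[:, i, j]                            # (B, N*(N+1)//2)

def sync_sparse(x: torch.Tensor) -> torch.Tensor:
    """New formulation (sketch): index the upper-triangular pairs directly
    and take an element-wise (Hadamard) product, never building the dense
    tensor or computing the redundant lower triangle."""
    i, j = torch.triu_indices(x.size(1), x.size(1), device=x.device)
    return x[:, i] * x[:, j]                          # (B, N*(N+1)//2)

# Both paths agree on the retained upper-triangular entries.
x = torch.randn(4, 8)
assert torch.allclose(sync_dense(x), sync_sparse(x))
```

The key saving is that the indexed product never materialises the $(B, N, N)$ tensor, which is where the quadratic memory spike came from; the explicit device= keyword keeps the index tensor on the same device as the activations, avoiding implicit host-to-device copies.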

@lukedarlow (Collaborator) left a comment

This PR does follow the rules.

However, I would like you to add commentary above these altered lines that shows the old version, explaining their equivalence.

The reason for this is simply to aid readers in understanding what the code is actually doing in relation to the paper.
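For example, something along these lines (purely illustrative; the variable names and the exact old expression are placeholders, not the real diff):

```python
# Old (dense) version, kept here for reference:
#   sync = (z.unsqueeze(2) * z.unsqueeze(1))[:, idx_i, idx_j]
# The line below is equivalent: selecting the upper-triangular pairs first and
# multiplying element-wise reproduces exactly those entries of the dense outer
# product used in the paper's synchronisation definition, without allocating
# the (B, N, N) tensor.
sync = z[:, idx_i] * z[:, idx_j]
```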

@thanay-sisir (Contributor, Author)

@lukedarlow when are you going to get to this, Luke?
Please try to merge it whenever you are free. 😊
Thanks in advance!

@lukedarlow (Collaborator)

I already requested changes from you.

@thanay-sisir (Contributor, Author)

@lukedarlow okay Luke, how about it now?
