Conversation

@AI-ELka AI-ELka commented Sep 28, 2025

No description provided.

@AI-ELka AI-ELka requested a review from orichardson September 28, 2025 20:19
@josephdviviano

can you resolve merge conflicts?


AI-ELka commented Oct 1, 2025

> can you resolve merge conflicts?

I noticed that Mehran's attention code was merged into main (that's where the conflict came from), so I think I can keep this attention implementation just in case we need it. I originally wrote it because I wasn't able to solve a problem in the other attention implementation.

@orichardson

@AI-ELka Can you help us determine whether or not that problem persists in the implementation that's currently on the main branch? I've forgotten the details of the issue.


AI-ELka commented Oct 2, 2025

> @AI-ELka Can you help us determine whether or not that problem persists in the implementation that's currently on the main branch? I've forgotten the details of the issue.

The main problem we had was that the loss remained constant in a case where it should decrease, but after re-testing with the current code, that problem seems to be gone.
One thing that does pop up now when testing with "uniform" initialization (but not with "from_cpd" or "random") is an assertion error coming from:

print(f"Any unfrozen edge changed? {any_changed}")
assert any_changed, "No learnable edges changed; attention/control masks may be misapplied."

So the original problem seems to have been solved, but we now have this remaining issue: the assertion fails when using uniform initialization.
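For context, the assertion above implements a common "did training actually update anything?" diagnostic: snapshot the trainable parameters before the optimizer steps, then check that at least one of them moved. A minimal, self-contained sketch of the same idea on a toy PyTorch model (the helper names `snapshot_params` and `changed_params` are illustrative, not from the repository):

```python
import torch

torch.manual_seed(0)

def snapshot_params(module):
    # Copy the current values of all trainable (unfrozen) parameters.
    return {n: p.detach().clone()
            for n, p in module.named_parameters() if p.requires_grad}

def changed_params(module, before):
    # Names of trainable parameters whose values moved since the snapshot.
    return [n for n, p in module.named_parameters()
            if n in before and not torch.equal(p.detach(), before[n])]

# Toy check: a single linear layer trained for a few SGD steps.
model = torch.nn.Linear(4, 1)
opt = torch.optim.SGD(model.parameters(), lr=0.1)
before = snapshot_params(model)

x, y = torch.randn(8, 4), torch.randn(8, 1)
for _ in range(3):
    opt.zero_grad()
    loss = torch.nn.functional.mse_loss(model(x), y)
    loss.backward()
    opt.step()

changed = changed_params(model, before)
print(f"Any unfrozen edge changed? {bool(changed)}")
assert changed, "No learnable edges changed; attention/control masks may be misapplied."
```

If this assertion fails only under uniform initialization, one plausible cause worth checking is that uniform init puts the parameters at a symmetric point where gradients cancel (or where a mask zeroes them out), so the optimizer steps leave them numerically unchanged.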
