"The Role of Attention for TinyZero Countdown" suggests that attention heads perform verification when the solution is given in the context. Let's better understand these heads.
These heads seem similar to induction heads -- or, more precisely, to "previous token heads" that attend to earlier occurrences of the current token. Can we verify that the verification heads behave as previous token heads in contexts outside of our task?
Use random sentences from Wikipedia (or similar natural text), and check the attention patterns of these heads when a token repeats in the input.
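One way to operationalize this check is to score a head by how much attention each repeated token places on its own earlier occurrences. The sketch below is illustrative, not taken from the repo: the metric, toy attention matrix, and token ids are made up, and in practice `attn` would be one head's real pattern (e.g. from `model(..., output_attentions=True)` in Hugging Face transformers) over tokenized Wikipedia text.

```python
import numpy as np

def prev_occurrence_score(attn: np.ndarray, token_ids: list[int]) -> float:
    """Average attention mass that repeated tokens place on earlier
    occurrences of themselves. attn is one head's (seq, seq) causal
    attention pattern; token_ids is the corresponding token sequence."""
    scores = []
    for q, tok in enumerate(token_ids):
        prev = [k for k in range(q) if token_ids[k] == tok]
        if prev:  # only score query positions whose token appeared before
            scores.append(sum(attn[q, k] for k in prev))
    return float(np.mean(scores)) if scores else 0.0

# Toy example: token 7 repeats at positions 1 and 3; this head puts
# most of position 3's attention mass back on position 1.
tokens = [5, 7, 9, 7]
attn = np.array([
    [1.0, 0.0, 0.0, 0.0],
    [0.5, 0.5, 0.0, 0.0],
    [0.3, 0.4, 0.3, 0.0],
    [0.1, 0.8, 0.05, 0.05],  # query 3 (token 7) -> key 1 (token 7)
])
print(round(prev_occurrence_score(attn, tokens), 2))  # -> 0.8
```

A head that scores high on arbitrary natural text is behaving like a generic previous-token head; a head that only scores high on Countdown solutions is doing something more task-specific.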
Check the attention patterns of these heads on repeated tokens in our Countdown task that do not correspond to the solution (e.g., the "=" token or other numeric tokens).
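The same idea can be restricted to a single control token. The sketch below is a hypothetical variant (the token id 11 standing in for "=" and the toy attention values are placeholders; the real ids come from the TinyZero tokenizer): if the heads still attend back to earlier "=" occurrences, which carry no solution information, that favors the generic previous-token-head interpretation over verification.

```python
import numpy as np

def control_token_score(attn: np.ndarray, token_ids: list[int],
                        control_id: int) -> float:
    """Attention mass that later occurrences of one control token
    (e.g. the id of "=") place on its earlier occurrences, for one
    head's (seq, seq) attention pattern."""
    pos = [i for i, t in enumerate(token_ids) if t == control_id]
    scores = [sum(attn[q, k] for k in pos if k < q) for q in pos[1:]]
    return float(np.mean(scores)) if scores else 0.0

# Toy Countdown-like sequence: id 11 stands in for "=" at positions 1 and 3.
ids = [3, 11, 4, 11, 6]
attn = np.zeros((5, 5))
attn[3, 1] = 0.7  # the second "=" attends back to the first "="
attn[3, 0] = 0.3
print(control_token_score(attn, ids, control_id=11))  # -> 0.7
```

Comparing this score for "=" against the score for the actual solution tokens, head by head, separates "attends to any repeat" from "attends to the solution."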
Relevant code:
https://github.com/ajyl/verify_circuit/blob/main/notebooks/explore_attention.sync.ipynb