Dear @XiangLi1999 and @ari-holtzman,
If I understand the paper correctly, Section 3.4 states that the amateur (student) model is conditioned on a context window that starts from the last token of the prompt. I cannot find any trace of this choice in the code: for instance, here and here the whole input is passed to the amateur model, exactly as the expert sees it.
I cannot find the corresponding study in the ablation script either.
Am I missing an argument or some logic elsewhere in the code that sets the amateur's context window?
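To make the question concrete, here is a minimal sketch of the behaviour I expected under my reading of Section 3.4. It is only an illustration: `amateur_context` and `prompt_len` are hypothetical names of mine, not identifiers from the repository, and the token ids are made up.

```python
from typing import List


def amateur_context(input_ids: List[int], prompt_len: int) -> List[int]:
    """Tokens the amateur would see under my reading of Section 3.4.

    Hypothetical helper: keeps the last prompt token and every generated
    token after it, and drops the rest of the prompt.
    """
    if not 1 <= prompt_len <= len(input_ids):
        raise ValueError("prompt_len must be in [1, len(input_ids)]")
    return input_ids[prompt_len - 1:]


# Toy example: a 4-token prompt followed by 3 generated tokens.
prompt = [11, 12, 13, 14]
generated = [21, 22, 23]
full_input = prompt + generated

expert_input = full_input  # the expert sees the whole sequence
amateur_input = amateur_context(full_input, len(prompt))
print(amateur_input)  # [14, 21, 22, 23]
```

In the code I linked, as far as I can tell, the amateur receives the equivalent of `full_input` rather than `amateur_input`.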
Best,
Marco