Open
Description
- Find the input prompt length in RL_SCHEMA after the tokenizer fix.
- Generate an attention mask in `generate_k_completions` so that the later functions can use the mask.
- Fix the famous indexing issue:
  student_output_scores, collected_exit_logits = student(
      completions['tokens'][:, :-1],
      prescribed_exit_layer_idxs=prescribed_exit_layers[:, 1:],
  )
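For reference, here is a minimal sketch of how the mask and the shifted slices could line up. It is not the repository's code and not a proposed fix. It uses NumPy arrays in place of the project's tensors. `PAD_ID`, `build_attention_mask`, `shift_for_next_token`, and the sample values are all hypothetical. It also assumes padding can be detected through a dedicated pad token id, which breaks down if the pad id is shared with another token such as EOS.

```python
import numpy as np

PAD_ID = 0  # hypothetical pad token id, not taken from the project


def build_attention_mask(tokens: np.ndarray, pad_id: int = PAD_ID) -> np.ndarray:
    """Return 1 where a position holds a real token and 0 where it holds padding."""
    return (tokens != pad_id).astype(np.int64)


def shift_for_next_token(tokens, attention_mask, per_token_labels):
    """Align model inputs with next-token targets.

    Position t of the inputs predicts token t + 1, so everything fed to the
    model drops the last column, and everything that describes the predicted
    tokens drops the first column.
    """
    inputs = tokens[:, :-1]
    input_mask = attention_mask[:, :-1]
    targets = tokens[:, 1:]
    target_labels = per_token_labels[:, 1:]
    target_mask = attention_mask[:, 1:]  # which targets are real tokens
    return inputs, input_mask, targets, target_labels, target_mask


# Toy batch: the second row is right-padded with PAD_ID.
tokens = np.array([[5, 6, 7, 8],
                   [9, 4, 0, 0]])
# Hypothetical per-token exit layer indices, same shape as tokens.
exit_layers = np.array([[2, 3, 1, 4],
                        [1, 2, 0, 0]])

mask = build_attention_mask(tokens)
inputs, input_mask, targets, target_labels, target_mask = shift_for_next_token(
    tokens, mask, exit_layers
)
```

If the real call follows this pattern, the mask passed alongside `completions['tokens'][:, :-1]` would need the same `[:, :-1]` slice, while any mask applied to the scores for the predicted tokens would use `[:, 1:]`, matching `prescribed_exit_layers[:, 1:]`.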