Hi, thanks for the great work!
Regarding the SQuAD knowledge-incorporation (single-passage) experiments: I checked the provided data and found that the two ReSTEM rounds use different batches of 50 passages.
To fully reproduce the results, could you clarify: How were the 50 passages for Round 1 and Round 2 selected from SQuAD-train? What the exact random seeds and start index were used?
Thanks!