Hi @karpathy, interesting, subtly sneaky behavior from autoresearch: the last iteration got a better score just by changing the model's random seed... <img width="259" height="191" alt="Image" src="https://github.com/user-attachments/assets/b0fbf168-e055-4a03-8839-4f7843074c9b" />