Dear authors,
congrats for this amazing research!
I would love to experiment with nGPT, but I only have access to limited compute budget. Therefore, I am asking whether you could provide checkpoints for your baseline and nGPT models at 1B params at 8K context.
This would immensely help me in my own research.
Thanks a lot in advance.
Best regards,
Filipe Laitenberger