Checkpoints for best 1B models at 8K context

Dear authors,

congrats for this amazing research! 

I would love to experiment with nGPT, but I only have access to limited compute budget. Therefore, I am asking whether you could provide checkpoints for your baseline and nGPT models at 1B params at 8K context.

This would immensely help me in my own research.

Thanks a lot in advance.

Best regards,
Filipe Laitenberger