Checkpoints for best 1B models at 8K context #4

@Thiggel

Dear authors,

Congrats on this amazing research!

I would love to experiment with nGPT, but I only have access to a limited compute budget. Could you therefore provide checkpoints for your baseline and nGPT models with 1B parameters at 8K context?

This would immensely help me in my own research.

Thanks a lot in advance.

Best regards,
Filipe Laitenberger
