Skip to content

model specifications & related logics #4

@shivendrra

Description

@shivendrra

to be added:

  • BERT language model
  • decoder-only model with RMSNorm, RoPE, SwiGLU
  • config file for different model size configs
  • run file
  • quantization logic
  • fine-tuning script for model

to be done:

  • test run once with smaller data
  • hyperparameter tuning

Metadata

Metadata

Assignees

Labels

achivergoals to be achieved

Type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions