https://github.com/datawhalechina/tiny-universe/blob/main/content/TinyTransformer/tiny_transformer.py#L281