0xDevansh/lstm

Motivation

In this project I want to try out different recurrent architectures: RNN, LSTM (with and without peephole connections), GRU and, if time and skill permit, attention.

I'll build character-level language models and compare their performance. For fun, I'll use Agatha Christie novels in the public domain as my dataset.

I'll try to implement these from scratch, using PyTorch's tensors only for autograd.
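To make "from scratch" concrete, here's a minimal sketch of an LSTM cell built directly on torch tensors, leaving the gradients to autograd (the class name and initialization are illustrative, not the exact code in this repo):

```python
import torch

class LSTMCellScratch(torch.nn.Module):
    """Toy LSTM cell using raw tensor ops; autograd handles backprop."""

    def __init__(self, input_dim, hidden_dim):
        super().__init__()
        # One weight matrix covering all four gates: input, forget, cell, output
        self.W = torch.nn.Parameter(torch.randn(input_dim + hidden_dim, 4 * hidden_dim) * 0.01)
        self.b = torch.nn.Parameter(torch.zeros(4 * hidden_dim))

    def forward(self, x, state):
        h, c = state                                     # previous hidden and cell states
        z = torch.cat([x, h], dim=-1) @ self.W + self.b
        i, f, g, o = z.chunk(4, dim=-1)                  # split into the four gates
        i, f, o = torch.sigmoid(i), torch.sigmoid(f), torch.sigmoid(o)
        g = torch.tanh(g)
        c_next = f * c + i * g                           # gated cell-state update
        h_next = o * torch.tanh(c_next)                  # new hidden state
        return h_next, c_next
```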

What I did

The models I've trained so far:

Hastings

This was a character-level LSTM trained on The Mysterious Affair at Styles, with a hidden dimension of 512 and an embedding dimension of 64. The training function is trian_character_model in train.py, and the model is defined in TextModel.py.
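For context, a rough sketch of what the character-level data preparation looks like (the file path and variable names are assumptions; the real logic is in train.py):

```python
import torch

text = open("data/styles.txt").read()        # path to the novel's text is an assumption

# Character-level "tokenization": every distinct character gets an integer id
chars = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(chars)}
data = torch.tensor([stoi[ch] for ch in text], dtype=torch.long)

# Next-character prediction: the target at each position is the following character,
# trained with cross-entropy over the character vocabulary
seq_len = 128
x, y = data[:seq_len], data[1:seq_len + 1]
```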

Japp

This was another LSTM, this time using OpenAI's tiktoken library to tokenize the text. Training was very unstable, which I believe is because the tokenizer and the model were trained on different datasets.
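For reference, this is roughly how tiktoken is used (the choice of the gpt2 encoding here is an assumption about what Japp actually used):

```python
import tiktoken

enc = tiktoken.get_encoding("gpt2")   # pretrained BPE vocabulary of roughly 50k tokens
ids = enc.encode("Hercule Poirot examined the room.")
print(ids)                            # list of integer token ids
print(enc.decode(ids))                # round-trips back to the original string
print(enc.n_vocab)                    # vocabulary size the model's layers must cover
```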

Poirot

This is what I'm currently working on: a GRU model together with my own BPE tokenizer (a toy BPE sketch follows the list below). Why should this be better than tiktoken?

  1. tiktoken's vocabulary contains a lot of code-related tokens I don't need.
  2. Its ~50k-token vocabulary inflates the embedding and output layers, making the model bigger and training slower, even though most of those tokens never appear in my data.
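As a sketch of the BPE idea (not the tokenizer in this repo; names and details are illustrative): start from individual characters and repeatedly merge the most frequent adjacent pair until the desired number of merges is reached.

```python
from collections import Counter

def train_bpe(text, num_merges):
    """Toy byte-pair encoding: merge the most frequent adjacent pair each round."""
    tokens = list(text)                       # start from individual characters
    merges = []
    for _ in range(num_merges):
        pairs = Counter(zip(tokens, tokens[1:]))
        if not pairs:
            break
        (a, b), _ = pairs.most_common(1)[0]   # most frequent adjacent pair
        merges.append((a, b))
        merged, i = [], 0
        while i < len(tokens):                # replace every occurrence of the pair
            if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == (a, b):
                merged.append(a + b)
                i += 2
            else:
                merged.append(tokens[i])
                i += 1
        tokens = merged
    return merges, tokens
```

Training a tokenizer like this on the same Agatha Christie text the model sees should keep the vocabulary small and avoid the tokenizer/model mismatch that made Japp unstable.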
