This repository contains a PyTorch transformer-based approach for predicting chess Elo from only the sequence of algebraic moves. The default training data come from Lichess.
- Clone the repo and navigate to it
- Obtain a list of zstandard-compressed PGN files to train on and save it under the name `game_list.txt`. The default is a list of all games from January 2013 through September 2023 from Lichess.
- Change the settings in `settings.py`:
  a. `FILE_DIR` - where the data is stored. This location should have enough storage for all the decompressed files (at least several terabytes).
  b. `EMBEDDING_DIM` - the number of dimensions used for the move embedding (default=72)
  c. `SEQUENCE_LENGTH` - the maximum number of moves permitted in a game (default=150)
- Download and process the data using `download_data.py`. The parameters passed to `parse_file` can be modified to filter the data.
  a. By default the data is downloaded and parsed using five processes before being combined and split into training, validation, and testing sets.
- Run either `create_indices.py` to generate an index mapping for every unique move (this will cause training to use a trainable `Embedding` layer), or run `create_embeddings.py` to use `Word2Vec` to pretrain embeddings.
  a. Pretrained embeddings from `Word2Vec` are `embeddings.model` and `move_vecs.wordvectors`; note that these embeddings are of length 100.
- Run the training using either
`train.py` or `train_ddp.py`. This defaults to using trainable embeddings, and the setup may require tuning for the available memory. The current settings are for a setup with four RTX 2080 Tis with 11 GB of memory each.
  a. `train.py` will use PyTorch's `DataParallel` to run on multiple GPUs if applicable; otherwise it will target the GPU or CPU depending on availability.
  b. `train_ddp.py` uses PyTorch's `DistributedDataParallel` to run on multiple GPUs. However, this was slower than the simpler `DataParallel` approach.
  c. The default model is `modelAvg`; see the How it works section for more details.
- Training progress can be checked using
`check_training.py`, which will generate basic plots to check the current training status.
- After training, run `generate_val_results.py` and then `get_percentiles.py` to run the validation scripts.
- If desired, run `convert_to_onnx.py` to create ONNX versions of the models.
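For reference, the three settings above might look like this in `settings.py` (the path is a placeholder, not a real default; the numeric values are the stated defaults):

```python
# settings.py - central configuration read by the other scripts.
FILE_DIR = "/data/chess"   # placeholder; needs several TB for decompressed PGNs
EMBEDDING_DIM = 72         # dimensions of each move embedding
SEQUENCE_LENGTH = 150      # maximum number of moves kept per game
```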
Work in progress
Work in progress
- The raw data should be PGN (Portable Game Notation) files. These are looped through line by line, and the game year/month, time controls (both base and bonus), black and white Elos, and move sequences are parsed out. The result of the game and whether it ended in checkmate are also parsed out, although ideally the model shouldn't need this information.
  a. The data will be parsed with 5 processes (more can cause the Lichess server to refuse to respond). Each process will write to a different text file, with every game on a separate line. Both the downloaded and the decompressed files use temporary files.
  b. After all the data is downloaded, the first file will be split in two for the validation and testing data, and the other 4 will be combined into the training data. The original files will be deleted afterwards.
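The line-by-line parsing described above can be sketched roughly as follows; `parse_game` is a hypothetical helper, not the repository's actual `parse_file`, and real Lichess PGNs need more error handling (e.g. missing Elo tags):

```python
import re

def parse_game(lines):
    """Extract the fields described above from one PGN game's lines (sketch)."""
    game = {}
    for line in lines:
        m = re.match(r'\[(\w+) "(.*)"\]', line)
        if m:                                      # header tag, e.g. [WhiteElo "1845"]
            tag, value = m.groups()
            if tag == "WhiteElo":
                game["white_elo"] = int(value)
            elif tag == "BlackElo":
                game["black_elo"] = int(value)
            elif tag == "TimeControl":             # formatted as "base+bonus" seconds
                base, _, bonus = value.partition("+")
                game["base"], game["bonus"] = int(base), int(bonus or 0)
            elif tag == "Result":
                game["result"] = value
        elif line and not line.startswith("["):    # the move-text line
            game["moves"] = [t for t in line.split()
                             if not t[0].isdigit() and t != "*"]
            game["checkmate"] = "#" in line        # any move ending in mate
    return game
```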
- How stationary the Elo target is can be checked with `check_year_scores.py`; this loops through two years (default 2015 and 2022), grabs all games with identical move sequences, and compares the Elos.
- `create_indices.py` generates a JSON mapping of every unique move to an integer index; `create_onehot.py` does the same thing except each value is a list with zeros everywhere except a 1 at the index.
- `create_embeddings.py` uses `gensim`'s `Word2Vec` to create pretrained embeddings of each move using a window size of 3 and an embedding size of 72. This isn't the approach used in the actual models, so it is only kept as legacy code.
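The index-mapping idea can be illustrated as follows; the hypothetical `build_move_index` below is a sketch, not the actual `create_indices.py` code:

```python
import json

def build_move_index(games):
    """Map every unique move seen across all games to an integer index."""
    index = {}
    for moves in games:
        for move in moves:
            if move not in index:
                index[move] = len(index)
    return index

# Example: two games' move sequences -> a JSON-serializable mapping
index = build_move_index([["e4", "e5", "Nf3"], ["d4", "e5"]])
serialized = json.dumps(index)  # this kind of mapping is what gets written to disk
```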
There are several main model options in `train.py` (`train_ddp.py` is an older version and has not been updated with all options):
The main model approaches follow the diagram above.
This model first passes the sequence of moves through an `Embedding` layer and then a `TransformerEncoder`. It then takes a mean down the embedding dimension (so the result is the same length as the sequence) and passes the result through linear layers until the black and white Elos are predicted.
This model is the same as `modelAvg`, except that after the mean it concatenates the base and bonus time values prior to the linear layers.
This model is the same as `modelAvg`, except that after the mean it concatenates the game result prior to the linear layers.
This model is the same as `modelAvg`, except that after the mean it concatenates both the base and bonus times and the game result prior to the linear layers.
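The `modelAvg` pipeline described above (embed, encode, mean over the embedding dimension, then linear layers) can be sketched in PyTorch as follows; the number of heads, encoder layers, and hidden size are illustrative assumptions, not the repository's actual values:

```python
import torch
import torch.nn as nn

class ModelAvgSketch(nn.Module):
    """Embedding -> TransformerEncoder -> mean over the embedding
    dimension -> linear layers -> (white Elo, black Elo)."""
    def __init__(self, n_moves, emb_dim=72, seq_len=150):
        super().__init__()
        self.embedding = nn.Embedding(n_moves, emb_dim)
        layer = nn.TransformerEncoderLayer(d_model=emb_dim, nhead=8,
                                           batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Sequential(               # seq_len inputs after the mean
            nn.Linear(seq_len, 64), nn.ReLU(), nn.Linear(64, 2))

    def forward(self, moves):                    # moves: (batch, seq_len) move indices
        x = self.encoder(self.embedding(moves))  # (batch, seq_len, emb_dim)
        x = x.mean(dim=-1)                       # mean down the embedding dimension
        return self.head(x)                      # (batch, 2): white and black Elo
```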
A few other models were also experimented with.
This model first passes the sequence of moves through an `Embedding` layer. It then flattens the result and passes it through linear layers until the black and white Elos are predicted.
This approach is faster to train and achieves similar results to the transformer-based approaches, but requires orders of magnitude more weights.
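The weight blow-up is easy to see with the default settings: after flattening, the first linear layer sees 150 × 72 = 10,800 inputs, versus 150 after the mean. A quick arithmetic check (the hidden size is an illustrative assumption):

```python
SEQ_LEN, EMB_DIM, HIDDEN = 150, 72, 512    # HIDDEN is illustrative, not the repo's value

flat_inputs = SEQ_LEN * EMB_DIM            # flatten: every position x every dimension
mean_inputs = SEQ_LEN                      # mean over the embedding dimension

flat_weights = flat_inputs * HIDDEN        # first linear layer, flattened input
mean_weights = mean_inputs * HIDDEN        # first linear layer, averaged input
print(flat_weights // mean_weights)        # 72x more weights in that one layer
```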
This model first passes the sequence of moves through an `Embedding` layer and then a `TransformerEncoder`. The results are then flattened and passed through linear layers until the black and white Elos are predicted.
This approach is far slower and larger than it needs to be (the worst of both the transformer and linear worlds).