A comparison of reinforcement learning algorithms across several game environments.
To run an algorithm on a particular environment, run python .\test_insert_algorithm.py --task "Twenty48_insert_version", e.g. to run the dqn algorithm on the Twenty48stoch-v0 environment, run python .\test_dqn.py --task "Twenty48stoch-v0".
Two versions of the environment are provided. Twenty48stoch-v0 is the standard implementation of a game of 2048 (stochastic environment). The second version keeps the same tile movement mechanics as v0 but removes all randomness (deterministic environment).
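For illustration, a minimal interaction loop is sketched below. It assumes the environments are registered with Gym under the IDs used in this README (only Twenty48stoch-v0 is named above) and follow the classic Gym reset/step API; the actual API here may differ.

    import gym

    # Hypothetical usage sketch, not the repo's actual test harness.
    env = gym.make("Twenty48stoch-v0")  # stochastic 2048 environment
    obs = env.reset()
    done = False
    total_reward = 0.0
    while not done:
        action = env.action_space.sample()  # random move (up/down/left/right)
        obs, reward, done, info = env.step(action)
        total_reward += reward
    print("Episode return:", total_reward)
    env.close()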
To access analytics relating to the training efficiency of an algorithm on a particular version, run tensorboard --logdir .\log\Twenty48_insert_version\insert_algorithm, e.g. to access the analysis for the dqn algorithm on the Twenty48stoch-v0 environment, run tensorboard --logdir .\log\Twenty48stoch-v0\dqn.
The reward follows the standard 2048 scoring rule: after a move, the values of the newly merged tiles are summed, and that sum is the reward for the step. Additionally, any move that leaves the board state unchanged receives a small negative reward (-0.1).
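As a hedged sketch of this scoring rule (the function name and inputs are assumptions for illustration, not the repo's actual code):

    def step_reward(merged_tile_values, state_changed):
        """Reward for one move under the 2048 rule described above.

        merged_tile_values: values of tiles created by merges this step,
                            e.g. merging two 2s yields a 4, contributing 4.
        state_changed:      False if the move left the board unchanged.
        """
        if not state_changed:
            return -0.1  # penalise moves that change nothing
        return sum(merged_tile_values)

    # Example: a move merges two 2s (-> 4) and two 8s (-> 16): reward = 20.
    assert step_reward([4, 16], True) == 20
    assert step_reward([], False) == -0.1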
A file called manual_instance.py is provided for manually testing the game environment's functionality.
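It can presumably be launched directly with python .\manual_instance.py; the exact invocation is an assumption, so check the file for any required arguments.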