A simple 2D game created as a research challenge for Reinforcement Learning algorithms. Here is a blog post explaining the modelling and training processes in finer detail.
The rules are simple: move the hero ball around the 2D plane and hit all the Green balls while avoiding the Red ones. Each ball's initial coordinates and velocity vector are randomly generated at the start of every game, and the balls bounce off the four sides of the environment.
The game terminates if:
- All Green balls have been hit
- One of the Red balls has been hit
- 1000 time steps have elapsed
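For concreteness, here is a minimal sketch of the movement and termination logic described above, assuming a unit-square arena and NumPy arrays of shape `(n_balls, 2)`; the function and constant names are hypothetical, not the repository's actual ones:

```python
import numpy as np

MAX_STEPS = 1000  # matches the 1000-step limit above

def step_balls(positions, velocities, dt=1.0):
    """Advance every ball and reflect its velocity off the four walls
    of a unit-square arena."""
    positions = positions + velocities * dt
    for axis in range(2):
        # A ball past a wall gets that velocity component flipped and
        # its position clamped back inside the arena.
        out = (positions[:, axis] < 0.0) | (positions[:, axis] > 1.0)
        velocities[out, axis] *= -1.0
        positions[:, axis] = np.clip(positions[:, axis], 0.0, 1.0)
    return positions, velocities

def is_terminal(greens_left, red_hit, t):
    # The three termination conditions listed above.
    return greens_left == 0 or red_hit or t >= MAX_STEPS
```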
Despite its simplicity, the game's environment has a high-dimensional observation space and a fairly sparse reward structure. Solving it efficiently required training a Proximal Policy Optimisation (PPO) model, first initialised with Behavioural Cloning of a heuristic-based expert agent.
The PPO implementation comes from the excellent StableBaselines3 Python library, which requires the game environment to implement the Gymnasium API standard.
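In practice the Gymnasium contract amounts to declaring an observation space and an action space and implementing `reset` and `step` with fixed signatures. Below is a minimal skeleton of such an environment; the class name, space shapes and action set are illustrative assumptions, not the repository's actual definitions:

```python
import gymnasium as gym
import numpy as np

class BallGameEnv(gym.Env):
    """Skeleton of the game wrapped in the Gymnasium API."""

    def __init__(self, n_green=3, n_red=3):
        super().__init__()
        # One (x, y, vx, vy) quadruple per ball, hero included.
        n_features = 4 * (1 + n_green + n_red)
        self.observation_space = gym.spaces.Box(
            low=-1.0, high=1.0, shape=(n_features,), dtype=np.float32
        )
        # For example, a discrete set of movement directions for the hero.
        self.action_space = gym.spaces.Discrete(5)

    def reset(self, *, seed=None, options=None):
        super().reset(seed=seed)
        obs = self.observation_space.sample()  # placeholder state
        return obs, {}

    def step(self, action):
        obs = self.observation_space.sample()  # placeholder state
        reward, terminated, truncated = 0.0, False, False
        return obs, reward, terminated, truncated, {}
```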
The whole training process can be reproduced on your machine:
1. Run `bc_training.py` to record the expert agent's actions over 128 game simulations and run the Behavioural Cloning algorithm with a balanced loss function (see the behavioural-cloning sketch after this list).
2. Run `train.py` to finalise the training of the PPO model on multiple online simulations and save versions of the trained model (see the PPO training sketch after this list).
3. Run `compare_performance.py` to compare your RL model with the expert agent. If the RL model does not outperform the expert, go back to step 2.
4. Run `simulate.py` to watch and record your trained agent in action.
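The behavioural-cloning step can be pictured as supervised learning on the recorded (observation, expert action) pairs. The sketch below assumes a discrete action space and interprets the balanced loss as inverse-frequency class weighting; the actual scheme in `bc_training.py` may differ, and all names here are hypothetical:

```python
import torch
import torch.nn as nn

def behavioural_cloning(policy_net, expert_obs, expert_acts,
                        n_actions, epochs=20, lr=3e-4):
    """Fit policy_net to the expert's actions with a class-balanced
    cross-entropy loss."""
    # Weight each action inversely to its frequency in the expert data,
    # so rare manoeuvres are not drowned out by the common ones.
    counts = torch.bincount(expert_acts, minlength=n_actions).float()
    weights = counts.sum() / (n_actions * counts.clamp(min=1.0))
    loss_fn = nn.CrossEntropyLoss(weight=weights)
    optimiser = torch.optim.Adam(policy_net.parameters(), lr=lr)
    for _ in range(epochs):
        optimiser.zero_grad()
        loss = loss_fn(policy_net(expert_obs), expert_acts)
        loss.backward()
        optimiser.step()
    return policy_net
```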
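The online PPO stage maps directly onto StableBaselines3's standard training loop. A minimal sketch, reusing the hypothetical `BallGameEnv` from the earlier skeleton, with an illustrative timestep budget and save path:

```python
from stable_baselines3 import PPO

# BallGameEnv is the hypothetical Gymnasium environment sketched above.
env = BallGameEnv()

# Train PPO on the environment and save a checkpoint.
model = PPO("MlpPolicy", env, verbose=1)
model.learn(total_timesteps=1_000_000)
model.save("ppo_ball_game")
```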
The animated GIF at the top of the page shows simulations of a trained Deep RL model with the following performance statistics:
DRL model: 1.975 ± 1.076
Expert Agent: 1.462 ± 1.304
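Statistics of this form (mean ± standard deviation of the per-episode score over many games, which is an assumption about the figures above) can be computed with an evaluation loop like this sketch, which assumes both agents expose an SB3-style `predict` method:

```python
import numpy as np

def evaluate(agent, env, n_episodes=100):
    """Mean and standard deviation of the episode score over n_episodes
    (helper names hypothetical)."""
    scores = []
    for _ in range(n_episodes):
        obs, _ = env.reset()
        done, total = False, 0.0
        while not done:
            action, _ = agent.predict(obs, deterministic=True)
            obs, reward, terminated, truncated, _ = env.step(action)
            total += reward
            done = terminated or truncated
        scores.append(total)
    return float(np.mean(scores)), float(np.std(scores))
```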
Generally, the more training epochs, the better the Deep RL agent performs.
