GitHub - sanathbhat/SaperaRL: Building a Reinforcement Learning agent for the popular Snake game

A framework and some AI agents to master the classic snake game.

The following types of agents are available in this repo:

Human Agent (use the arrow keys to control the snake)
Deep Q-Network (DQN) (Stable-Baselines3) - Deep-learning-based agent that learns Q-values for state-action pairs and derives its policy by choosing the highest-valued action in each state.
Proximal Policy Optimization (PPO) (Stable-Baselines3) - Deep-learning-based agent that directly optimizes the policy using the PPO algorithm.
Recurrent PPO (Stable-Baselines3 Contrib) - Adds LSTM layers to PPO to produce recurrent policies that carry memory across steps and can 'plan ahead'.
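
The difference between the value-based and policy-based agents listed above can be sketched in a few lines of plain Python. This is a simplified illustration, not the repo's code and not the Stable-Baselines3 implementation: it swaps the neural network for a lookup table (tabular Q-learning) and shows only the per-sample clipped term of the PPO objective. The action names, state labels, and hyperparameter defaults are all assumptions made for the example.

```python
from typing import Dict, Tuple

# Hypothetical action set for a snake game (an assumption, not taken from the repo).
ACTIONS = ["up", "down", "left", "right"]

QTable = Dict[Tuple[str, str], float]


def q_update(q: QTable, state: str, action: str, reward: float,
             next_state: str, done: bool,
             alpha: float = 0.1, gamma: float = 0.99) -> float:
    """One tabular Q-learning step; DQN replaces the table with a neural network."""
    # Bootstrapped value of the next state is zero when the episode has ended.
    best_next = 0.0 if done else max(q.get((next_state, a), 0.0) for a in ACTIONS)
    target = reward + gamma * best_next
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + alpha * (target - old)
    return q[(state, action)]


def greedy_action(q: QTable, state: str) -> str:
    """Value-based policy: pick the action with the highest Q-value."""
    return max(ACTIONS, key=lambda a: q.get((state, a), 0.0))


def ppo_clipped_term(ratio: float, advantage: float, clip_range: float = 0.2) -> float:
    """Per-sample PPO surrogate: min(r * A, clip(r, 1 - eps, 1 + eps) * A)."""
    clipped = min(max(ratio, 1.0 - clip_range), 1.0 + clip_range)
    return min(ratio * advantage, clipped * advantage)


if __name__ == "__main__":
    q: QTable = {}
    q_update(q, "s0", "right", reward=1.0, next_state="s1", done=True)
    print(greedy_action(q, "s0"))
    print(ppo_clipped_term(ratio=1.5, advantage=1.0))
```

In the first function the policy is implicit: it falls out of the learned values. In the last, the policy's own probability ratio is what gets optimized, and the clip keeps a single update from moving the policy too far from the one that collected the data.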

agent
config
env
model
ui
utils
.gitignore
README.md
main.py
