sanathbhat/SaperaRL

A framework and some AI agents to master the classic snake game.
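To give a feel for what such a framework has to provide, here is a minimal sketch of a Snake environment with a Gym-style `reset`/`step` interface. It is not the repository's implementation: the class name `MiniSnake`, the action encoding, the observation format, and the reward values are all assumptions made for illustration.

```python
import random
from collections import deque

# Hypothetical action encoding; the repository may use a different one.
UP, RIGHT, DOWN, LEFT = 0, 1, 2, 3
MOVES = {UP: (0, -1), RIGHT: (1, 0), DOWN: (0, 1), LEFT: (-1, 0)}


class MiniSnake:
    """Toy Snake environment (illustrative only)."""

    def __init__(self, size=6, seed=None):
        self.size = size
        self.rng = random.Random(seed)

    def reset(self):
        mid = self.size // 2
        self.snake = deque([(mid, mid)])  # snake[0] is the head
        self._place_food()
        return self._obs()

    def _place_food(self):
        # Put the food on a random cell not occupied by the snake.
        free = [(x, y) for x in range(self.size) for y in range(self.size)
                if (x, y) not in self.snake]
        self.food = self.rng.choice(free)

    def _obs(self):
        # Assumed observation: (head position, food position).
        return (self.snake[0], self.food)

    def step(self, action):
        """Return (observation, reward, done) for one move."""
        dx, dy = MOVES[action]
        hx, hy = self.snake[0]
        head = (hx + dx, hy + dy)
        hit_wall = not (0 <= head[0] < self.size and 0 <= head[1] < self.size)
        # Simplification: moving onto any body cell, including the tail, ends the game.
        if hit_wall or head in self.snake:
            return self._obs(), -1.0, True  # reward values are assumptions
        self.snake.appendleft(head)
        if head == self.food:
            if len(self.snake) == self.size * self.size:
                return self._obs(), 1.0, True  # board is full
            self._place_food()
            return self._obs(), 1.0, False
        self.snake.pop()  # no food eaten, so the tail moves forward
        return self._obs(), 0.0, False
```

A human or learned agent would then call `env.reset()` once and `env.step(action)` repeatedly until `done` is true. To train the Stable-Baselines3 agents, an environment like this would additionally need to be written as a Gymnasium `Env` subclass with declared observation and action spaces.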

The following types of agents are available in this repo:

  1. Human Agent - controlled by the player with the arrow keys.
  2. Deep Q-Network (DQN) (Stable-Baselines3) - a deep-learning agent that learns Q-values for state-action pairs and derives its policy from them.
  3. Proximal Policy Optimization (PPO) (Stable-Baselines3) - a deep-learning agent that optimizes the policy directly using the PPO algorithm.
  4. Recurrent PPO (Stable-Baselines3 Contrib) - PPO with LSTM layers, producing a recurrent policy that can draw on earlier observations to 'plan ahead'.
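The idea behind the Q-value agent in item 2 can be sketched with a tabular Q-learning update. This is a simplified stand-in, not the Stable-Baselines3 implementation: DQN replaces the table with a neural network and adds a replay buffer and a target network. The function names, state labels, and hyperparameter values below are assumptions for illustration.

```python
from collections import defaultdict


def q_update(q, state, action, reward, next_state, done,
             n_actions=4, alpha=0.1, gamma=0.99):
    """Apply one Q-learning update in place and return the new Q(state, action)."""
    # Value of the best action in the next state (zero if the episode ended).
    best_next = 0.0 if done else max(q[(next_state, a)] for a in range(n_actions))
    target = reward + gamma * best_next
    # Move the current estimate a fraction alpha toward the target.
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]


def greedy_action(q, state, n_actions=4):
    """Pick the action with the highest estimated Q-value."""
    return max(range(n_actions), key=lambda a: q[(state, a)])


# Hypothetical transition: taking action 1 in state "s0" earned reward 1.0.
q = defaultdict(float)  # unseen (state, action) pairs start at 0.0
q_update(q, "s0", 1, 1.0, "s1", done=False)
best = greedy_action(q, "s0")
```

After this single update, action 1 has the highest estimate in state `"s0"`, so the greedy policy selects it. PPO and Recurrent PPO take a different route: they adjust the policy's parameters directly instead of deriving the policy from Q-values.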

About

Building a Reinforcement Learning agent for the popular Snake game
