# Snake AI

An advanced implementation of a Snake game AI that learns through Deep Q-Learning, featuring real-time visualization, multi-agent support, and performance optimizations.
## Table of Contents

- Key Features
- Technical Specifications
- Real-time Visualization
- Installation
- Usage
- Performance Features
- Automatic Saving
- Controls
- Training Graphs
- Latest Improvements
- Next Steps
- Requirements
- Configuration
- Project Structure
- Contributing
- License
- Security
- Code of Conduct
## Key Features

- Deep Q-Network with Prioritized Experience Replay (PER)
- Real-time game visualization with an informative HUD
- Multi-agent support (2-6 snakes training simultaneously)
- Live training statistics and performance graphs
- Automatic checkpointing and model saving
- CUDA-accelerated training with Automatic Mixed Precision (AMP)
- Adaptive learning parameters
- Enhanced exploration/exploitation balance
- Interactive agent count configuration
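The PER component can be illustrated with a minimal proportional-priority replay buffer. This is a sketch with hypothetical names; the project's actual implementation lives in `model.py`:

```python
import numpy as np

class PrioritizedReplayBuffer:
    """Minimal proportional PER sketch: transitions are sampled with
    probability proportional to |TD error| ** alpha, so surprising
    transitions are replayed more often."""

    def __init__(self, capacity=100_000, alpha=0.6):
        self.capacity = capacity
        self.alpha = alpha
        self.buffer = []
        self.priorities = []

    def push(self, transition, td_error=1.0):
        # Drop the oldest entry once capacity is reached
        if len(self.buffer) >= self.capacity:
            self.buffer.pop(0)
            self.priorities.pop(0)
        self.buffer.append(transition)
        # Small epsilon keeps zero-error transitions sampleable
        self.priorities.append((abs(td_error) + 1e-5) ** self.alpha)

    def sample(self, batch_size):
        probs = np.array(self.priorities)
        probs /= probs.sum()
        idx = np.random.choice(len(self.buffer), batch_size, p=probs)
        return [self.buffer[i] for i in idx], idx
```

A production version would typically use a sum-tree for O(log n) sampling and importance-sampling weights; this sketch only shows the core idea.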
## Technical Specifications

| Feature | Description |
|---|---|
| Framework | PyTorch with CUDA support |
| Visualization | Pygame, Matplotlib |
| Neural Network | Input layer: 17 neurons (enhanced state); hidden layer: 512 neurons; output layer: 4 actions (movement directions) |
| Training Parameters | Learning rate: 0.0005; gamma: 0.99; initial epsilon: 1.0; epsilon decay: 0.997; memory size: 100,000; batch size: 64 |
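The hyperparameters above could be grouped into a single config object, sketched here with illustrative names (the actual variables in `train_multi.py` may differ):

```python
from dataclasses import dataclass

@dataclass
class TrainConfig:
    # Values taken from the specifications table
    learning_rate: float = 0.0005
    gamma: float = 0.99           # discount factor
    epsilon_start: float = 1.0    # initial exploration rate
    epsilon_decay: float = 0.997  # multiplicative decay per episode
    memory_size: int = 100_000    # replay buffer capacity
    batch_size: int = 64
    state_size: int = 17          # enhanced state vector
    hidden_size: int = 512
    action_size: int = 4          # movement directions

cfg = TrainConfig()
```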
## Real-time Visualization

The on-screen HUD and live graphs display:

- Individual scores for each snake
- Best score achieved
- Rolling average (last 100 games)
- Exploration rate (Epsilon)
- Samples collected
- Training performance (FPS)
- Loss function evolution
- Score distribution histogram
- Multi-agent interaction visualization
## Installation

1. Clone the repository:

   ```bash
   git clone https://github.com/Anroshka/snake-ai.git
   cd snake-ai
   ```

2. Install dependencies:

   ```bash
   pip install -r requirements.txt
   ```

   For CUDA support (recommended):

   ```bash
   pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
   ```

## Usage

Start training:

```bash
python train_multi.py
```

You'll be prompted to enter the number of snakes (2-6 recommended) for training.
## Performance Features

- Multi-agent learning environment
- Multi-threaded processing
- Automatic device selection (GPU/CPU)
- Optimized rendering with FPS control
- Gradient clipping for stability
- Automatic Mixed Precision (AMP)
- Priority Experience Replay
- Efficient memory management
- Advanced reward system
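The AMP and gradient-clipping features above can be sketched as a single training step. This is a minimal sketch with a stand-in network; the project's actual loop in `train_multi.py` will differ:

```python
import torch
import torch.nn as nn

# Stand-in for the project's DQN (17 inputs -> 512 hidden -> 4 actions)
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
net = nn.Sequential(nn.Linear(17, 512), nn.ReLU(), nn.Linear(512, 4)).to(device)
opt = torch.optim.Adam(net.parameters(), lr=5e-4)
# GradScaler is a no-op on CPU, so the same code runs on either device
scaler = torch.cuda.amp.GradScaler(enabled=device.type == "cuda")

def train_step(states, targets):
    """One AMP-enabled optimization step with gradient clipping."""
    opt.zero_grad()
    with torch.autocast(device_type=device.type, enabled=device.type == "cuda"):
        loss = nn.functional.mse_loss(net(states), targets)
    scaler.scale(loss).backward()
    scaler.unscale_(opt)  # clip on unscaled gradients, not fp16-scaled ones
    nn.utils.clip_grad_norm_(net.parameters(), max_norm=1.0)
    scaler.step(opt)
    scaler.update()
    return loss.item()
```

Clipping after `scaler.unscale_` matters: clipping the still-scaled fp16 gradients would apply the wrong norm threshold.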
## Automatic Saving

- Best model preservation
- Checkpoints every 10 episodes
- Training statistics graphs
- Performance metrics tracking
- Individual agent models saving
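The every-10-episodes checkpointing could look like the following sketch; the file names and dictionary keys here are illustrative assumptions, not the project's actual format:

```python
import os
import torch
import torch.nn as nn

def save_checkpoint(model, optimizer, episode, best_score,
                    directory="models", every=10):
    """Save a full checkpoint every `every` episodes.

    Returns the checkpoint path, or None when this episode is skipped.
    Saving optimizer state alongside model weights lets training
    resume exactly where it left off.
    """
    if episode % every != 0:
        return None
    os.makedirs(directory, exist_ok=True)
    path = os.path.join(directory, f"checkpoint_ep{episode}.pt")
    torch.save({
        "episode": episode,
        "best_score": best_score,
        "model_state": model.state_dict(),
        "optimizer_state": optimizer.state_dict(),
    }, path)
    return path
```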
## Controls

- ESC: Exit training
- Automatic gameplay during training
- Visualization every 10 episodes
- Training stats display
- Interactive agent count selection
## Training Graphs

- Multi-agent learning progress
- Loss function
- Epsilon decay
- Score distribution
- Moving averages
- Agent interaction patterns
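The moving average plotted in the graphs (the rolling average over the last 100 games) can be computed as in this sketch:

```python
import numpy as np

def rolling_average(scores, window=100):
    """Mean over the last `window` games at each point in training.

    Prefixes shorter than the window average over whatever exists,
    so the curve starts from game one instead of game `window`.
    """
    return [float(np.mean(scores[max(0, i - window + 1):i + 1]))
            for i in range(len(scores))]
```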
## Latest Improvements

- Added multi-agent support (2-6 snakes)
- Enhanced visualization with semi-transparent HUD
- Stabilized FPS for better visualization
- Minimum samples threshold for training start
- Improved error handling and stability
- Better memory management
- Adaptive learning parameters
- Interactive configuration system
## Next Steps

- Add competitive and cooperative training modes
- Implement agent specialization
- Optimize multi-agent interactions
- Develop adaptive learning rate scheduling
- Explore different state representations
- Add agent personality traits
## Requirements

- Python 3.10+
- CUDA-capable GPU (recommended)
- PyTorch 2.0+
- Pygame 2.4.0
- Matplotlib for visualization
## Configuration

All training parameters can be adjusted in `train_multi.py`:
- Number of agents (2-6)
- Episode count
- Memory size
- Batch size
- Learning rate
- Epsilon decay
- Visualization frequency
- Reward system parameters
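As an example of how the epsilon decay of 0.997 behaves: epsilon shrinks multiplicatively each episode, falling to roughly 5% after about 1,000 episodes. A sketch (the minimum floor value is an assumption; the project may use a different one):

```python
def epsilon_at(episode, start=1.0, decay=0.997, minimum=0.01):
    """Exploration rate after `episode` multiplicative decay steps,
    floored at `minimum` so the agent never stops exploring entirely."""
    return max(minimum, start * decay ** episode)
```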
## Project Structure

- `game.py`: Snake game environment with multi-agent support
- `model.py`: DQN implementation with PER
- `train_multi.py`: Multi-agent training loop and visualization
- `models/`: Saved models and checkpoints
## Contributing

We welcome contributions to Snake AI! Please read our Contributing Guidelines for details on how to submit pull requests, report issues, and contribute to the project.
Before contributing, please read our Code of Conduct.
## License

This project is licensed under the MIT License - see the LICENSE file for details.
## Security

For details about our security policy and how to report security vulnerabilities, please see our Security Policy.
## Code of Conduct

This project and everyone participating in it is governed by our Code of Conduct. By participating, you are expected to uphold this code.
## Acknowledgments

- Thanks to all contributors who have helped shape Snake AI
- Special thanks to the PyTorch and Pygame communities
- Inspired by various reinforcement learning implementations
## Support

For questions and support, please:
- Check existing Issues
- Create a new issue if needed
- Follow our Security Policy for reporting vulnerabilities