Lab materials for the Reinforcement Learning class
git clone https://github.com/inmo-jang/rl_labs.git
cd rl_labs
pip install -r requirements.txt
Deterministic Mode
python gyms/simple_maze_grid.py
- Keyboard Control
  - q or ESC: quit
  - r: reset
Stochastic Mode
python gyms/simple_maze_grid.py --stochastic=True --epsilon=0.2
With --epsilon=0.2, the environment executes a randomly chosen action instead of the requested one 20% of the time.
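The effect of the epsilon setting can be illustrated with a minimal sketch. This is not the repository's code: the helper name `apply_epsilon_noise`, the uniform sampling over all actions (which may pick the requested action again), and the value `n_actions=4` are assumptions made for illustration only.

```python
import random


def apply_epsilon_noise(intended_action, n_actions, epsilon, rng):
    """Hypothetical helper: with probability epsilon, replace the requested
    action with one drawn uniformly from all actions; otherwise keep it.
    Assumption: the random draw may coincide with the requested action."""
    if rng.random() < epsilon:
        return rng.randrange(n_actions)
    return intended_action


rng = random.Random(42)  # fixed seed so the run is repeatable
n_actions = 4            # illustrative value, not taken from the repository
trials = 10_000

# Always request action 0 and count how often a different action is executed.
executed = [apply_epsilon_noise(0, n_actions, 0.2, rng) for _ in range(trials)]
deviation_rate = sum(a != 0 for a in executed) / trials
print(f"executed action differed from the request in {deviation_rate:.1%} of steps")
```

Under these assumptions the executed action differs from the requested one somewhat less than 20% of the time, because a uniform draw sometimes lands on the requested action anyway.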
env = SimpleMazeGrid(n=5, k=3, m=2, render_option=True, random_seed=42)
# n = Length of one side of the grid
# k = First index from which the goal can be generated (goal positions range from k to the end of the grid)
# m = Number of obstacles
- Actions: Discrete(2)
- Observation Space: Discrete(n * n)
- Reward: 10 when arriving at the goal, -10 when arriving at a pit, -1 otherwise
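As a rough illustration of the observation space and reward scheme listed above, the sketch below maps grid cells to a single integer state and scores a short trajectory. The row-major encoding, the function names, and the goal and pit positions are assumptions for this example; the repository may encode states differently.

```python
GOAL_REWARD = 10   # reward when arriving at the goal
PIT_REWARD = -10   # reward when arriving at a pit
STEP_REWARD = -1   # reward for any other move


def cell_to_state(row, col, n):
    """Assumed row-major encoding of a cell into Discrete(n * n)."""
    return row * n + col


def state_to_cell(state, n):
    """Inverse of cell_to_state: returns (row, col)."""
    return divmod(state, n)


def reward_for(cell, goal, pits):
    """Reward for arriving at `cell`, following the scheme listed above."""
    if cell == goal:
        return GOAL_REWARD
    if cell in pits:
        return PIT_REWARD
    return STEP_REWARD


n = 5                    # grid side length, as in the constructor example
goal = (4, 4)            # hypothetical goal position
pits = {(1, 1), (2, 3)}  # hypothetical pit positions

# Cells arrived at during a short hypothetical episode that ends at the goal.
arrived = [(3, 3), (3, 4), (4, 4)]
episode_return = sum(reward_for(cell, goal, pits) for cell in arrived)
print(episode_return)  # -1 + -1 + 10 = 8
```

The -1 reward on every non-terminal move means shorter paths to the goal earn a higher return, which is why the step penalty is included in the scheme.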
