To use the launcher, run in the cloned repository:
python -m venv .env
source .env/bin/activate
pip install -r requirements.txtTo use the launcher, just run main.py. All parameters are stored in config.json.
Adding a new model architecture is essentially replacing a Policy. Currently config.json specifies a CnnPolicy
which comes bundled with stable-baselines. See stable_baselines/common/policies.py for examples of how to define
custom policies.
We also include a copy of the code for training algorithms here so that it can be modified more easily.
The complete trained model is stored in stored under saved_models as env_name-model_name-policy_type.pkl.
The config file and 100-step reward averages are stored under saved_metrics as env_name-model_name-policy_type.txt.
To install Jupyter, register a new kernel, and start a notebook, run in the virtual environment:
pip install jupyter
ipython kernel install --user --name=.env
jupyter notebookThen activate the .env kernel in the notebook.
To make the logging work, add and execute after importing the logging module in the notebook:
logger = logging.getLogger()
logger.setLevel(logging.DEBUG)