This project implements the Soft Actor-Critic (SAC) and Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithms to solve continuous-action control tasks in several Gymnasium environments.
Here are some of the results.
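As a quick refresher on the two algorithms: TD3 trains two critics and bootstraps from the smaller of their target estimates to curb Q-value overestimation, while SAC instead adds an entropy bonus to the actor objective. Below is a minimal NumPy sketch of TD3's clipped double-Q target; the function and variable names are illustrative and not taken from this repository:

```python
import numpy as np

def clipped_double_q_target(reward, done, next_q1, next_q2, gamma=0.99):
    # Bootstrap from the minimum of the two target-critic estimates;
    # terminal transitions (done == 1) contribute only the reward.
    next_q = np.minimum(next_q1, next_q2)
    return reward + gamma * (1.0 - done) * next_q

# Terminal step: no bootstrapping, so the target equals the reward.
print(clipped_double_q_target(1.0, 1.0, 5.0, 4.0))  # -> 1.0
```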
Bipedal Walker (TD3):
Humanoid (SAC):
```
.
├── env.yaml
├── SAC
│   ├── agent.py
│   └── configs
├── scripts
│   ├── environments_overview.ipynb
│   ├── main.py
│   └── results.ipynb
├── src
│   ├── environment.py
│   ├── networks.py
│   └── utils.py
└── TD3
    ├── agent.py
    └── configs
```

- The `env.yaml` file allows you to create a new conda environment with the same packages used in this project, which is required to run the code. To create the environment, execute:
```bash
conda env create -f env.yaml
```
- The `SAC` and `TD3` directories contain the implementation of each algorithm (`agent.py`); the parameters and hyperparameters are set up in the `configs` subdirectory for each environment.
- The `main.py` script contains the code to run the `SAC` and `TD3` algorithms.
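The `-env`/`-alg` command-line interface described below could be wired up with `argparse` roughly as follows. This is a hypothetical sketch, not the actual contents of `main.py`, which may name its options and defaults differently:

```python
import argparse

def build_parser():
    # Hypothetical CLI sketch matching the flags described in this README.
    parser = argparse.ArgumentParser(
        description="Train SAC or TD3 on a Gymnasium environment")
    parser.add_argument("-env", required=True,
                        help="Gymnasium environment name")
    parser.add_argument("-alg", required=True, choices=["sac", "td3"],
                        help="algorithm to run")
    return parser

# Example of parsing a hypothetical invocation.
args = build_parser().parse_args(["-env", "Humanoid", "-alg", "sac"])
print(args.env, args.alg)  # -> Humanoid sac
```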
The code can be executed from the parent directory or from the `scripts` directory. From the parent directory, use:

```bash
python scripts/main.py -env <ENVIRONMENT_NAME> -alg <ALGORITHM>
```

The options for `<ENVIRONMENT_NAME>` are:
The options for `<ALGORITHM>` are:
- sac
- td3
From the `scripts` subdirectory, the execution is as follows:

```bash
python main.py -env <ENVIRONMENT_NAME> -alg <ALGORITHM>
```

Three subdirectories will be created after executing this command:
- `checkpoints` to save the models and training data
- `logs` to keep track of the hyperparameters utilized
- `results` to store the final results
Inside these directories, specific subdirectories are created for each environment.
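The layout above can be reproduced with a few lines of standard-library Python. This is an illustrative sketch of the directory structure described in this README, not the repository's actual code:

```python
import os
import tempfile

def make_run_dirs(root, env_name):
    # Create checkpoints/, logs/ and results/, each with a
    # per-environment subdirectory, as described above.
    paths = {}
    for sub in ("checkpoints", "logs", "results"):
        path = os.path.join(root, sub, env_name)
        os.makedirs(path, exist_ok=True)
        paths[sub] = path
    return paths

# Example against a throwaway root directory.
run_dirs = make_run_dirs(tempfile.mkdtemp(), "Humanoid")
print(sorted(run_dirs))  # -> ['checkpoints', 'logs', 'results']
```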
<https://docs.google.com/presentation/d/16EGlFeVgT5UstF_6QOyH6OSu9X48F7HwPjW9v2mAvzM/edit?usp=sharing>

