This repository contains the course project "Automatic Speech Recognition Using Deep Reinforcement Learning" for 11-785: Introduction to Deep Learning at Carnegie Mellon University, Fall 2022.
Run the following commands, in the given order, to install all packages:
pip install -r requirements.txt
git clone --recursive https://github.com/parlance/ctcdecode.git
pip install wget
cd ctcdecode
pip install .
cd ..
To train the baseline model:
python baseline_training.py
To fine-tune the policy gradient model:
python rl_training.py
- baseline_modules.py - Definition of the baseline model
- baseline_training.py - Training and evaluation functions for the baseline model, and the calls to them
- dataloading.py - Dataset and DataLoader definitions
- model_arch.py - Architecture of the baseline
- multinomial_decoder.py - Multinomial decoder definitions
- rl_loss.py - Reinforcement learning loss function
- rl_training.py - Training and evaluation functions for the policy gradient model, and the calls to them
- utils.py - Levenshtein distance calculation code
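
For readers unfamiliar with the Levenshtein distance mentioned for utils.py, here is a minimal sketch of the standard dynamic-programming computation. It is illustrative only: the function name `levenshtein` and its signature are assumptions and may not match the code in utils.py.

```python
def levenshtein(ref, hyp):
    """Return the edit distance between two sequences (strings or token lists).

    Hypothetical helper for illustration; not necessarily the utils.py API.
    """
    # prev[j] = distance between the ref prefix processed so far and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        # Turning ref[:i] into an empty sequence costs i deletions
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete r from ref
                curr[j - 1] + 1,     # insert h into ref
                prev[j - 1] + cost,  # substitute r with h (free if equal)
            ))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    print(levenshtein("kitten", "sitting"))
```

Dividing this distance by the length of the reference transcript gives a normalized error rate, which is a common way to score speech recognition output.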