ASR with Deep Reinforcement Learning

This repository contains the course project "Automatic Speech Recognition Using Deep Reinforcement Learning" for 11-785: Introduction to Deep Learning at Carnegie Mellon University, Fall 2022.

CTC Decode Install

Run the following commands in the given order to install all packages:

pip install -r requirements.txt

git clone --recursive https://github.com/parlance/ctcdecode.git
pip install wget
cd ctcdecode
pip install .
cd ..
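After the install steps, a quick check that the packages are importable can save a failed training run. The snippet below is a minimal sketch and not part of this repository: the helper name `is_installed` is hypothetical, and it assumes the packages are importable under the names `ctcdecode` and `wget`.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if `module_name` resolves to an installed module.

    Hypothetical helper for illustration. A bare directory without an
    __init__.py (such as a fresh clone sitting in the working directory)
    resolves as a namespace package with no origin file, so it is not
    counted as installed.
    """
    spec = importlib.util.find_spec(module_name)
    return spec is not None and spec.origin not in (None, "namespace")


if __name__ == "__main__":
    # Module names are assumptions based on the install commands above.
    for name in ("ctcdecode", "wget"):
        status = "found" if is_installed(name) else "MISSING"
        print(f"{name}: {status}")
```

If `ctcdecode` is reported as missing, re-running `pip install .` from inside the cloned `ctcdecode` directory is the first thing to try.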

Instructions to run

To train the baseline model:

python baseline_training.py

To fine-tune the policy gradient model:

python rl_training.py
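For readers unfamiliar with policy gradient fine-tuning, the general idea can be sketched as follows: sample an output sequence from the model's per-frame distributions, score it with a reward, and weight the sample's log-probability by that reward. This is an illustrative REINFORCE-style sketch in plain Python, not the code in rl_training.py or rl_loss.py. The function names, the baseline term, and the toy probabilities are all assumptions, and a real implementation would use tensors with automatic differentiation instead of floats.

```python
import math
import random
from typing import List, Sequence, Tuple


def sample_sequence(
    probs: Sequence[Sequence[float]], rng: random.Random
) -> Tuple[List[int], float]:
    """Draw one token index per frame from its categorical distribution.

    Returns the sampled indices and the summed log-probability of the sample.
    """
    tokens: List[int] = []
    log_prob = 0.0
    for frame in probs:
        idx = rng.choices(range(len(frame)), weights=frame, k=1)[0]
        tokens.append(idx)
        log_prob += math.log(frame[idx])
    return tokens, log_prob


def reinforce_loss(log_prob: float, reward: float, baseline: float = 0.0) -> float:
    """REINFORCE surrogate loss for a single sampled sequence.

    Minimizing it raises the probability of samples whose reward
    exceeds the baseline and lowers it for the rest.
    """
    return -(reward - baseline) * log_prob


if __name__ == "__main__":
    # Toy distributions over 3 tokens for 2 frames (made-up numbers).
    toy_probs = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1]]
    tokens, log_prob = sample_sequence(toy_probs, random.Random(0))
    # A hypothetical reward, e.g. the negative edit distance to the reference.
    print(tokens, reinforce_loss(log_prob, reward=-1.0))
```

Subtracting a baseline leaves the expected gradient unchanged but usually reduces its variance, which is why it appears as an optional argument here.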

Files

baseline_modules.py - Definition of the baseline model
baseline_training.py - Training and evaluation functions for the baseline model, and the calls that run them
dataloading.py - Dataset and dataloader definitions
model_arch.py - Architecture of the baseline model
multinomial_decoder.py - Multinomial decoder definitions
rl_loss.py - Reinforcement learning loss function
rl_training.py - Training and evaluation functions for the policy gradient model, and the calls that run them
utils.py - Levenshtein distance calculation
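As background for the Levenshtein distance mentioned above, the following is a minimal sketch of the standard dynamic-programming algorithm. It is not the code in utils.py, and the function name `levenshtein` and its signature are assumptions made for illustration. It works on any pair of sequences, so it can compare strings character by character or lists of words.

```python
from typing import Sequence


def levenshtein(ref: Sequence, hyp: Sequence) -> int:
    """Minimum number of insertions, deletions, and substitutions
    needed to turn `ref` into `hyp`."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # turning ref[:i] into an empty sequence takes i deletions
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(
                min(
                    prev[j] + 1,         # delete r
                    curr[j - 1] + 1,     # insert h
                    prev[j - 1] + cost,  # match or substitute
                )
            )
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    print(levenshtein("kitten", "sitting"))
    print(levenshtein(["the", "cat", "sat"], ["the", "hat", "sat"]))
```

Keeping only two rows of the table uses memory proportional to the length of the hypothesis, which matters little for short transcripts but costs nothing in clarity.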
