About • Installation • How To Use • Credits • License
This repository contains a template for solving ASR task with PyTorch. This template branch is a part of the HSE DLA course ASR homework. Some parts of the code are missing (or do not follow the most optimal design choices...) and students are required to fill these parts themselves (as well as writing their own models, etc.).
See the task assignment here.
Follow these steps to install the project:
-
(Optional) Create and activate new environment using
condaorvenv(+pyenv).a.
condaversion:# create env conda create -n project_env python=PYTHON_VERSION # activate env conda activate project_env
b.
venv(+pyenv) version:# create env ~/.pyenv/versions/PYTHON_VERSION/bin/python3 -m venv project_env # alternatively, using default python version python3 -m venv project_env # activate env source project_env
-
Install all required packages
pip install -r requirements.txt
-
Install
pre-commit:pre-commit install
To train a model, download the files from here and locate them to the working repository. Then, run the following commands:
- Train 30 epochs on train-clean-100.
python3 train.py -cn=deepspeech2- Train 20 epochs with augmentations on train-clean-360.
python3 train.py -cn=deepspeech2_augs- Train 20 epochs on train-other-500.
python3 train.py -cn=deepspeech2_other- Train 20 more epochs on train-other-500.
python3 train.py -cn=deepspeech2_other_round2To run inference (evaluate the model or save predictions), download the files from here and locate them to the working repository. Then, run the following commands:
- To run inference on clean set:
python3 inference.py -cn=inference_clean- To run inference on other set:
python3 inference.py -cn=inference_otherThis repository is based on a PyTorch Project Template.