Skip to content

vbuylova/DeepSpeech2

Repository files navigation

Automatic Speech Recognition (ASR) with PyTorch

AboutInstallationHow To UseCreditsLicense

About

This repository contains a template for solving ASR task with PyTorch. This template branch is a part of the HSE DLA course ASR homework. Some parts of the code are missing (or do not follow the most optimal design choices...) and students are required to fill these parts themselves (as well as writing their own models, etc.).

See the task assignment here.

Installation

Follow these steps to install the project:

  1. (Optional) Create and activate new environment using conda or venv (+pyenv).

    a. conda version:

    # create env
    conda create -n project_env python=PYTHON_VERSION
    
    # activate env
    conda activate project_env

    b. venv (+pyenv) version:

    # create env
    ~/.pyenv/versions/PYTHON_VERSION/bin/python3 -m venv project_env
    
    # alternatively, using default python version
    python3 -m venv project_env
    
    # activate env
    source project_env
  2. Install all required packages

    pip install -r requirements.txt
  3. Install pre-commit:

    pre-commit install

How To Use

To train a model, download the files from here and locate them to the working repository. Then, run the following commands:

  1. Train 30 epochs on train-clean-100.
python3 train.py -cn=deepspeech2
  1. Train 20 epochs with augmentations on train-clean-360.
python3 train.py -cn=deepspeech2_augs
  1. Train 20 epochs on train-other-500.
python3 train.py -cn=deepspeech2_other
  1. Train 20 more epochs on train-other-500.
python3 train.py -cn=deepspeech2_other_round2

To run inference (evaluate the model or save predictions), download the files from here and locate them to the working repository. Then, run the following commands:

  1. To run inference on clean set:
python3 inference.py -cn=inference_clean
  1. To run inference on other set:
python3 inference.py -cn=inference_other

Credits

This repository is based on a PyTorch Project Template.

License

License

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages