Text-to-Speech (TTS) with PyTorch

About • Installation • How To Use • Credits • License

About

This repository contains an implemetation of HiFi-GAN with PyTorch.

Installation

Follow these steps to install the project:

(Optional) Create and activate new environment using conda or venv (+pyenv).

a. conda version:

# create env
conda create -n project_env python=PYTHON_VERSION

# activate env
conda activate project_env

b. venv (+pyenv) version:

# create env
~/.pyenv/versions/PYTHON_VERSION/bin/python3 -m venv project_env

# alternatively, using default python version
python3 -m venv project_env

# activate env
source project_env

Install all required packages
```
pip install -r requirements.txt
```
Install pre-commit:
```
pre-commit install
```

How To Use

To train a model, run the following command:

python3 train.py -cn=hifigan

How To Download

To download pretrained model, use:

gdown https://drive.google.com/uc?id=1n9DVZznWy49nKiSljAbqdQvNiZ_VcPOa

How To Evaluate

To synthesize an audio from audio, your dataset should follow this structure:

NameOfTheDirectoryWithUtterances
└── transcriptions
    ├── UtteranceID1.wav
    ├── UtteranceID2.wav
    .
    .
    .
    └── UtteranceIDn.wav

To get predictions, run

python3 synthesize.py -cn=from_audio '+datasets.test.audio_dir=<PATH-TO-DIR>' 'inferencer.from_pretrained=<PATH-TO-PRETRAINED-MODEL>'

To synthesize an audio from text, your dataset should follow this structure:

NameOfTheDirectoryWithUtterances
└── transcriptions
    ├── UtteranceID1.txt
    ├── UtteranceID2.txt
    .
    .
    .
    └── UtteranceIDn.txt

To get predictions, run

python3 synthesize.py -cn=from_text '+datasets.test.data_dir=<PATH-TO-DIR>' 'inferencer.from_pretrained=<PATH-TO-PRETRAINED-MODEL>'

If you want to pass text from cli, run:

python3 synthesize.py -cn=from_cli '+datasets.test.index=[{text: "<YOUR-TEXT>", path: "text.txt", audio_len: 0}]' 'inferencer.from_pretrained=<PATH-TO-PRETRAINED-MODEL>'

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
src		src
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
synthesize.py		synthesize.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text-to-Speech (TTS) with PyTorch

About

Installation

How To Use

How To Download

How To Evaluate

Report

Credits

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Text-to-Speech (TTS) with PyTorch

About

Installation

How To Use

How To Download

How To Evaluate

Report

Credits

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages