Continuous Vocoder

It is a system for speech analysis and synthesis. It can estimate continuous parameters: Fundamental frequency (F0), maximum voiced frequency (MVF) and spectral envelope (MGC). It also generate the speech like input speech using only 3 estimated continuous parameters with lower dimensions.

Continuous vocoder can be used for Statistical Parametric Speech Synthesis (SPSS), Text-to-Speech (TTS), and Voice Conversion (VC).

Requirements:

Sox
SPTK
Octave
Python 3
Python packages: numpy, scipy, pysptk, ssp, struct

Install

$ git clone https://github.com/malradhi/continuous-vocoder.git

Testing / HowTo

In the root directory, simply run:

$ make

You can also have a look at the file cont_features_extraction.py and cont_speech_synthesis.py to see how the continuous vocoder can work.

Legal

http://smartlab.tmit.bme.hu/index-en

Demo

Audio samples are also available in English and Arabic (with emotions) at http://smartlab.tmit.bme.hu/vocoder_Arabic_2018

References

This system is based on the method described in the papers:

[1] Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh, Time-domain envelope modulating the noise component of excitation in a continuous residual-based vocoder for statistical parametric speech synthesis, in Proceedings of the 18th Interspeech conference, pp. 434-438, Stockholm, Sweeden, 2017. https://www.doi.org/10.21437/Interspeech.2017-678

[2] Mohammed Salah Al-Radhi, Omnia Abdo, Tamás Gábor Csapó, Sherif Abdou, Géza Németh, Mervat Fashal, A continuous vocoder for statistical parametric speech synthesis and its evaluation using an audio-visual phonetically annotated Arabic corpus, Computer Speech and Language, ScienceDirect Elsevier, 60, 2019. https://doi.org/10.1016/j.csl.2019.101025

[3] Tamás Gábor Csapó, Géza Németh, Cernak M., "Residual-Based Excitation with Continuous F0 Modeling in HMM-Based Speech Synthesis," In Proceedings of the 3rd International Conference on Statistical Language and Speech Processing (SLSP), Budapest, Hungary, vol. 9449, pp. 27-38, 2015. https://doi.org/10.1007/978-3-319-25789-1_4

Name		Name	Last commit message	Last commit date
Latest commit History 93 Commits
example/wav		example/wav
ssp		ssp
01_chk_rqmts.sh		01_chk_rqmts.sh
02_analysis.sh		02_analysis.sh
03_synthesis.sh		03_synthesis.sh
Makefile		Makefile
MaximumVoicedFrequencyEstimation_nopp.m		MaximumVoicedFrequencyEstimation_nopp.m
MaximumVoicedFrequencyEstimation_nopp_run.m		MaximumVoicedFrequencyEstimation_nopp_run.m
README.md		README.md
cont_features_extraction.py		cont_features_extraction.py
cont_speech_synthesis.py		cont_speech_synthesis.py
resid_cdbk_awb_0080_pca.bin		resid_cdbk_awb_0080_pca.bin
resid_cdbk_slt_0080_pca.bin		resid_cdbk_slt_0080_pca.bin
run_continuous_vocoder.sh		run_continuous_vocoder.sh
smooth.m		smooth.m
spec_env		spec_env

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Continuous Vocoder

Requirements:

Install

Testing / HowTo

Legal

Demo

References

About

Uh oh!

Releases

Packages

Languages

malradhi/continuous-vocoder

Folders and files

Latest commit

History

Repository files navigation

Continuous Vocoder

Requirements:

Install

Testing / HowTo

Legal

Demo

References

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages