Audio Authentication with wav2vec-AASIST whisper-AASIST and mfcc-AASIST
Citation:
Tak, Hemlata, et al. "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation." The Speaker and Language Recognition Workshop, 2022.
Glow-TTS
Kim et al. (2020). Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search. NeurIPS 2020.
https://arxiv.org/abs/2005.11129
Tacotron 2
Shen et al. (2018). Natural TTS synthesis by conditioning WaveNet on mel spectrogram predictions. ICASSP 2018.
https://arxiv.org/abs/1712.05884