Skip to content

Prometheus-AI-Project/2024_2_Read_My_Lips

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

8 Commits
Β 
Β 
Β 
Β 

Repository files navigation

πŸ’‹ Read My Lips

Acknowledgement

Thanks to the KAIST-AILab for providing resources. Models are trained on the Vox, LRS2 and LRS3 dataset.

@inproceedings{ahn2024syncvsr,
  author={Young Jin Ahn and Jungwoo Park and Sangha Park and Jonghyun Choi and Kee-Eung Kim},
  title={{SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization}},
  booktitle={Proc. Interspeech 2024},
  doi={10.21437/Interspeech.2024-432}
}

About

Audio-Visual Speech Recognition Project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages