Skip to content

bcornish1797/real-time-voice-translator

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

36 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LinguaSync: Real-Time Voice Translator

language

Real-Time Voice Translator is a machine learning project that aims to provide a seamless and natural experience of cross-lingual communication. It uses deep neural networks to translate voice from one language to another in real time while preserving the tone and emotion of the speaker. It is a desktop application that supports Windows, Linux, and Mac operating systems.

The application is easy to use: simply select the languages you want to translate between and start speaking. The application will listen to your voice and provide instant translations in real-time. You can also use the application to translate conversations between two or more people.

Dependencies

<=Python3.11, gTTS, pyaudio, playsound==1.2.2, deep-translator, SpeechRecognition, google-transliteration-api, cx-Freeze

Getting started

  1. Clone this project and create virtualenv (recommended) and activate virtualenv.

    # Create virtualenv
    python -m venv env
    
    # Linux/MacOS
    source env/bin/activate
    
    # Windows
    env\Scripts\activate
    
  2. Install require dependencies.

    pip install --upgrade wheel
    
    pip install -r requirements.txt
    
  3. Run code and speech (have fun).

    python main.py
    

Program Flow:

Block Diagram of Voice Translator

Install Windows/Linux/Mac Application DOWNLOAD

I am using cx_Freeze to build executable file of this app. The build settings can be changed by modifying the setup.py file.

Build installer containing all the files:
  • Windows: python setup.py bdist_msi
  • Linux: python setup.py bdist_rpm
  • Mac: python setup.py bdist_mac

GUI

App GUI

About

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Tcl 90.6%
  • HTML 6.4%
  • PLSQL 1.3%
  • Roff 1.2%
  • Python 0.4%
  • DTrace 0.1%