🎀 Advanced Chinese Text-to-Speech with AI Vocabulary Learning - Karaoke-style word highlighting, multiple TTS engines, Montreal Forced Alignment, and AI-powered vocabulary management for Chinese language learners

CGAlei/FastTTS

FastTTS - Advanced Text-to-Speech Application

A powerful Text-to-Speech application with word-level synchronization, vocabulary management, and multiple TTS engine support.

Features

  • Multiple TTS Engines: Support for Edge TTS and Minimax TTS
  • Word-Level Synchronization: Precise timing using Montreal Forced Alignment (MFA)
  • Vocabulary Management: Track and rate word difficulty
  • Session Management: Save and replay TTS sessions
  • Web Interface: Modern, responsive UI with karaoke-style playback
  • AI Integration: LLM support for AI-powered vocabulary management
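
To illustrate the karaoke-style playback mentioned above, here is a minimal sketch of how word-level timings could drive highlighting. It assumes the aligner yields a sorted list of (word, start, end) entries in seconds; the `WordTiming` type and `active_word_index` function are hypothetical names for illustration, not part of FastTTS.

```python
from bisect import bisect_right
from typing import NamedTuple, Optional, Sequence


class WordTiming(NamedTuple):
    """Hypothetical record for one aligned word (times in seconds)."""
    text: str
    start: float
    end: float


def active_word_index(timings: Sequence[WordTiming], t: float) -> Optional[int]:
    """Return the index of the word spoken at time t, or None during a gap."""
    # Timings are assumed to be sorted by start time and non-overlapping.
    starts = [w.start for w in timings]
    i = bisect_right(starts, t) - 1  # last word whose start is <= t
    if i >= 0 and t < timings[i].end:
        return i
    return None


# Made-up example timings for a two-word sentence.
timings = [WordTiming("你好", 0.00, 0.45), WordTiming("世界", 0.50, 1.10)]
```

A player would call `active_word_index` with the current audio position on each tick and highlight the returned word, clearing the highlight when the result is `None`.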

Installation

Prerequisites

  • Python 3.10+
  • Conda/Miniconda
  • Git

Setup

  1. Clone the repository:
git clone https://github.com/YOUR_USERNAME/FastTTS.git
cd FastTTS
  2. Create the conda environment:
conda env create -f environment.yml
conda activate fasttts-mfa
  3. Install Python dependencies:
pip install -r requirements.txt
  4. Configure environment variables:
cp .env.example .env
# Edit .env with your API keys
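
For readers unfamiliar with the .env format, the sketch below parses simple KEY=value lines using only the standard library. It is an illustration of the file format, not a description of how FastTTS loads its settings (the project may well use a dedicated library); the key names `EXAMPLE_API_KEY` and `EXAMPLE_REGION` are made up, so check .env.example for the real ones.

```python
def parse_env(text: str) -> dict[str, str]:
    """Parse KEY=value lines, skipping blank lines and # comments."""
    values: dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


# Hypothetical file contents; the real keys are listed in .env.example.
sample = '# Edit with your API keys\nEXAMPLE_API_KEY="abc123"\nEXAMPLE_REGION = eu\n'
config = parse_env(sample)
```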

Usage

Basic Usage

# Start the application
python main.py

# Or use the launcher scripts
./launch-fasttts.sh       # Linux/Mac
launch-fasttts.bat        # Windows

With MFA Support

# Start with Montreal Forced Alignment
./start_fasttts_mfa.sh    # Linux/Mac
start_fasttts_mfa.bat     # Windows

Configuration

  • API Keys: Configure in .env file
  • TTS Engines: Modify config/defaults.py
  • Database: SQLite database in db/ directory
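
As a rough idea of how word difficulty ratings could be kept in SQLite, here is a small sketch using Python's built-in sqlite3 module. The `vocabulary` table, its columns, and the integer rating scale are assumptions made for illustration; the actual schema in db/ may differ. An in-memory database is used so the example leaves no files behind.

```python
import sqlite3

# In-memory database for illustration; FastTTS keeps its files under db/.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE vocabulary (word TEXT PRIMARY KEY, rating INTEGER NOT NULL)"
)


def rate_word(conn: sqlite3.Connection, word: str, rating: int) -> None:
    """Insert a word with its difficulty rating, or update an existing rating."""
    conn.execute(
        "INSERT INTO vocabulary (word, rating) VALUES (?, ?) "
        "ON CONFLICT(word) DO UPDATE SET rating = excluded.rating",
        (word, rating),
    )
    conn.commit()


rate_word(conn, "你好", 1)
rate_word(conn, "世界", 3)
rate_word(conn, "你好", 2)  # re-rating replaces the earlier value

rows = conn.execute(
    "SELECT word, rating FROM vocabulary ORDER BY rating DESC"
).fetchall()
```

The upsert keeps one row per word, so re-rating a word overwrites its previous difficulty instead of creating a duplicate.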

Testing

# Run tests
python -m pytest test_AI/

# Run rating tests
python run_rating_tests.py

Project Structure

FastTTS/
├── main.py                # Application entry point
├── components/            # UI components
├── tts/                   # TTS engine implementations
├── alignment/             # MFA alignment tools
├── routes/                # Web routes
├── static/                # CSS/JS assets
├── utils/                 # Utility functions
├── config/                # Configuration files
└── test_AI/               # Test suite

License

This project is licensed under the MIT License.

Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Submit a pull request

Support

For issues and questions, please open an issue on GitHub.
