FastTTS - Advanced Text-to-Speech Application

A powerful Text-to-Speech application with word-level synchronization, vocabulary management, and multiple TTS engine support.

Features

Multiple TTS Engines: Support for Edge TTS and Minimax TTS
Word-Level Synchronization: Precise timing using Montreal Forced Alignment (MFA)
Vocabulary Management: Track and rate word difficulty
Session Management: Save and replay TTS sessions
Web Interface: Modern, responsive UI with karaoke-style playback
AI Integration: LLM support for enhanced functionality

Installation

Prerequisites

Python 3.10+
Conda/Miniconda
Git

Setup

Clone the repository:

git clone https://github.com/YOUR_USERNAME/FastTTS.git
cd FastTTS

Create conda environment:

conda env create -f environment.yml
conda activate fasttts-mfa

Install Python dependencies:

pip install -r requirements.txt

Configure environment variables:

cp .env.example .env
# Edit .env with your API keys

Usage

Basic Usage

# Start the application
python main.py

# Or use the launcher scripts
./launch-fasttts.sh       # Linux/Mac
launch-fasttts.bat        # Windows

With MFA Support

# Start with Montreal Forced Alignment
./start_fasttts_mfa.sh    # Linux/Mac
start_fasttts_mfa.bat     # Windows

Configuration

API Keys: Configure in .env file
TTS Engines: Modify config/defaults.py
Database: SQLite database in db/ directory

Testing

# Run tests
python -m pytest test_AI/

# Run rating tests
python run_rating_tests.py

Project Structure

FastTTS/
├── main.py                 # Application entry point
├── components/             # UI components
├── tts/                   # TTS engine implementations
├── alignment/             # MFA alignment tools
├── routes/                # Web routes
├── static/                # CSS/JS assets
├── utils/                 # Utility functions
├── config/                # Configuration files
└── test_AI/               # Test suite

License

This project is licensed under the MIT License.

Contributing

Fork the repository
Create a feature branch
Make your changes
Submit a pull request

Support

For issues and questions, please open an issue on GitHub.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
alignment		alignment
components		components
config		config
llm		llm
routes		routes
static		static
tts		tts
utils		utils
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.sesskey		.sesskey
README.md		README.md
TS_CLI_README.md		TS_CLI_README.md
app_context.py		app_context.py
convert_existing_sessions.py		convert_existing_sessions.py
debug_logger.py		debug_logger.py
environment.yml		environment.yml
fasttts		fasttts
fts		fts
ftts-session		ftts-session
ftts_session_cli.py		ftts_session_cli.py
general_setup.md		general_setup.md
launch-fasttts.sh		launch-fasttts.sh
llm_manager.py		llm_manager.py
main.py		main.py
manual_folder_sync.py		manual_folder_sync.py
progress_manager.py		progress_manager.py
requirements.txt		requirements.txt
start_fasttts_mfa.sh		start_fasttts_mfa.sh
text_processor.py		text_processor.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

FastTTS - Advanced Text-to-Speech Application

Features

Installation

Prerequisites

Setup

Usage

Basic Usage

With MFA Support

Configuration

Testing

Project Structure

License

Contributing

Support

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

CGAlei/FastTTS

Folders and files

Latest commit

History

Repository files navigation

FastTTS - Advanced Text-to-Speech Application

Features

Installation

Prerequisites

Setup

Usage

Basic Usage

With MFA Support

Configuration

Testing

Project Structure

License

Contributing

Support

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages