AGENTS.md - TTS Adapter Project Instructions

Project-specific agent instructions for TTS Adapter codebase.

Quick Routes

Build & Run:

Install: make install
Run server: make up (Docker) or uv run tts-server (local)
Test: make test or make test batch
Health check: make health

Architecture:

Overview: docs/architecture.md
API Reference: docs/api-reference.md
Web UI (i18n, settings, LAN): docs/web-ui.md
Qwen3 Engine: docs/engines/qwen3/README.md

Check universal rules: See ~/.claude/CLAUDE.md for global standards

Critical Project Rules

GPU Required

Qwen3-TTS requires CUDA GPU. No CPU fallback available.

RTX 4070 (12GB): Use 1.7B model with bf16
Lower VRAM: Use 0.6B model

Engine Protocol

All engines must implement TTSEngine protocol:

warmup() - Load model
synthesize(text, language, speaker, instruct) - Single text
synthesize_batch(texts, language, speaker, instruct) - Multiple texts

Configuration Precedence

Constructor args (programmatic)
Environment variables (.env file)
Engine defaults (hardcoded)

Project Structure

tts-adapter/
├── tts_adapter/
│   ├── __init__.py
│   ├── api/                 # FastAPI routes
│   │   ├── __init__.py
│   │   └── routes.py
│   ├── engines/             # TTS engines
│   │   ├── __init__.py      # Engine factory
│   │   └── qwen3.py         # Qwen3-TTS implementation
│   ├── cli.py               # CLI entry point
│   ├── config.py            # Global settings
│   ├── contract.py          # Request/response models
│   └── engine.py            # TTSEngine protocol
├── scripts/
│   └── tts_batch.py         # Batch CLI tool
├── docs/                    # Documentation
├── data/                    # Local data (git-ignored)
│   └── cache/               # HuggingFace model cache
├── Dockerfile               # Multi-stage GPU build
├── compose.yml              # Docker Compose
├── Makefile                 # Build/test automation
├── pyproject.toml           # uv config
└── uv.lock                  # Locked dependencies

Tech Stack

TTS: Qwen3-TTS (qwen-tts package)
API: FastAPI, uvicorn
Config: pydantic-settings
Audio: soundfile

Development Workflow

1. Local Development

# Install
make install
source .venv/bin/activate

# Copy and edit config
cp .env.example .env

# Run server
uv run tts-server

# Test
make test

2. Docker Development

# Build and start
make build
make up

# Check logs
make logs

# Test
make health
make test

3. API Usage

# Single generation
curl -X POST http://localhost:9880/tts \
  -H 'content-type: application/json' \
  -d '{"text":"Привет","language":"Russian","speaker":"Ryan"}' \
  --output out.wav

# Batch generation
curl -X POST http://localhost:9880/tts/batch \
  -H 'content-type: application/json' \
  -d '{"items":[{"id":"001","text":"Первая"},{"id":"002","text":"Вторая"}]}' \
  --output batch.zip

Environment Variables

Variable	Default	Description
`TTS_ENGINE`	`qwen3`	Engine name
`TTS_DEFAULT_SPEAKER`	-	Default speaker
`TTS_DEFAULT_LANGUAGE`	-	Default language
`TTS_HOST`	`0.0.0.0`	Server host
`TTS_PORT`	`9880`	Server port
`TTS_QWEN3_MODEL_ID`	`Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice`	Model
`TTS_QWEN3_DEVICE`	`cuda:0`	CUDA device
`TTS_QWEN3_DTYPE`	`bfloat16`	Data type

Adding New Engines

Create tts_adapter/engines/new_engine.py
Implement TTSEngine protocol
Add engine-specific Settings class with pydantic-settings
Register in engines/__init__.py (_ENGINES dict)
Document in docs/engines/new_engine.md

Git Workflow

Branches: Feature branches from develop
PRs: Target develop branch
See: ~/.claude/CLAUDE.md for universal git rules

References

Universal rules: ~/.claude/CLAUDE.md
Architecture: docs/architecture.md
API Reference: docs/api-reference.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AGENTS.md - TTS Adapter Project Instructions

Quick Routes

Critical Project Rules

GPU Required

Engine Protocol

Configuration Precedence

Project Structure

Tech Stack

Development Workflow

1. Local Development

2. Docker Development

3. API Usage

Environment Variables

Adding New Engines

Git Workflow

References

FilesExpand file tree

AGENTS.md

Latest commit

History

AGENTS.md

File metadata and controls

AGENTS.md - TTS Adapter Project Instructions

Quick Routes

Critical Project Rules

GPU Required

Engine Protocol

Configuration Precedence

Project Structure

Tech Stack

Development Workflow

1. Local Development

2. Docker Development

3. API Usage

Environment Variables

Adding New Engines

Git Workflow

References