Voice cloning using Qwen3-TTS-12Hz-1.7B-Base with combined reference samples.
```bash
uv init --python 3.12
uv add qwen-tts soundfile torch numpy
uv run python clone_voice.py
```

Generates speech using 2 combined reference samples for improved voice quality.
Performance: ~111s for 3 outputs on MPS (Apple Silicon)
Reference files follow the pattern `samples/ref_N` (without extension):

- `ref_1.wav` + `ref_1.txt`
- `ref_2.wav` + `ref_2.txt`
Add or remove references by editing the `ref_paths` list in `clone_voice.py`. References are concatenated with 1 s silence gaps.
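The concatenation step can be sketched as follows. This is an illustrative NumPy version, not the actual code in `clone_voice.py`; the sample rate and function name are assumptions.

```python
import numpy as np

SAMPLE_RATE = 16000  # assumed; use the actual rate of your reference WAVs


def concat_with_silence(clips, sr=SAMPLE_RATE, gap_s=1.0):
    """Join reference clips with gap_s seconds of silence between them
    (a sketch of how combined references could be built)."""
    silence = np.zeros(int(sr * gap_s), dtype=np.float32)
    parts = []
    for i, clip in enumerate(clips):
        if i > 0:
            parts.append(silence)  # 1 s gap between consecutive references
        parts.append(np.asarray(clip, dtype=np.float32))
    return np.concatenate(parts)
```

With `soundfile`, the clips could be loaded as `clips = [sf.read(f"samples/{p}.wav")[0] for p in ref_paths]` before combining.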
Generated audio samples:
- `output_0.mp4`: "I'm not the pheasant plucker, I'm the pheasant plucker's mate. I'm only plucking pheasants 'cause the pheasant plucker's running late."
- `output_1.mp4`: "Extremely accurate and stunningly beautiful bespoke printer profiles transform creative print making to an extraordinary extent."
- `output_2.mp4`: "Generating code from AI prompts can lead to verbose code, or duplication of existing code instead of using an abstraction. But there are times when this is perfectly acceptable, such as when building proof of concepts, or when topics like program efficiency are unimportant."
For fast realtime generation without re-processing reference audio:

```bash
# 1. Create voice model once
uv run python create_voice_prompt.py

# 2. Use for realtime generation
uv run python realtime_tts.py
```

Launch the Gradio web interface:

```bash
uv run python demo_app.py
```

Then open http://localhost:8000 in your browser.
See docs/realtime.md for details.
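The two-step flow above amounts to caching the expensive reference-audio processing to disk once and reloading it on later runs. A minimal sketch of that pattern, with an illustrative cache path and a `build_fn` standing in for whatever `create_voice_prompt.py` computes:

```python
import pickle
from pathlib import Path

PROMPT_PATH = Path("voice_prompt.pkl")  # hypothetical cache location


def load_or_build_prompt(build_fn, path=PROMPT_PATH):
    """Return the cached voice prompt if present; otherwise build it once
    and persist it so realtime runs can skip reference processing."""
    if path.exists():
        with path.open("rb") as f:
            return pickle.load(f)
    prompt = build_fn()  # expensive: processes the reference audio
    with path.open("wb") as f:
        pickle.dump(prompt, f)
    return prompt
```

On the second call the build function is never invoked, which is what makes the realtime path fast.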
See docs/configuration.md for generation parameters.
Edit `syn_texts` in `clone_voice.py` to customize synthesis text.
Location: ~/LLMs/Qwen3-TTS/Qwen3-TTS-12Hz-1.7B-Base
Device: Auto-detects MPS or CPU
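The device auto-detection boils down to a single preference check; in the scripts this is driven by `torch.backends.mps.is_available()`. A dependency-free sketch of that selection logic (function name is illustrative):

```python
def pick_device(mps_available: bool) -> str:
    """Prefer Apple Silicon's MPS backend when present, else fall back to CPU.
    In practice: pick_device(torch.backends.mps.is_available())."""
    return "mps" if mps_available else "cpu"
```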