forked from efforting.tech/tts-server
This branch is 1 commit behind efforting.tech/tts-server:main
Single port (TTS_PORT) handles both the WS upgrade handshake and connections. Adds job queue, generation worker, playback events (queued/started/finished/aborted/error), and abort_current/abort_all commands. Fixes BrokenPipeError when pacat is killed mid-write. Updates all examples to use WebSocket; adds abort-demo.mjs. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Text to voice interface
Overview
This project aims to provide text to voice with voice cloning ability. It is using chatterbox as backend.
Origin
This project started as a vibe-coded experiment but this version is somewhat more hands on.
Running
The quickest way to test this is to setup according to the instructions below and then use the example scripts under examples/.
Setup
Setup venv for python
Run setup-venv.sh.
Note
The default location is a directory called
venvthat is created next to the script, but you can override it by using the environment variablePYTHON_ENVto point to a different location.PYTHON_ENV='/some/path' ./setup-venv.sh
Environment
| Variable | Purpose |
|---|---|
HF_TOKEN_FILE |
Used to resolve a file for the HF_TOKEN secret that is used to download models from Hugging Face. If it is not set it defaults to ~/.secrets/hugging-face.token. |
HF_HUB_CACHE |
Location for hugging face model cache, defaults to ~/.cache/huggingface/hub. |
Description
Languages
Python
95.5%
Shell
4.5%