1.4 KiB
1.4 KiB
Text to voice interface
Overview
This project aims to provide text to voice with voice cloning ability. It is using chatterbox as backend.
Origin
This project started as a vibe-coded experiment but this version is somewhat more hands on.
Running
The quickest way to test this is to setup according to the instructions below and then use the example scripts under examples/.
Setup
Setup venv for python
Run setup-venv.sh.
Note
The default location is a directory called
venvthat is created next to the script, but you can override it by using the environment variablePYTHON_ENVto point to a different location.PYTHON_ENV='/some/path' ./setup-venv.sh
Environment
| Variable | Purpose |
|---|---|
HF_TOKEN_FILE |
Used to resolve a file for the HF_TOKEN secret that is used to download models from Hugging Face. If it is not set it defaults to ~/.secrets/hugging-face.token. |
HF_HUB_CACHE |
Location for hugging face model cache, defaults to ~/.cache/huggingface/hub. |