Text to voice interface

Overview

This project aims to provide text to voice with voice cloning ability. It is using chatterbox as backend.

Origin

This project started as a vibe-coded experiment but this version is somewhat more hands on.

Setup

Setup venv for python

Run setup-venv.sh.

Note

The default location is a directory called venv that is created next to the script, but you can override it by using the environment variable PYTHON_ENV to point to a different location.
PYTHON_ENV='/some/path' ./setup-venv.sh

Environment

Variable	Purpose
`HF_TOKEN_FILE`	Used to resolve a file for the `HF_TOKEN` secret that is used to download models from Hugging Face. If it is not set it defaults to `~/.secrets/hugging-face.token`.
`HF_HUB_CACHE`	Location for hugging face model cache, defaults to `~/.cache/huggingface/hub`.

1.2 KiB Raw Blame History

Text to voice interface

Overview

Origin

Setup

Setup venv for python

Environment

1.2 KiB

Raw Blame History