stt-server

10 Commits 1 Branch 0 Tags

Author	SHA1	Message	Date
Mikael Lövqvist	a4fe95b24a	Merge pull request 'Add stt-server.py and fix setup scripts' (#1 ) from mikael-lovqvists-claude-agent/stt-server:stt-process into main Reviewed-on: #1	2026-06-07 08:51:22 +00:00
mikael-lovqvists-claude-agent	01210e878f	Add silero-vad to both venv setup scripts Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-07 08:48:59 +00:00
mikael-lovqvists-claude-agent	c0a72679f8	Add torch to both venv setup scripts Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-07 08:47:03 +00:00
mikael-lovqvists-claude-agent	2af47373c4	Add stt-server.py: self-contained recording + VAD + transcription process Replaces the old stdin/stdout transcription-only server. Now handles the full pipeline in Python: - Launches parec or arecord for mic capture - Runs Silero VAD (via silero-vad, already a faster-whisper dep — no sherpa-onnx needed) - Pre-roll ring buffer (0.2s) prepended to each segment for context - Transcribes with faster-whisper in a separate thread (GPU not blocking VAD) - Emits JSON line events to stdout: ready, vad_start, vad_end, transcript, error Event protocol is designed to map directly to WebSocket subscriptions later. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-07 08:41:45 +00:00
mikael-lovqvists-claude-agent	bbde89a2cc	Fix missing faster-whisper deps when using local ctranslate2 build --no-deps skipped av and other required packages. Fix by installing faster-whisper normally first (satisfies all deps, pulls PyPI ctranslate2), then immediately overriding ctranslate2 with the source-built version. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-07 08:28:42 +00:00
mikael-lovqvists-claude-agent	346c7c6585	Remove Arch Linux specific package suggestions Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-07 08:24:43 +00:00
mikael-lovqvists-claude-agent	3db7058646	Add setup-venv.sh, clean up setup-venv-local-build.sh setup-venv.sh: simple PyPI install path — just pip install faster-whisper. Use this when the PyPI ctranslate2 wheel matches the system CUDA version. setup-venv-local-build.sh: - PYTHON_ENV env var for venv path override (consistent with tts-server) - HF_TOKEN_FILE env var instead of hardcoded path - HF_HUB_CACHE env var surfaced in output when set - Remove stray chmod on faster-whisper-server.py (not part of this repo) - Remove voice-experiment-specific "run with" message - Add python3 to tool prerequisite check - Arch Linux package suggestions extended to cover CUDA and python - Document why each script exists and when to use which Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-07 08:22:40 +00:00
Mikael Lövqvist	c96bf0ecf5	Updated readme	2026-06-07 10:17:43 +02:00
Mikael Lövqvist	0b9f9121bd	Added gitignore	2026-06-07 10:15:22 +02:00
Mikael Lövqvist	4fa4baee17	Initial commit	2026-06-07 10:14:56 +02:00