mikael-lovqvists-claude-agent/stt-server

Go to file

mikael-lovqvists-claude-agent 2af47373c4 Add stt-server.py: self-contained recording + VAD + transcription process

Replaces the old stdin/stdout transcription-only server. Now handles the
full pipeline in Python:
- Launches parec or arecord for mic capture
- Runs Silero VAD (via silero-vad, already a faster-whisper dep — no sherpa-onnx needed)
- Pre-roll ring buffer (0.2s) prepended to each segment for context
- Transcribes with faster-whisper in a separate thread (GPU not blocking VAD)
- Emits JSON line events to stdout: ready, vad_start, vad_end, transcript, error

Event protocol is designed to map directly to WebSocket subscriptions later.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-06-07 08:41:45 +00:00

.gitignore

Add stt-server.py: self-contained recording + VAD + transcription process

2026-06-07 08:41:45 +00:00

README.md

Updated readme

2026-06-07 10:17:43 +02:00

setup-venv-local-build.sh

Fix missing faster-whisper deps when using local ctranslate2 build

2026-06-07 08:28:42 +00:00

setup-venv.sh

Remove Arch Linux specific package suggestions