Add ffmpeg as codec backend; extend codec ID table with archival formats

0x01xx range reserved for ffmpeg-backed formats (H.265, AV1, FFV1, ProRes). Documents libavcodec vs subprocess trade-offs: subprocess suits archival completeness paths, libavcodec suits low-latency encode. Receiver only cares about wire format, not which encoder produced it. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-25 22:47:06 +00:00
parent 44a3326a76
commit 8260d456aa
1 changed files with 27 additions and 1 deletions
--- a/architecture.md
+++ b/architecture.md
@@ -256,8 +256,12 @@ Receivers must know what format a frame payload is in. This is communicated at s
 | `0x0002` | QOI — lossless, single-header implementation, fast; good for screen content |
 | `0x0003` | Raw pixels + ZSTD — lossless; raw BGRA/RGBA compressed with ZSTD at a low level |
 | `0x0004` | H.264 intra — single I-frames via VA-API hardware encode; high compression, GPU required |
 | `0x0100` | H.265 / HEVC — via ffmpeg (libavcodec or subprocess); hardware or software encode |
 | `0x0101` | AV1 — via ffmpeg; best compression, hardware encode on modern GPUs |
 | `0x0102` | FFV1 — via ffmpeg; lossless archival format |
 | `0x0103` | ProRes — via ffmpeg; near-lossless, post-production compatible |
-V4L2 camera streams typically arrive pre-encoded as MJPEG from hardware; no encode step is needed on that path. The codec module is primarily used by the screen grab source.
+V4L2 camera streams typically arrive pre-encoded as MJPEG from hardware; no encode step is needed on that path. The `0x01xx` range is reserved for ffmpeg-backed formats; the receiver cares only about the wire format, not which encoder produced it.
 ### Format Negotiation
@@ -279,6 +283,28 @@ ZSTD at compression level 1 is extremely fast and can achieve meaningful ratios
 Intra-only H.264 via VA-API gives very high compression with GPU offload. This is the most complex option to set up and introduces a GPU dependency, but may be worthwhile for high-resolution grabs over constrained links. Deferred until simpler codecs are validated.
 ### ffmpeg Backend
 ffmpeg (via libavcodec or subprocess) is a practical escape hatch that gives access to a large number of codecs, container formats, and hardware acceleration paths without implementing them from scratch. It is particularly useful for archival formats where the encode latency of a more complex codec is acceptable.
 **Integration options:**
 - **libavcodec** — link directly against the library; programmatic API, tight integration, same process; introduces a large build dependency but gives full control over codec parameters and hardware acceleration (NVENC, VA-API, VideoToolbox, etc.)
 - **subprocess pipe** — spawn `ffmpeg`, pipe raw frames to stdin, read encoded output from stdout; simpler, no build dependency, more isolated from the rest of the node process; latency is higher due to process overhead but acceptable for archival paths where real-time delivery is not required
 The subprocess approach fits naturally into the completeness output path of the relay: frames arrive in order, there is no real-time drop pressure, and the ffmpeg process can be restarted independently if it crashes without taking down the node. libavcodec is the better fit for low-latency encoding (e.g. screen grab over a constrained link).
 **Archival formats of interest:**
 | Format | Notes |
 |---|---|
 | H.265 / HEVC | ~50% better compression than H.264 at same quality; NVENC and VA-API hardware support widely available |
 | AV1 | Best open-format compression; software encode is slow, hardware encode (AV1 NVENC on RTX 30+) is fast |
 | FFV1 | Lossless, designed for archival; good compression for video content; the format used by film archives |
 | ProRes | Near-lossless, widely accepted in post-production toolchains; large files but easy to edit downstream |
 The codec identifier table uses the `0x01xx` range for ffmpeg-backed formats to distinguish them from native implementations. The actual format is fixed at stream open time via `stream_open` — the receiver does not need to know whether the encoder is libavcodec or a native implementation, only what the wire format is.
 ---
 ## X11 / Xorg Integration