From 52fb2ad7223dd3d4df1603b757bd5cb30eabba0e Mon Sep 17 00:00:00 2001 From: Evgeny Demchenko Date: Tue, 19 May 2026 19:22:53 +0100 Subject: [PATCH] benchmarks: actual measured VRAM + network bandwidth (tcpdump-based) MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit VRAM breakdown (nvidia-smi pmon): - 4 publishers = 4.4 GB (FHD + 2688x1520 ring buffers + NVDEC) - cctv-backend = 1.0 GB - frigate embeddings_manager = 1.6 GB - frigate detector:onnx = 0.6 GB - Total cuframes-stack = ~7.7 GB Network (10-sec tcpdump capture от camera subnet к R9): - Measured: 31.5 Mbps (всё включая go2rtc on-demand, ONVIF) - cuframes core: ~16 Mbps (4 publishers × main HEVC) - ONVIF/RTSP keepalives: ~1-2 Mbps - Без cuframes setup тех же 4 cam × 3 consumer был бы ~45-50 Mbps Source: production deploy 2026-05-19 measurement. --- BENCHMARKS.md | 29 ++++++++++++++++++++++++++++- 1 file changed, 28 insertions(+), 1 deletion(-) diff --git a/BENCHMARKS.md b/BENCHMARKS.md index 0cd89c7..c72124e 100644 --- a/BENCHMARKS.md +++ b/BENCHMARKS.md @@ -143,9 +143,36 @@ Production деplo замеры — см. интеграционные guides: GPU SM: 4-5% (compute: detector + cuframes consumers) GPU NVDEC: 2-4% (без cuframes ожидаемо было 15-25%) GPU NVENC: 0-1% -VRAM: 4 publishers × ~1 GB ring buffers + consumer contexts ``` +### VRAM breakdown (measured) + +| Component | VRAM | +|---|---:| +| 4× cuframes publishers (3× FHD ring + 1× 2688×1520 для LPR) | **4.4 GB** | +| cctv-backend (composer + grid output) | 1.0 GB | +| frigate.embeddings_manager (face + LPR ONNX models) | 1.6 GB | +| frigate.detector:onnx (D-FINE-S COCO) | 0.6 GB | +| **Total cuframes-stack VRAM** | **~7.7 GB** | + +Из них на сам cuframes accounting — только **4.4 GB** в publishers (ring buffers + +NVDEC decode buffers). Consumers (Frigate, cctv-backend) держат свои CUDA +contexts независимо. + +### Network bandwidth (real tcpdump, 10-sec sample) + +**31.5 Mbps** от camera subnet (4 cameras → R9), измерено через +`tcpdump -w cam-traffic.pcap` за 10 секунд. + +Breakdown approximate: +- 4 publishers × main HEVC RTP/UDP: **~16 Mbps** (cuframes core) +- go2rtc on-demand streams (Frigate UI live preview, если открыт): **0-10 Mbps** +- ONVIF discovery, RTSP keepalives, NTP-from-cameras: **~1-2 Mbps** + +Без cuframes тот же setup (cctv-backend + Frigate detect + Frigate record × 4 +camera) дал бы **~45-50 Mbps** (главное: record path забирал отдельный +main stream от каждой camera). + ### Camera-side benefits Dahua/Hikvision камеры обычно cap'нуты на 4-5 одновременных RTSP streams.