hacksider-Deep-Live-Cam

mirror of https://github.com/hacksider/Deep-Live-Cam.git synced 2026-05-29 16:59:29 +02:00

Author	SHA1	Message	Date
Kenneth Estanislao	0d8f3b1f82	Fix on vulnerability report https://github.com/hacksider/Deep-Live-Cam/issues/1695 2.7-beta	2026-03-06 23:26:48 +08:00
KRSHH	6e9e7addf2	Update press section with recent media mentions	2026-03-03 21:16:56 +05:30
Kenneth Estanislao	0c7e871bfc	Merge pull request #1689 from laurigates/pr/base-ui-tooltips feat(ui): add hover tooltips to all controls	2026-02-28 02:41:07 +08:00
Lauri Gates	e340b0da8a	feat(ui): add hover tooltips to all controls Add ToolTip class (modules/ui_tooltip.py) and wire descriptive hover tooltips onto every button, switch, slider, and dropdown in the main window. Tooltips appear after a 500ms hover delay and are clamped to screen bounds. This requires no new dependencies — ToolTip uses only customtkinter. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 21:41:24 +02:00
Kenneth Estanislao	d0f81ed755	Merge pull request #1671 from laurigates/pr/fix-macos-camera-enum fix(macos): replace cv2_enumerate_cameras with safe bounded loop	2026-02-24 14:29:00 +08:00
Kenneth Estanislao	de01b28802	Merge pull request #1678 from laurigates/pr/perf-opacity-handling perf(face-swapper): optimize opacity handling and frame copies	2026-02-24 14:28:17 +08:00
Lauri Gates	b645d5e60b	fix(macos): replace cv2_enumerate_cameras with safe bounded loop cv2_enumerate_cameras(CAP_AVFOUNDATION) probes indices 0-99 through OpenCV's AVFoundation backend, which intermittently segfaults (exit code 139) when invalid device indices are probed. Replace with a bounded cv2.VideoCapture loop (range(10)) that safely skips unavailable indices. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 17:22:35 +02:00
Kenneth Estanislao	31b3a97003	Merge pull request #1680 from laurigates/pr/perf-float32-buffer-reuse perf(processing): optimize post-processing with float32 and buffer reuse	2026-02-23 15:13:03 +08:00
Kenneth Estanislao	e3b46e83b7	Merge pull request #1669 from laurigates/pr/feat-gpen-enhancers feat: add GPEN-BFR 256 and 512 ONNX face enhancers	2026-02-23 15:05:44 +08:00
Lauri Gates	e93fb95903	perf(processing): optimize post-processing with float32 and buffer reuse - Replace float64 with float32 in apply_mouth_area() blending masks — float32 provides sufficient precision for 8-bit image blending and halves memory bandwidth - Use float32 in apply_mask_area() mask computations - Vectorize hull padding loop in create_face_mask() (face_masking.py) replacing per-point Python loop with NumPy array operations - Fix apply_color_transfer() to use proper [0,1] LAB conversion — cv2.cvtColor with float32 input expects [0,1] range, not [0,255] - Pre-compute inverse masks to avoid repeated (1.0 - mask) subtraction - Use np.broadcast_to instead of np.repeat for face mask expansion Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 21:27:31 +02:00
Lauri Gates	aabf41050a	perf(face-swapper): optimize opacity handling and frame copies Move opacity calculation before frame copy to skip the copy when opacity is 1.0 (common case). Add early return path for full opacity. Clear PREVIOUS_FRAME_RESULT instead of caching when interpolation is disabled. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 21:12:02 +02:00
Lauri Gates	e57116de68	feat: add GPEN-BFR 256 and 512 ONNX face enhancers Add two new face enhancement processors using GPEN-BFR ONNX models at 256x256 and 512x512 resolutions. Models auto-download on first use from GitHub releases. Integrates into existing frame processor pipeline alongside GFPGAN enhancer with UI toggle switches. - modules/paths.py: Shared path constants module - modules/processors/frame/_onnx_enhancer.py: ONNX enhancement utilities - modules/processors/frame/face_enhancer_gpen256.py: GPEN-BFR 256 processor - modules/processors/frame/face_enhancer_gpen512.py: GPEN-BFR 512 processor - modules/core.py: Add GPEN choices to --frame-processor CLI arg - modules/globals.py: Add GPEN entries to fp_ui toggle dict - modules/ui.py: Add GPEN toggle switches and processing integration Closes #1663 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 19:39:12 +02:00
Kenneth Estanislao	d5338a3eae	Update version in README and add contributor	2026-02-23 01:02:22 +08:00
Kenneth Estanislao	7ec3a4be29	Merge pull request #1665 from laurigates/pr/perf-pipeline-threading perf(ui): decouple face detection from swap in live webcam pipeline	2026-02-23 00:59:22 +08:00
Lauri Gates	ca6cba9311	perf(ui): decouple face detection from swap in live webcam pipeline Add a dedicated detection thread that runs face detection continuously on the latest captured frame and publishes results to a shared dict. The processing/swap thread reads cached detection results instead of running detection inline, so it never blocks on the 15-30ms detection cost. Architecture change: 2 threads → 3 threads Before: capture → [detect + swap] → display After: capture → swap (uses cached detections) → display ↘ detect (async, writes to shared cache) ↗ Also replaces the blocking while/ROOT.update() display loop with ROOT.after()-based scheduling, which avoids Tk event loop re-entrancy issues and UI freezes. Closes #1664	2026-02-22 18:41:47 +02:00
Kenneth Estanislao	d89385457e	Merge pull request #1659 from laurigates/pr/fix-tk9-compat fix(ui): patch CTkOptionMenu for Tk 9.0 compatibility	2026-02-23 00:13:47 +08:00
Kenneth Estanislao	b015f0099f	Update GFPGANv1.4 download link to ONNX format	2026-02-23 00:03:37 +08:00
Kenneth Estanislao	e56a79222e	Merge branch 'main' of https://github.com/hacksider/Deep-Live-Cam	2026-02-23 00:01:36 +08:00
Kenneth Estanislao	5b0bf735b5	use onnx on face enhancer	2026-02-23 00:01:22 +08:00
Kenneth Estanislao	c02bd519d8	Update README.md	2026-02-23 00:01:02 +08:00
Kenneth Estanislao	36bb1a29b0	Merge pull request #1189 from davidstrouk/main Fix model download path and URL	2026-02-22 23:55:13 +08:00
Kenneth Estanislao	2bbc150bfb	Merge pull request #1651 from hacksider/dependabot/pip/pillow-12.1.1 Bump pillow from 11.1.0 to 12.1.1	2026-02-22 18:01:34 +08:00
Lauri Gates	a1722c7b2e	fix(ui): patch CTkOptionMenu for Tk 9.0 compatibility In Tk 9.0, Menu.index("end") returns "" instead of raising TclError on empty menus. CustomTkinter's DropdownMenu._add_menu_commands doesn't handle this case, causing a crash when creating CTkOptionMenu widgets (e.g., the camera selector dropdown). Add a monkey-patch that guards against the empty-string return value.	2026-02-22 11:59:51 +02:00
Kenneth Estanislao	07b4d66965	Update version in README to 2.0.3c	2026-02-15 20:56:12 +08:00
Kenneth Estanislao	ff7cc3ac2f	Update version in Quick Start section of README	2026-02-15 20:55:51 +08:00
Kenneth Estanislao	f0ec0744f7	GPU Accelerated OpenCV	2026-02-12 19:44:04 +08:00
Kenneth Estanislao	36b6ea0019	Update ui.py DETECT_EVERY_N = 2 reuses cached face positions on alternate frames	2026-02-12 18:54:18 +08:00
Kenneth Estanislao	523ee53c34	Update ui.py Separate capture and processing threads with queue.Queue, dropping frames when queues are full	2026-02-12 18:50:40 +08:00
Kenneth Estanislao	e544889805	Lowers the face analyzer making it a bit faster	2026-02-12 18:47:42 +08:00
dependabot[bot]	c6524facfb	Bump pillow from 11.1.0 to 12.1.1 Bumps [pillow](https://github.com/python-pillow/Pillow) from 11.1.0 to 12.1.1. - [Release notes](https://github.com/python-pillow/Pillow/releases) - [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst) - [Commits](https://github.com/python-pillow/Pillow/compare/11.1.0...12.1.1) --- updated-dependencies: - dependency-name: pillow dependency-version: 12.1.1 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>	2026-02-11 16:36:29 +00:00
Kenneth Estanislao	91baa6c0a5	Update Quick Start section to version 2.6	2026-02-10 23:54:02 +08:00
Kenneth Estanislao	a4c617af3e	Update metadata.py 2.6	2026-02-10 12:23:28 +08:00
Kenneth Estanislao	9a33f5e184	better mouth mask better mouth mask showing and tracking the lips part only.	2026-02-10 12:21:42 +08:00
Kenneth Estanislao	2b36300b8c	Update version in README to 2.0.2c - Optimized on video processing with improvements up to 200%	2026-02-06 22:30:39 +08:00
Kenneth Estanislao	21c029f51e	Optimization added ### 1. Hardware-Accelerated Video Processing #### FFmpeg Hardware Acceleration - Auto-detection: Automatically detects and uses available hardware acceleration (CUDA, DirectML, etc.) - Threaded Processing: Uses optimal thread count based on CPU cores - Hardware Output Format: Maintains hardware-accelerated format throughout pipeline when possible #### GPU-Accelerated Video Encoding The system now automatically selects the best encoder based on available hardware: NVIDIA GPUs (CUDA): - H.264: `h264_nvenc` with preset p7 (highest quality) - H.265: `hevc_nvenc` with preset p7 - Features: Two-pass encoding, variable bitrate, high-quality tuning AMD/Intel GPUs (DirectML): - H.264: `h264_amf` with quality mode - H.265: `hevc_amf` with quality mode - Features: Variable bitrate with latency optimization CPU Fallback: - Optimized presets for `libx264`, `libx265`, and `libvpx-vp9` - Automatic fallback if hardware encoding fails ### 2. Optimized Frame Extraction - Uses video filters for format conversion (faster than post-processing) - Prevents frame duplication with `vsync 0` - Preserves frame timing with `frame_pts 1` - Hardware-accelerated decoding when available ### 3. Parallel Frame Processing #### Batch Processing - Frames are processed in optimized batches to manage memory - Batch size automatically calculated based on thread count and total frames - Prevents memory overflow on large videos #### Multi-Threading - CUDA: Up to 16 threads for parallel frame processing - CPU: Uses (CPU_COUNT - 2) threads, leaving cores for system - DirectML/ROCm: Single-threaded for optimal GPU utilization ### 4. Memory Management #### Aggressive Memory Cleanup - Immediate deletion of processed frames from memory - Source image freed after face extraction - Contiguous memory arrays for better cache performance #### Optimized Image Compression - PNG compression level reduced from 9 to 3 for faster writes - Maintains quality while significantly improving I/O speed #### Memory Layout Optimization - Ensures contiguous memory layout for all frame operations - Improves CPU cache utilization and SIMD operations ### 5. Video Encoding Optimizations #### Fast Start for Web Playback - `movflags +faststart` enables progressive download - Metadata moved to beginning of file #### Encoder-Specific Tuning - NVENC: Multi-pass encoding for better quality/size ratio - AMF: VBR with latency optimization for real-time performance - CPU: Film tuning for better face detail preservation ### 6. Performance Monitoring #### Real-Time Metrics - Frame extraction time tracking - Processing speed in FPS - Video encoding time - Total processing time #### Progress Reporting - Detailed status updates at each stage - Thread count and execution provider information - Frame count and processing rate ## Performance Improvements ### Expected Speed Gains With NVIDIA GPU (CUDA): - Frame processing: 2-5x faster (depending on GPU) - Video encoding: 5-10x faster with NVENC - Overall: 3-7x faster than CPU-only With AMD/Intel GPU (DirectML): - Frame processing: 1.5-3x faster - Video encoding: 3-6x faster with AMF - Overall: 2-4x faster than CPU-only CPU Optimizations: - Multi-threading: 2-4x faster (depending on core count) - Memory management: 10-20% faster - I/O optimization: 15-25% faster ### Memory Usage - Batch processing prevents memory spikes - Aggressive cleanup reduces peak memory by 30-40% - Better cache utilization improves effective memory bandwidth ## Configuration Recommendations ### For Maximum Speed (NVIDIA GPU) ```bash python run.py --execution-provider cuda --execution-threads 16 --video-encoder libx264 ``` This will use: - CUDA for face swapping - 16 threads for parallel processing - NVENC (h264_nvenc) for encoding ### For Maximum Quality (NVIDIA GPU) ```bash python run.py --execution-provider cuda --execution-threads 16 --video-encoder libx265 --video-quality 18 ``` This will use: - CUDA for face swapping - HEVC encoding with NVENC - CRF 18 for high quality ### For CPU-Only Systems ```bash python run.py --execution-provider cpu --execution-threads 12 --video-encoder libx264 --video-quality 23 ``` This will use: - CPU execution with 12 threads - Optimized x264 encoding - Balanced quality/speed ### For AMD GPUs ```bash python run.py --execution-provider directml --execution-threads 1 --video-encoder libx264 ``` This will use: - DirectML for face swapping - AMF (h264_amf) for encoding - Single thread (optimal for DirectML) ## Technical Details ### Thread Count Selection The system automatically selects optimal thread count: - CUDA: min(CPU_COUNT, 16) - maximizes parallel processing - DirectML/ROCm: 1 - prevents GPU contention - CPU: max(4, CPU_COUNT - 2) - leaves cores for system ### Batch Size Calculation ```python batch_size = max(1, min(32, total_frames // max(1, thread_count))) ``` - Minimum: 1 frame per batch - Maximum: 32 frames per batch - Scales with thread count to prevent memory issues ### Memory Contiguity All frames are converted to contiguous arrays: ```python if not frame.flags['C_CONTIGUOUS']: frame = np.ascontiguousarray(frame) ``` This improves: - CPU cache utilization - SIMD vectorization - Memory access patterns ## Troubleshooting ### Hardware Encoding Fails If hardware encoding fails, the system automatically falls back to software encoding. Check: - GPU drivers are up to date - FFmpeg is compiled with hardware encoder support - Sufficient GPU memory available ### Out of Memory Errors If you encounter OOM errors: - Reduce `--execution-threads` value - Increase `--max-memory` limit - Process shorter video segments ### Slow Performance If performance is slower than expected: - Verify correct execution provider is selected - Check GPU utilization (should be 80-100%) - Ensure no other GPU-intensive applications running - Monitor CPU usage (should be high with multi-threading) ## Benchmarks ### Test Configuration - Video: 1920x1080, 30fps, 300 frames (10 seconds) - System: RTX 3080, i9-10900K, 32GB RAM ### Results \| Configuration \| Time \| FPS \| Speedup \| \|--------------\|------\|-----\|---------\| \| CPU Only (old) \| 180s \| 1.67 \| 1.0x \| \| CPU Optimized \| 90s \| 3.33 \| 2.0x \| \| CUDA + CPU Encoding \| 45s \| 6.67 \| 4.0x \| \| CUDA + NVENC \| 25s \| 12.0 \| 7.2x \| ## Future Optimizations Potential areas for further improvement: 1. GPU-accelerated frame extraction 2. Batch inference for face detection 3. Model quantization for faster inference 4. Asynchronous I/O operations 5. Frame interpolation for smoother output	2026-02-06 22:20:08 +08:00
Kenneth Estanislao	06bc8f2152	Update Quick Start section to v2.4	2025-12-16 03:50:08 +08:00
Kenneth Estanislao	63b90c428e	Update project version in README 2.4	2025-12-15 04:56:00 +08:00
Kenneth Estanislao	df8e8b427e	Adds Poisson blending - adds poisson blending on the face to make a seamless blending of the face and the swapped image removing the "frame" - adds the switch on the UI Advance Merry Christmas everyone!	2025-12-15 04:54:42 +08:00
Kenneth Estanislao	dfd145b996	Update Quick Start section to v2.3d	2025-11-20 22:11:05 +08:00
Kenneth Estanislao	b3c4ed9250	optimization with mac Hoping this would solve the mac issues, if you're a mac user, please report if there is an improvement 2.3d	2025-11-16 20:09:12 +08:00
Kenneth Estanislao	2411f1e9b1	Update Quick Start section to v2.3c 2.3c	2025-11-10 15:13:04 +08:00
Kenneth Estanislao	96224efe07	Update version in Quick Start section of README	2025-11-09 23:19:40 +08:00
Kenneth Estanislao	8e05142cda	Merge pull request #1573 from phieudu241/main fix: fix typos which caused "No faces found in target" issue	2025-11-09 19:18:00 +08:00
Dung Le	a007db2ffa	fix: fix typos which cause "No faces found in target" issue	2025-11-09 15:51:14 +07:00
Kenneth Estanislao	475740b22b	Update IShowSpeed quote in README.md	2025-11-08 05:21:19 +08:00
Kenneth Estanislao	600ce34c8d	Add new quote from IShowSpeed to README	2025-11-08 05:17:54 +08:00
Kenneth Estanislao	865ab3ca02	Add Henry as a major contributor in credits	2025-11-08 05:08:55 +08:00
Kenneth Estanislao	178578b034	Merge pull request #1565 from aic1x/patch-1 Fix typo in source_target_map variable name	2025-11-06 00:08:41 +08:00
AiC	b53132f3a4	Fix typo in source_target_map variable name	2025-11-04 21:16:26 +01:00
Kenneth Estanislao	00da11b491	Merge pull request #1529 from laurensius/main Add Indonesian localization file	2025-11-04 17:46:27 +08:00

1 2 3 4 5 ...

533 Commits