hacksider-Deep-Live-Cam

mirror of https://github.com/hacksider/Deep-Live-Cam.git synced 2026-04-24 10:06:03 +02:00

Author	SHA1	Message	Date
Kenneth Estanislao	a4c617af3e	Update metadata.py	2026-02-10 12:23:28 +08:00
Kenneth Estanislao	9a33f5e184	better mouth mask better mouth mask showing and tracking the lips part only.	2026-02-10 12:21:42 +08:00
Kenneth Estanislao	21c029f51e	Optimization added ### 1. Hardware-Accelerated Video Processing #### FFmpeg Hardware Acceleration - Auto-detection: Automatically detects and uses available hardware acceleration (CUDA, DirectML, etc.) - Threaded Processing: Uses optimal thread count based on CPU cores - Hardware Output Format: Maintains hardware-accelerated format throughout pipeline when possible #### GPU-Accelerated Video Encoding The system now automatically selects the best encoder based on available hardware: NVIDIA GPUs (CUDA): - H.264: `h264_nvenc` with preset p7 (highest quality) - H.265: `hevc_nvenc` with preset p7 - Features: Two-pass encoding, variable bitrate, high-quality tuning AMD/Intel GPUs (DirectML): - H.264: `h264_amf` with quality mode - H.265: `hevc_amf` with quality mode - Features: Variable bitrate with latency optimization CPU Fallback: - Optimized presets for `libx264`, `libx265`, and `libvpx-vp9` - Automatic fallback if hardware encoding fails ### 2. Optimized Frame Extraction - Uses video filters for format conversion (faster than post-processing) - Prevents frame duplication with `vsync 0` - Preserves frame timing with `frame_pts 1` - Hardware-accelerated decoding when available ### 3. Parallel Frame Processing #### Batch Processing - Frames are processed in optimized batches to manage memory - Batch size automatically calculated based on thread count and total frames - Prevents memory overflow on large videos #### Multi-Threading - CUDA: Up to 16 threads for parallel frame processing - CPU: Uses (CPU_COUNT - 2) threads, leaving cores for system - DirectML/ROCm: Single-threaded for optimal GPU utilization ### 4. Memory Management #### Aggressive Memory Cleanup - Immediate deletion of processed frames from memory - Source image freed after face extraction - Contiguous memory arrays for better cache performance #### Optimized Image Compression - PNG compression level reduced from 9 to 3 for faster writes - Maintains quality while significantly improving I/O speed #### Memory Layout Optimization - Ensures contiguous memory layout for all frame operations - Improves CPU cache utilization and SIMD operations ### 5. Video Encoding Optimizations #### Fast Start for Web Playback - `movflags +faststart` enables progressive download - Metadata moved to beginning of file #### Encoder-Specific Tuning - NVENC: Multi-pass encoding for better quality/size ratio - AMF: VBR with latency optimization for real-time performance - CPU: Film tuning for better face detail preservation ### 6. Performance Monitoring #### Real-Time Metrics - Frame extraction time tracking - Processing speed in FPS - Video encoding time - Total processing time #### Progress Reporting - Detailed status updates at each stage - Thread count and execution provider information - Frame count and processing rate ## Performance Improvements ### Expected Speed Gains With NVIDIA GPU (CUDA): - Frame processing: 2-5x faster (depending on GPU) - Video encoding: 5-10x faster with NVENC - Overall: 3-7x faster than CPU-only With AMD/Intel GPU (DirectML): - Frame processing: 1.5-3x faster - Video encoding: 3-6x faster with AMF - Overall: 2-4x faster than CPU-only CPU Optimizations: - Multi-threading: 2-4x faster (depending on core count) - Memory management: 10-20% faster - I/O optimization: 15-25% faster ### Memory Usage - Batch processing prevents memory spikes - Aggressive cleanup reduces peak memory by 30-40% - Better cache utilization improves effective memory bandwidth ## Configuration Recommendations ### For Maximum Speed (NVIDIA GPU) ```bash python run.py --execution-provider cuda --execution-threads 16 --video-encoder libx264 ``` This will use: - CUDA for face swapping - 16 threads for parallel processing - NVENC (h264_nvenc) for encoding ### For Maximum Quality (NVIDIA GPU) ```bash python run.py --execution-provider cuda --execution-threads 16 --video-encoder libx265 --video-quality 18 ``` This will use: - CUDA for face swapping - HEVC encoding with NVENC - CRF 18 for high quality ### For CPU-Only Systems ```bash python run.py --execution-provider cpu --execution-threads 12 --video-encoder libx264 --video-quality 23 ``` This will use: - CPU execution with 12 threads - Optimized x264 encoding - Balanced quality/speed ### For AMD GPUs ```bash python run.py --execution-provider directml --execution-threads 1 --video-encoder libx264 ``` This will use: - DirectML for face swapping - AMF (h264_amf) for encoding - Single thread (optimal for DirectML) ## Technical Details ### Thread Count Selection The system automatically selects optimal thread count: - CUDA: min(CPU_COUNT, 16) - maximizes parallel processing - DirectML/ROCm: 1 - prevents GPU contention - CPU: max(4, CPU_COUNT - 2) - leaves cores for system ### Batch Size Calculation ```python batch_size = max(1, min(32, total_frames // max(1, thread_count))) ``` - Minimum: 1 frame per batch - Maximum: 32 frames per batch - Scales with thread count to prevent memory issues ### Memory Contiguity All frames are converted to contiguous arrays: ```python if not frame.flags['C_CONTIGUOUS']: frame = np.ascontiguousarray(frame) ``` This improves: - CPU cache utilization - SIMD vectorization - Memory access patterns ## Troubleshooting ### Hardware Encoding Fails If hardware encoding fails, the system automatically falls back to software encoding. Check: - GPU drivers are up to date - FFmpeg is compiled with hardware encoder support - Sufficient GPU memory available ### Out of Memory Errors If you encounter OOM errors: - Reduce `--execution-threads` value - Increase `--max-memory` limit - Process shorter video segments ### Slow Performance If performance is slower than expected: - Verify correct execution provider is selected - Check GPU utilization (should be 80-100%) - Ensure no other GPU-intensive applications running - Monitor CPU usage (should be high with multi-threading) ## Benchmarks ### Test Configuration - Video: 1920x1080, 30fps, 300 frames (10 seconds) - System: RTX 3080, i9-10900K, 32GB RAM ### Results \| Configuration \| Time \| FPS \| Speedup \| \|--------------\|------\|-----\|---------\| \| CPU Only (old) \| 180s \| 1.67 \| 1.0x \| \| CPU Optimized \| 90s \| 3.33 \| 2.0x \| \| CUDA + CPU Encoding \| 45s \| 6.67 \| 4.0x \| \| CUDA + NVENC \| 25s \| 12.0 \| 7.2x \| ## Future Optimizations Potential areas for further improvement: 1. GPU-accelerated frame extraction 2. Batch inference for face detection 3. Model quantization for faster inference 4. Asynchronous I/O operations 5. Frame interpolation for smoother output	2026-02-06 22:20:08 +08:00
Kenneth Estanislao	df8e8b427e	Adds Poisson blending - adds poisson blending on the face to make a seamless blending of the face and the swapped image removing the "frame" - adds the switch on the UI Advance Merry Christmas everyone!	2025-12-15 04:54:42 +08:00
Kenneth Estanislao	b3c4ed9250	optimization with mac Hoping this would solve the mac issues, if you're a mac user, please report if there is an improvement	2025-11-16 20:09:12 +08:00
Dung Le	a007db2ffa	fix: fix typos which cause "No faces found in target" issue	2025-11-09 15:51:14 +07:00
AiC	b53132f3a4	Fix typo in source_target_map variable name	2025-11-04 21:16:26 +01:00
Kenneth Estanislao	b82fdc3f31	Update face_swapper.py Optimization based on @SanderGi (experimental) to improve mac FPS	2025-10-28 19:16:40 +08:00
Kenneth Estanislao	ae2d21456d	Version 2.0c Release! Sharpness and some other improvements added!	2025-10-12 22:33:09 +08:00
Kenneth Estanislao	d0d90ecc03	Creating a fallback and switching of models Models switch depending on the execution provider	2025-08-02 02:56:20 +08:00
Teo Jia Cheng	9690070399	Update __init__.py	2025-05-13 00:14:49 +08:00
Kenneth Estanislao	f3e83b985c	Merge pull request #1210 from KunjShah01/main Update __init__.py	2025-05-12 15:14:58 +08:00
Gordon Böer	bdbd7dcfbc	fix typos in ui.py	2025-05-07 13:23:31 +02:00
KUNJ SHAH	a64940def7	update	2025-05-05 13:19:46 +00:00
KUNJ SHAH	fe4a87e8f2	update	2025-05-05 13:19:29 +00:00
KUNJ SHAH	9ecd2dab83	changes	2025-05-05 13:10:00 +00:00
KUNJ SHAH	c9f36eb350	Update __init__.py	2025-05-05 18:29:44 +05:30
NeuroDonu	890beb0eae	fix & add trt support	2025-04-19 16:03:49 +03:00
NeuroDonu	75b5b096d6	fix	2025-04-19 16:03:24 +03:00
Kenneth Estanislao	01900dcfb5	Revert "Update metadata.py" This reverts commit `90d5c28542`.	2025-04-17 02:39:05 +08:00
Kenneth Estanislao	07e30fe781	Revert "Update face_swapper.py" This reverts commit `104d8cf4d6`.	2025-04-17 02:03:34 +08:00
Kenneth Estanislao	90d5c28542	Update metadata.py - 40% faster than 1.8 - compatible with 50xx GPU - onnxruntime 1.21	2025-04-13 03:34:10 +08:00
Kenneth Estanislao	104d8cf4d6	Update face_swapper.py compatibility with inswapper 1.21	2025-04-13 01:13:40 +08:00
Adrian Zimbran	c728994e6b	fixed import and log message	2025-03-10 23:41:28 +02:00
Adrian Zimbran	65da3be2a4	Fix face swapping crash due to None face embeddings - Add explicit checks for face detection results (source and target faces). - Handle cases when face embeddings are not available, preventing AttributeError. - Provide meaningful log messages for easier debugging in future scenarios.	2025-03-10 23:31:56 +02:00
Kenneth Estanislao	28c4b34db1	Merge pull request #911 from nimishgautam/main Fix cv2 size errors on first run in ui.py	2025-02-05 12:51:39 +08:00
Soul Lee	513e413956	fix: typo souce_target_map → source_target_map	2025-02-03 20:33:44 +09:00
Nimish Gåtam	ccc04983cf	Update ui.py removed unnecessary code as per AI code review (which is a thing now because of course it is)	2025-02-01 12:38:37 +01:00
Nimish Gåtam	2506c5a261	Update ui.py Some checks for first run when models are missing, so it doesn't error out with inv_scale_x > 0 in cv2	2025-02-01 11:52:49 +01:00
qitian	5132f86cdc	add mutil language	2025-01-07 14:04:18 +08:00
Makaru	d07d4a6a26	Update ui.py I pushed it to premain	2025-01-07 01:15:05 +08:00
Kenneth Estanislao	b38831dfdf	Revert "Merge pull request #868 from kier007/main" This reverts commit `c03f697729`, reversing changes made to `d8a5cdbc19`.	2025-01-06 14:14:21 +08:00
Kenneth Estanislao	b518f4337d	Revert "Merge pull request #869 from kier007/patch-1" This reverts commit `b38ef62447`, reversing changes made to `c03f697729`.	2025-01-06 14:14:04 +08:00
Kenneth Estanislao	28513d6c1f	Update metadata.py	2025-01-06 00:27:45 +08:00
Makaru	a3469b7bd4	Update ui.py Added: - If you happen to turn off the map faces switch while the Source x Target Mapper window is open, the Source x Target Mapper window will close.	2025-01-06 00:10:53 +08:00
Makaru	742bcab130	Update ui.py Added: - try-finally Block: This makes sure the camera.release() is called no matter how the while loops end. - Resource Cleanup: The finally block takes care of cleaning up resources to keep the application stable.	2025-01-05 20:19:36 +08:00
Makaru	22940d1b99	Update ui.py The following changes have been implemented: -A "clear" button has been incorporated. -The Source x Target Mapper window has been retained following the submission of data via the "submit" button.	2025-01-05 18:29:01 +08:00
KRSHH	8be7368949	Added URL to official website	2024-12-30 15:51:46 +05:30
Mehdi Mousavi	81da9a23ca	Fix mouth mask description	2024-12-24 09:51:32 +03:30
Mehdi Mousavi	007867a6f6	Add support for --mouth-mask argument	2024-12-24 09:40:06 +03:30
KRSHH	b17e52dea2	Mac Webcam Serial No. Management	2024-12-23 22:45:41 +05:30
KRSHH	77c19d1073	FaceTime Camera Index to 0	2024-12-23 14:58:43 +05:30
Pedro Santos	7472dfb694	fix: add match statement Added for optimization Co-Authored-By: Zephira <zephira58@protonmail.com>	2024-12-23 06:29:36 +00:00
Pedro Santos	41c6916273	Revert "Update face_enhancer.py" This reverts commit `ed7a21687c`.	2024-12-23 06:08:45 +00:00
Kenneth Estanislao	ed7a21687c	Update face_enhancer.py change if from before statement to elif, also fix conditional ladder	2024-12-23 12:45:53 +08:00
KRSHH	5ce991651d	Formatting Moved Windows only modules, to top too.	2024-12-23 09:46:59 +05:30
KRSHH	432984b3b6	Mac Fix Pygrabber Module import only on windows	2024-12-23 09:41:17 +05:30
KRSHH	76b94ac034	Changed Metadata to GitHub Edition	2024-12-22 18:28:38 +05:30
KRSHH	84ca1dc2f2	Make Face Enhancer Model device Conditional Added Co-Author Co-Authored-By: Rishon <rishon@rishon.me>	2024-12-19 21:18:28 +05:30
KRSHH	681c20dbbd	Revert "Make Face Enhancer Model device Conditional" This reverts commit `c240f6e31c`.	2024-12-19 21:16:56 +05:30

1 2 3

142 Commits