hacksider-Deep-Live-Cam/modules at cbf085934780f486ad1cafd284916e3a474d9ba0 - hacksider-Deep-Live-Cam - MS-GitHub-Backup (Gitea)

CalvinBackup/hacksider-Deep-Live-Cam

mirror of https://github.com/hacksider/Deep-Live-Cam.git synced 2026-07-17 23:47:35 +02:00

Max Buckley cbf0859347 Paste-back blend: uint8 cv2 SIMD, no float32 round-trip

Both face_swapper._fast_paste_back and face_enhancer._paste_back were
doing a numpy float32 round-trip per frame: convert the target crop and
the warped face to float32, blend, clip, cast back to uint8. That's four
crop-sized allocations plus unvectorized elementwise math.

Replace with a fused uint8 blend using cv2.merge + cv2.multiply + cv2.add,
which cv2 dispatches to SIMD (NEON on Apple Silicon / AVX on x86). Stored
alpha templates switched from float32 [0, 1] to uint8 [0, 255] so no
conversion is needed per frame. CUDA paths also simplified — upload uint8
alpha (less bandwidth) and scale on device.

Micro-bench on 1000x1000 RGB crop:
  current (float32 numpy): 9.43 ms
  cv2 uint8 fused:         1.16 ms  (8.1× faster, max diff 2/255)

Visual diff is imperceptible (quantization noise in the last step).

2026-04-22 12:05:39 +02:00

__init__.py

Update __init__.py

2025-05-13 00:14:49 +08:00

capturer.py

GPU Accelerated OpenCV

2026-02-12 19:44:04 +08:00

cluster_analysis.py

Added ability to map faces

2024-09-10 05:40:55 +05:30

core.py

Apple Silicon + Windows CUDA perf: 60 FPS pipeline, cross-platform routing

2026-04-22 10:44:59 +02:00

custom_types.py

Version 2.0c Release!

2025-10-12 22:33:09 +08:00

face_analyser.py

Apple Silicon + Windows CUDA perf: 60 FPS pipeline, cross-platform routing

2026-04-22 10:44:59 +02:00

gettext.py

add mutil language

2025-01-07 14:04:18 +08:00

globals.py

feat: AMD DML optimization - GPU face detection, detection throttle, pre-load fix

2026-04-01 23:56:01 +08:00

gpu_processing.py

Apple Silicon + Windows CUDA perf: 60 FPS pipeline, cross-platform routing

2026-04-22 10:44:59 +02:00

metadata.py

ONNX CUDA exhaustive convolution search + IO binding

2026-04-09 16:34:27 +08:00

onnx_optimize.py

Apple Silicon + Windows CUDA perf: 60 FPS pipeline, cross-platform routing

2026-04-22 10:44:59 +02:00

paths.py

feat: add GPEN-BFR 256 and 512 ONNX face enhancers

2026-02-22 19:39:12 +02:00

platform_info.py

Apple Silicon + Windows CUDA perf: 60 FPS pipeline, cross-platform routing

2026-04-22 10:44:59 +02:00

predicter.py

GPU Accelerated OpenCV

2026-02-12 19:44:04 +08:00

run.py

Version 2.0c Release!

2025-10-12 22:33:09 +08:00

tkinter_fix.py

Version 2.0c Release!

2025-10-12 22:33:09 +08:00

typing.py

initial commit

2023-09-24 21:36:57 +08:00

ui_tooltip.py

feat(ui): add hover tooltips to all controls

2026-02-24 21:41:24 +02:00

ui.json

reverted to the old version

2024-09-19 17:38:02 +08:00

ui.py

Apple Silicon + Windows CUDA perf: 60 FPS pipeline, cross-platform routing

2026-04-22 10:44:59 +02:00

utilities.py

Rendering optimization

2026-04-09 16:25:22 +08:00

video_capture.py

Apple Silicon + Windows CUDA perf: 60 FPS pipeline, cross-platform routing

2026-04-22 10:44:59 +02:00