hacksider-Deep-Live-Cam

mirror of https://github.com/hacksider/Deep-Live-Cam.git synced 2026-06-03 11:08:01 +02:00

Author	SHA1	Message	Date
Olivier Booklage	682450755f	Avoid duplicating LD_LIBRARY_PATH entries. Skip prepending a directory that is already on LD_LIBRARY_PATH, so a repeated import of run.py does not bloat the variable. Addresses review feedback on #1826.	2026-05-16 14:57:02 +02:00
Olivier Booklage	12a3f6a007	Pre-load NVIDIA shared libraries on Linux. Mirrors the Windows preload block from #1775. When onnxruntime-gpu is installed via pip with nvidia-cudnn-cu12, the .so files sit under venv/lib/pythonX.Y/site-packages/nvidia/<pkg>/lib/ and the dynamic linker never sees them. Setting LD_LIBRARY_PATH after Python starts has no effect on the running process, so this pre-loads every lib.so via ctypes.CDLL with RTLD_GLOBAL before onnxruntime opens its CUDA provider. Also extends LD_LIBRARY_PATH so child processes (ffmpeg) inherit the path. Fixes "libcudnn.so.9: cannot open shared object file" on pip-only Linux installs.	2026-05-16 14:45:54 +02:00
Kenneth Estanislao	81a1986ef8	Changed to PyQt UI. Standardizes the UI from the quickstart version to the GitHub version.	2026-05-15 16:33:27 +08:00
Kenneth Estanislao	9c5f01c7f1	some fix for face enhancers	2026-05-15 15:13:57 +08:00
Max Buckley	f65aeae5db	Apple Silicon + Windows CUDA perf: 60 FPS pipeline, cross-platform routing. Bundles CoreML graph rewrites, GPU-accelerated pipeline work, Windows CUDA fixes, and Mac/Windows runtime routing into a single drop. CoreML (Apple Silicon): (1) Decompose Pad(reflect) → Slice+Concat in inswapper_128 so the model runs in one CoreML partition instead of 14 (TEMPORARY: fixed upstream in microsoft/onnxruntime#28073, drop when ORT >= 1.26.0). (2) Fold Shape/Gather chains to constants in det_10g (21 ms → 4 ms). (3) Decompose Split(axis=1) → Slice pairs in GFPGAN (155 ms → 89 ms). (4) Route detection model to GPU so the ANE is free for the swap model. (5) Centralize provider/config selection in create_onnx_session. Pipeline (all platforms): (1) Parallelize face landmark + recognition post-detection; skip landmark_2d_106 when only face_swapper is active. (2) Pipeline face detection with swap for ANE overlap. (3) GPU-accelerated paste_back, MJPEG capture, zero-copy display path. (4) Standalone pipeline benchmark script. Windows / CUDA: (1) CUDA graphs + FP16 model + all-GPU pipeline for 1080p 60 FPS. (2) Auto-detect GPU provider and fix DLL discovery for Windows CUDA execution. Cross-platform: (1) platform_info helper for Mac/Windows runtime routing. (2) GFPGAN 30 fps + MSMF camera 60 fps with adaptive pipeline tuning. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-22 10:44:59 +02:00
RohanW11p	9207386e07	Switch to FP32 model by default, add run script. Change the default face swapper model to FP32 for better GPU compatibility and to avoid NaN issues on certain GPUs. Revamp `run.py` to adjust PATH variables for dependency setup, and re-add it with expanded configuration.	2026-03-27 17:29:01 +05:30
Kenneth Estanislao	ae2d21456d	Version 2.0c Release! Sharpness and some other improvements added!	2025-10-12 22:33:09 +08:00
Kenneth Estanislao	e616245e3d	initial commit rebranding everything	2023-09-24 21:36:57 +08:00

8 Commits
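
The two most recent commits (12a3f6a007 and 682450755f) describe pre-loading NVIDIA shared libraries on Linux and keeping LD_LIBRARY_PATH free of duplicate entries. The following is a minimal sketch of that idea, not the code from run.py. The function names (`prepend_unique`, `find_nvidia_lib_dirs`, `preload_nvidia_libs`) and the `lib*.so*` glob pattern are assumptions made for illustration; only `ctypes.CDLL`, `RTLD_GLOBAL`, and the `site-packages/nvidia/<pkg>/lib/` layout come from the commit messages.

```python
import ctypes
import glob
import os
import sys


def prepend_unique(path_value, new_dir):
    """Prepend new_dir to a PATH-style string unless it is already listed."""
    if new_dir in path_value.split(os.pathsep):
        return path_value  # already present: leave the variable untouched
    return new_dir + os.pathsep + path_value if path_value else new_dir


def find_nvidia_lib_dirs(site_packages):
    """Hypothetical helper: list <site_packages>/nvidia/<pkg>/lib directories."""
    pattern = os.path.join(site_packages, "nvidia", "*", "lib")
    return sorted(d for d in glob.glob(pattern) if os.path.isdir(d))


def preload_nvidia_libs(site_packages):
    """Load pip-installed NVIDIA libraries into the process; return loaded paths."""
    loaded = []
    if not sys.platform.startswith("linux"):
        return loaded  # this sketch covers the Linux case only
    for lib_dir in find_nvidia_lib_dirs(site_packages):
        # Extend LD_LIBRARY_PATH so child processes (e.g. ffmpeg) inherit it.
        current = os.environ.get("LD_LIBRARY_PATH", "")
        os.environ["LD_LIBRARY_PATH"] = prepend_unique(current, lib_dir)
        # Assumed pattern: matches versioned names such as libcudnn.so.9.
        for so_path in sorted(glob.glob(os.path.join(lib_dir, "lib*.so*"))):
            try:
                # RTLD_GLOBAL makes the symbols visible to libraries loaded later.
                ctypes.CDLL(so_path, mode=ctypes.RTLD_GLOBAL)
                loaded.append(so_path)
            except OSError:
                # A library whose own dependencies are missing is skipped.
                continue
    return loaded
```

Under these assumptions, a caller would run something like `preload_nvidia_libs(sysconfig.get_paths()["purelib"])` before importing onnxruntime. Calling it a second time leaves LD_LIBRARY_PATH unchanged, because `prepend_unique` returns the existing value when the directory is already listed.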