remove-ai-watermarks

mirror of https://github.com/wiltodelta/remove-ai-watermarks.git synced 2026-07-05 07:57:50 +02:00

Author	SHA1	Message	Date
Victor Kuznetsov	5a612adfef	chore(release): v0.12.0 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-22 11:14:55 -07:00
Victor Kuznetsov	f97fdc5b92	chore(release): v0.11.4	2026-06-19 10:06:04 -07:00
Victor Kuznetsov	be566e8868	chore(release): v0.11.3	2026-06-18 17:28:21 -07:00
Victor Kuznetsov	9f6c26a439	refactor(c2pa): read manifests via official c2pa-python, keep byte-scan fallback extract_c2pa_info now uses the c2pa-python Reader first (any container, whole manifest store incl. ingredient manifests), falling back to the hand-rolled caBX parser for blobs the validator rejects (synthetic/partial, broken wheel). The issuer/source-type/SynthID/soft-binding registry scan is shared by both paths (_populate_registry_fields), so the return-dict contract is unchanged. Also replaces the dead `from c2pa import has_c2pa_metadata` import in metadata.py with a real Reader presence check. c2pa-python added as a core dep (MIT/Apache, ~+5MB RSS, no torch; wheels cover the CI matrix). Validated on the full local spaces corpus (25,725 imgs): 0 regressions; 384 manifests newly parsed (379 non-PNG JPEG/WebP + 2 PNGs the byte-scanner missed); 3 false Adobe/Microsoft->Google attributions fixed via real-manifest parsing. The docs/module-internals.md section for this change already landed in `41f6797`. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-18 17:24:58 -07:00
Victor Kuznetsov	6237429610	chore(release): v0.11.2	2026-06-12 21:37:04 -07:00
Victor Kuznetsov	b08405bece	chore(release): v0.11.1 Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-12 12:15:20 -07:00
Victor Kuznetsov	c8bc4b7c68	chore(release): v0.11.0 Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>	2026-06-11 10:03:51 -07:00
Victor Kuznetsov	d763581ed3	chore(release): v0.10.3 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-10 15:53:50 -07:00
Victor Kuznetsov	0a77d3198e	chore(release): v0.10.2 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-10 10:38:50 -07:00
Victor Kuznetsov	9aea5f240f	chore: improve discoverability (PyPI keywords/classifiers, README badges) Research-informed metadata for organic dev discovery: - pyproject: add a keywords field (was absent; biggest PyPI search gap) and expand classifiers (audience, console, security, AI, utilities); rewrite the summary noun-first, naming Nano Banana / SynthID / C2PA verbatim. - README: add PyPI version, Python versions, downloads, and license badges. GitHub topics (comfyui, watermark-remover) and the repo description were updated out of band. PyPI metadata ships on the next release. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-10 10:34:43 -07:00
Victor Kuznetsov	5777458296	chore(release): v0.10.1 Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>	2026-06-09 17:08:44 -07:00
Victor Kuznetsov	295e7ada2b	chore: project review (dev tools in extras, dep upgrades, optional-deps guard, stale cleanup) Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>	2026-06-09 17:03:17 -07:00
Victor Kuznetsov	826cfdb82a	chore(release): v0.10.0 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-09 13:24:37 -07:00
Victor Kuznetsov	ea098cf1be	chore(release): v0.9.0 BREAKING: - Drop `--restore-faces` / `--restore-faces-method` CLI flags - Drop `restore`, `photomaker`, `instantid` extras - Drop `restore_faces` / `restore_faces_method` params from InvisibleEngine.remove_watermark and AutoConfig Rationale (full empirical record in docs/synthid-robust-identity-research-2026-06-08.md "Empirical follow-up"): every face-restore approach evaluated 2026-06-04 - 2026-06-08 (GFPGAN-on- cleaned, PhotoMaker-V2, InstantID txt2img, InstantID img2img-on-cleaned at three parameter sweeps) regenerates the face via SDXL diffusion -- output face pixels are diffusion-fresh, so the regenerated face inherits SDXL's "clean skin" aesthetic and loses original identity precision. The result looks MORE AI-generated than the cleaned image, not less. The cleaned controlnet 0.20 image is the least-AI face state we can reach without re-introducing SynthID. License: - MIT -> Apache 2.0 (Apache adds an explicit patent grant + trademark clause; better fit with the upstream Apache projects this library mirrors / depends on -- diffusers, transformers, controlnet-aux, xinsir's controlnet weights) Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-08 21:28:09 -07:00
Victor Kuznetsov	a4554bb5d3	chore(license): switch from MIT to Apache 2.0 Replace LICENSE with the canonical Apache License 2.0 text + a brief copyright notice for "wiltodelta 2025-2026". Update pyproject.toml's `license` field to "Apache-2.0" and the PyPI classifier to "Apache Software License". Update README's License section to point at the LICENSE file and name the copyright holder. Why: Apache 2.0 gives downstream users an explicit patent grant and the trademark-use clause, which MIT doesn't carry. It is also the more common license among the upstream projects this library depends on / mirrors (diffusers, transformers, controlnet-aux, xinsir's canny controlnet weights), so contributions can flow either way without a permission-shape mismatch. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-08 21:23:57 -07:00
Victor Kuznetsov	20d7eda96a	remove: drop all face-restore code (regeneration, not preservation) Empirical conclusion from the 2026-06-04 - 2026-06-08 Modal cert sweeps: every face-restore approach we built (GFPGAN-on-cleaned, PhotoMaker-V2, InstantID txt2img, InstantID img2img-on-cleaned at three parameter settings) regenerates the face via SDXL diffusion rather than preserves it. Output face pixels are diffusion-fresh, so the regenerated face inherits SDXL "clean skin" aesthetic and loses original identity precision -- it looks MORE AI-generated than the cleaned image, not less. The cleaned image from the main controlnet 0.20 removal pass is the least-AI face state we can reach without re-introducing SynthID. Nothing in the restore family achieves the actual goal (preserve the original person's face). Keeping them around as opt-in invites users to ship something that defeats the point. Removing entirely. Library changes: - Deleted src/remove_ai_watermarks/instantid_restore.py - Deleted src/remove_ai_watermarks/photomaker_restore.py - Deleted tests/test_instantid_restore.py - Deleted tests/test_photomaker_restore.py - Removed `instantid` and `photomaker` extras from pyproject.toml - Removed `[tool.hatch.metadata] allow-direct-references = true` (was only needed for the photomaker git+ URL) - InvisibleEngine.remove_watermark: dropped `restore_faces` + `restore_faces_method` params, removed both `_restore_faces_instantid` and `_restore_faces_photomaker` private methods, removed dispatch - CLI: dropped `_restore_faces_options` decorator, all four cmd_* signatures lose `restore_faces` + `restore_faces_method`, kwarg passes to remove_watermark dropped - _apply_auto: dropped `restore_faces` from tuple shape (was unused after the engine no longer takes it) - auto_config.AutoConfig: dropped `restore_faces` field; `plan()` no longer sets it; `reason` no longer mentions it - Tests updated accordingly (test_auto_config.TestReason no longer asserts "face-restore on" in the reason string) Docs updated: - CLAUDE.md: removed the photomaker extras bullet, the Face restore trade-off bullet, the instantid_restore.py + photomaker_restore.py module bullets; replaced restore mentions in watermark_remover and controlnet bullets and prod recipe with the empirical conclusion - README.md: removed both `--restore-faces` callouts and the install snippet; the feature bullet and auto-mode comment updated - docs/synthid-robust-identity-research.md: added Status-retired notice at the top pointing at the 2026-06-08 followup raiw-app: - modal_cert.py: dropped `--restore-faces` flag entirely; sweep() no longer takes restore_faces; pinned _LIB_SPEC to `[gpu]` extras (no `photomaker` / `instantid` extras), points at main ruff + strict pyright clean; 569 tests pass; 18 restore-specific tests gone. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-08 21:21:58 -07:00
Victor Kuznetsov	70e8b3a517	feat(face-restore): add InstantID as the default non-commercial restore path Per the 2026-06-08 deep-research synthesis (docs/synthid-robust-identity- research-2026-06-08.md), the entire ArcFace-class identity-adapter ecosystem for SDXL is blocked from commercial use by InsightFace's non-commercial model packs (antelopev2 / buffalo_l). No commercial-safe ArcFace-grade identity stack exists today. The user explicitly opted into shipping a non-commercial restore path (research / personal use; raiw.cc must NOT install the extra). Architectural choice: InstantID over PhotoMaker-V2 as the default. - PhotoMaker-V2 (CLIP+ArcFace dual encoder, txt2img only): documented upstream identity drift on Asian male faces, visually confirmed in our cert sweep (tatsunari rendered as a generic woman; group photo collapsed into a patchwork). - InstantID (ArcFace cross-attention + landmark ControlNet): semantic identity branch + spatial weak landmark control, decoupled. Per InstantID paper (arXiv:2401.07519) and the research report, stronger identity fidelity on single portraits. Critically: NO original face pixels enter the diffusion (ArcFace embedding is semantic, landmark stick figure is pure geometry), so SynthID is not transported. Implementation: - New `src/remove_ai_watermarks/instantid_restore.py` mirrors the `photomaker_restore.py` shape (lazy singletons for pipeline + FaceAnalysis, per-face crop + _composite_faces from photomaker_restore). Loads the InstantID community pipeline via `DiffusionPipeline.from_pretrained( custom_pipeline="pipeline_stable_diffusion_xl_instantid")` -- no upstream Python package needed; diffusers fetches the file from its community examples. - New `instantid` extra in pyproject (insightface + onnxruntime + huggingface-hub). NON-COMMERCIAL block in the comment explains why. - CLI: `--restore-faces-method [instantid\|photomaker]`, default `instantid`. Both methods explicitly labeled NON-COMMERCIAL in the help text. - Engine: dispatch on `restore_faces_method` to either `_restore_faces_instantid` or `_restore_faces_photomaker`. - 9 control-flow tests for InstantID without model download (mirror the photomaker_restore.py test pattern + draw_kps helper checks). 587/587 pass. Diffusers-0.38 compat verified by upstream code inspection: the InstantID pipeline inherits from `StableDiffusionXLControlNetPipeline`, uses only public diffusers APIs (`encode_prompt`, `prepare_image`, `prepare_latents`, `get_guidance_scale_embedding`), uses legacy attention processor API which diffusers preserves for backward compat. No PhotoMaker-V1-style internal text_encoder access. End-to-end execution will be validated by the Modal cert sweep in the next step. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-08 19:44:17 -07:00
Victor Kuznetsov	65de8df5c5	refactor(face-restore): drop GFPGAN, ship PhotoMaker-V2 as the sole restore (non-commercial) Visual review of the GFPGAN-on-cleaned output (9-face grid, 1448x1086) showed it only polished the already-drifted face without restoring identity — useless for the "restore who is in the photo" intent. Dropping it. The shipped restore path is now PhotoMaker-V2, which delivers true identity-from- embedding face regeneration via a CLIP+ArcFace dual encoder. The ArcFace branch pulls InsightFace antelopev2/buffalo_l model packs at runtime, which InsightFace releases under a research-only license, so the whole extra is NON-COMMERCIAL. raiw.cc and any monetized deployment must NOT install the `photomaker` extra. This is called out at every entry point: CLI flag help, module docstring, pyproject extra block, CLAUDE.md extras bullet, README install snippet. Changes: - Deleted `src/remove_ai_watermarks/face_restore.py` and its tests. - Deleted the `restore` extra (gfpgan/facexlib/basicsr + scipy<1.18 / numba<0.60 pins) and the basicsr setuptools<69 build pin from pyproject.toml. - Restored `src/remove_ai_watermarks/photomaker_restore.py` (V2 this time: `TencentARC/PhotoMaker-V2`, `photomaker-v2.bin`, no `pm_version='v1'` override). - Restored the `photomaker` extra in pyproject with all the upstream-compat pins (einops, peft, onnxruntime, insightface) and the `allow-direct-references` hatch metadata block. - `InvisibleEngine` swapped `_restore_faces` -> `_restore_faces_photomaker`; `--restore-faces-method` removed (only one method, no choice). - CLI flag help, CLAUDE.md, README, docs/synthid.md, and docs/controlnet-removal-pipeline-research.md all updated. - docs/synthid-robust-identity-research.md status notice rewritten to list both abandoned commercial-safe attempts (V1 + GFPGAN-on-cleaned) and the non-commercial trade-off we accepted. ruff + strict pyright(src/) clean; 578 tests pass (the 9 GFPGAN tests are gone, the 11 PhotoMaker tests stay green). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-08 18:41:01 -07:00
Victor Kuznetsov	01fe98bf54	refactor(face-restore): rollback PhotoMaker, restore GFPGAN on the CLEANED image After 7 cascading upstream-compat fixes (insightface dep, peft dep, pm_version, device, etc.), the PhotoMaker V1 cert sweep still hit a CFG batch-dim mismatch inside the denoising loop. The upstream PhotoMaker `pipeline.py` is forked from diffusers v0.29.1 and our env runs 0.38; SDXL prompt-encoder handling changed significantly between those versions, so making PhotoMaker work end-to-end needs a proper fork or a diffusers downgrade — both expensive. Not worth shipping today. Pivot: restore `face_restore.py` (GFPGAN) with a single-line fix that makes it SynthID-safe by construction. The previous design ran GFPGAN.enhance on the ORIGINAL watermarked image and was oracle-confirmed to re-add SynthID via the weight-0.5 pixel blend. The fix is to run GFPGAN on the diffusion-CLEANED image — whatever pixels GFPGAN derives from are already SynthID-free, so the partial blend cannot transport the watermark. Identity fidelity is lower than a true identity-as-embedding stack would deliver, but it ships and works. Changes: - `src/remove_ai_watermarks/face_restore.py` restored from pre-wipe state with one line changed: `restorer.enhance(cleaned_bgr, ...)` instead of `restorer.enhance(original_bgr, ...)`. `original_bgr` is kept as an unused positional argument for API stability. - `src/remove_ai_watermarks/photomaker_restore.py` and its tests REMOVED. The research note (`docs/synthid-robust-identity-research.md`) keeps a "status notice" documenting why PhotoMaker is parked for now and what the path back in would look like. - `pyproject.toml` `restore` extra restored (gfpgan/facexlib/basicsr + scipy<1.18 + numba<0.60 pins + the basicsr setuptools<69 build pin), plus `photomaker` extra (with its einops/insightface/peft pile) and the `[tool.hatch.metadata] allow-direct-references = true` block REMOVED. - `InvisibleEngine._restore_faces_photomaker` removed; `_restore_faces` restored. The `--restore-faces` CLI flag and its plumbing through cmd_* signatures are unchanged. - CLAUDE.md, README.md, docs/synthid.md, docs/controlnet-removal-pipeline- research.md updated to describe the shipped GFPGAN-on-cleaned design and to reference PhotoMaker only as the parked alternative. ruff + strict pyright(src/) clean; 578 tests pass. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-08 16:55:45 -07:00
Victor Kuznetsov	9435e12ce6	fix(photomaker extra): add peft dep (required by pipe.fuse_lora) Modal cert sweep #4 got further -- PhotoMaker V1 components actually loaded ("Loading PhotoMaker v1 components [1] id_encoder ... [2] lora_weights") -- and died on the next step: "PEFT backend is required for this method." That's diffusers' fuse_lora call gated on the peft library, which PhotoMaker doesn't declare in its install_requires either. Pin peft>=0.10.0 in the photomaker extra. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-08 16:23:32 -07:00
Victor Kuznetsov	860bde4a26	fix(photomaker extra): pin `insightface` for import resolution (MIT code only) The upstream PhotoMaker package's `__init__.py` unconditionally imports a face-analyser class from its `insightface_package` submodule, so JUST importing `PhotoMakerStableDiffusionXLPipeline` (the V1 pipeline class we use) raises `ModuleNotFoundError: No module named 'insightface'` if insightface isn't present in the env. The Modal cert sweep caught this on the V1 image. Resolution: pin `insightface>=0.7.3` (and its `onnxruntime` runtime dep) in the `photomaker` extra. The PyPI insightface package is MIT-licensed CODE; the non-commercial restriction sits on the pretrained model packs (antelopev2, buffalo_l) which download only when `FaceAnalysis()` is instantiated. Our V1 path never instantiates the face-analyser -- it loads photomaker-v1.bin (CLIP-only encoder) via `load_photomaker_adapter` -- so the model-pack license does not bind us; we depend only on the MIT code for the import to resolve. Safety guards: - Runtime check in `_get_pipeline`: raises if `_PHOTOMAKER_FILE` is ever pointed at v2 (so a future maintainer can't silently regress to the InsightFace path). - New test class `TestV1OnlyCommercialSafetyGuard`: asserts repo + filename pin to V1 AND asserts the module source never references the face-analyser class (a static check that our codepath stays out of the runtime that would pull the non-commercial model packs). Docs: documented the import dance + legal split inline at the top of `photomaker_restore.py`. ruff clean; 581 tests pass (the 9 PhotoMaker tests plus 3 new V1-guard tests). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-08 16:13:20 -07:00
Victor Kuznetsov	7e6fc8bfb9	fix(photomaker extra): add einops explicitly (upstream missed it) PhotoMaker imports einops in its forward path but its install_requires doesn't declare it, so the photomaker extra resolved without einops on a clean install and the Modal cert sweep died at the restore-faces step with "No module named 'einops'" -- the post-pass failed gracefully and returned the un-restored cleaned output, so the cert artifact had no face recovery. Pin einops>=0.7.0 in the photomaker extra so the extra is self-contained. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-08 15:46:28 -07:00
Victor Kuznetsov	439eeadc07	refactor(face-restore): wipe GFPGAN path, --restore-faces is PhotoMaker-only The GFPGAN `restore` extra and its `face_restore.py` module are gone. They were oracle-confirmed to re-introduce SynthID by blending watermarked original face pixels at fidelity weight 0.5 (clean A/B: gemini_3 controlnet 0.20 detected WITH GFPGAN, clean WITHOUT). Keeping them as the default restore method was a footgun for the removal pipeline. PhotoMaker-V2 (added in the previous commit) is the single shipped restore path now -- identity-as-embedding, SynthID-safe by construction. Removed: - src/remove_ai_watermarks/face_restore.py + tests/test_face_restore.py - pyproject.toml `restore` extra (gfpgan/facexlib/basicsr + scipy/numba pins) - pyproject.toml `[tool.uv.extra-build-dependencies] basicsr = [...]` build pin - CLI: `--restore-faces-method` and `--restore-faces-weight` (no method choice to make, no GFPGAN weight knob to expose) - InvisibleEngine._restore_faces method (only _restore_faces_photomaker remains) - All restore-faces-method / restore-faces-weight threading through cmd_* signatures and _process_batch_image Kept: - `--restore-faces / --no-restore-faces`: now binds to PhotoMaker-V2. - All adopted oracle findings about GFPGAN re-introducing SynthID (kept in the research docs as historical context that explains why the path was removed). Docs updated: CLAUDE.md (restore extras bullet collapsed to photomaker, removed face_restore Key-modules bullet, several inline GFPGAN refs scrubbed), README.md (face-identity callout + install section now point to the photomaker extra), docs/synthid.md 5.5 (net recipe), docs/controlnet-removal-pipeline-research.md (recommendations). ruff + strict pyright (src/) clean; 578 tests pass (the 9 GFPGAN tests are gone, the 9 PhotoMaker tests stay green). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-08 15:35:37 -07:00
Victor Kuznetsov	1439eb0714	feat(photomaker): SynthID-safe face-identity restoration via PhotoMaker-V2 Adds the second face-restore mechanism, selectable via the new CLI option `--restore-faces-method=photomaker`. Unlike the existing GFPGAN path (which runs on the watermarked ORIGINAL and was oracle-confirmed to re-introduce SynthID by partial pixel blending), PhotoMaker carries identity in a SynthID-invariant OpenCLIP embedding and regenerates fresh face pixels conditioned on it — the pixels in the output are diffusion-fresh, so the watermark cannot be transported. The load-bearing assumption (embedding invariance to SynthID-magnitude pixel noise) was empirically validated in the prior commit (smoke test): cosine drift 0.002 under a ±2 LSB low-freq carrier, an order of magnitude less than JPEG90 drift which SynthID survives at >=99% TPR. End-to-end commercial-safe: - PhotoMaker-V2 weights: Apache-2.0 (TencentARC) - ID encoder: OpenCLIP-ViT-H/14 (MIT) - SDXL base: shared with the main pipeline - NO InsightFace (the non-commercial blocker for IP-Adapter FaceID / InstantID / PuLID / Arc2Face) Two-pass architecture (PhotoMaker has no ControlNetImg2img class in diffusers): 1) main controlnet/default removal pass cleans SynthID + drifts faces 2) PhotoMaker txt2img regenerates each face from its embedding, feather-composited back into the cleaned image New module `photomaker_restore.py` mirrors `face_restore.py`: lazy pipeline singleton (double-checked lock), `is_available()` gate, pure `_face_crop_square` and `_composite_faces` helpers, all unit-tested without the model (9 new tests). New `InvisibleEngine._restore_faces_photomaker` runs after the diffusion pass, mirroring `_restore_faces`. CLI flag `--restore-faces-method=[gfpgan\|photomaker]` threaded through `cmd_invisible`/`cmd_all`/`cmd_batch` + `_process_batch_image`. New optional `photomaker` extra (Apache-2.0 + Apache-2.0/MIT deps, no basicsr). `[tool.hatch.metadata] allow-direct-references = true` is required because the upstream PhotoMaker package lives only on GitHub. The next step (separate work) is oracle validation: run a 6-image cert sweep through the new pipeline (default/controlnet at the certified strength + --restore-faces-method=photomaker) and confirm SynthID stays clean while face identity is recovered. The required infrastructure (`raiw-app/modal_cert.py`) is already in place. ruff + strict pyright(src/) clean; 586 tests pass (+ 9 new in tests/test_photomaker_restore.py). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-08 15:20:29 -07:00
Victor Kuznetsov	6d11c11b52	feat(auto): DBNet text detector, Real-ESRGAN upscaler, batch --auto Three content-quality features for the invisible/all/batch pipeline. DBNet text detector (auto_config): replace the MSER text heuristic with PP-OCRv3 differentiable-binarization via cv2.dnn.TextDetectionModel_DB, using a bundled 2.4 MB Apache-2.0 model (en/cn detection nets are byte-identical, so it ships language-neutral). cv2.dnn is core OpenCV, so no new pip dep. MSER stays as the fallback when the model can't load. Validated on real images: matches MSER everywhere and additionally catches the Doubao CJK mark MSER missed; routing decisions unchanged otherwise. Real-ESRGAN upscaler (new upscaler.py, esrgan extra): optional pre-diffusion super-resolution for the min-resolution floor upscale, loaded via spandrel (MIT, no basicsr) with BSD-3-Clause weights downloaded on first use. New --upscaler {lanczos,esrgan} on invisible/all/batch; default stays lanczos and the engine falls back to lanczos when the extra is absent or the model errors (never breaks removal). It is a manual opt-in knob (the auto plan never selects it) -- as a generic GAN it sharpens photo/texture content strongly but can degrade faces (the diffusion pass regenerates them) and thin text, documented accordingly. batch --auto: wire the content-adaptive --auto (+ --adaptive-polish) into cmd_batch. The plan is recomputed per image and the invisible engine is cached per resolved pipeline (default/controlnet), so a mixed directory builds at most one engine of each kind. Verified end-to-end: 3 mixed images routed correctly with only 2 pipeline loads (controlnet reused). ruff + strict pyright(src/) clean; 558 tests pass. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-04 16:04:33 -07:00
Victor Kuznetsov	5cf68a6a3d	refactor: unify C2PA vendor registry + code-health fixes + uv publish Three P2 cleanups from a library-wide review. Detection -- single C2PA_AI_VENDORS registry (noai/constants.py): - C2PA_ISSUERS, SYNTHID_C2PA_ISSUERS, and identify._ISSUER_PLATFORM now derive from one C2paAiVendor table, so adding a C2PA vendor is one entry instead of edits in three places across two files. Behavior-identical (262 detection tests pass; the kept `needle` field is load-bearing -- it differs from `org` for Google and ByteDance, with no mechanical derivation). Code-health: - region_eraser.erase_lama now accepts grayscale/BGRA like erase_cv2 (it crashed on grayscale and silently dropped alpha on BGRA). +2 regression tests. - batch frees the device cache between images via a shared try_empty_device_cache helper (generalized from the MPS-only _try_clear_mps_cache, now reused by both the MPS->CPU fallback and the batch loop). - batch gained --controlnet-scale (parity with invisible/all). CI / packaging: - publish.yml uploads via `uv publish` (PyPI trusted publishing over OIDC), replacing pypa/gh-action-pypi-publish so uploads no longer depend on that action's bundled twine accepting the Metadata-Version. Workflow filename + pypi environment unchanged, so PyPI's trusted-publisher entry still matches. - hatchling pin relaxed <1.28 -> <1.31 (verified against hatch's changelog: 1.30.0 made Metadata 2.5 the default, 1.30.1 reverted to 2.4; 1.27-1.29 were always 2.4). Kept as belt-and-suspenders so the first uv-publish release ships 2.4, isolating the uploader swap from the metadata-version bump. Docs (CLAUDE.md, pyproject) synced; corrected the inaccurate "hatchling 1.28+ emits 2.5" note. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 21:01:07 -07:00
Victor Kuznetsov	411ef16ec3	feat: GFPGAN face-identity restoration post-pass Add an optional, commercial-safe face-restoration post-pass that recovers face identity the diffusion removal pass drifts (canny holds structure, not likeness) while still scrubbing the pixel watermark in the face regions. - face_restore.py: GFPGANer singleton (CPU unless CUDA), the basicsr torchvision.transforms.functional_tensor shim, and the pure feather _composite_faces helper (unit-tested without the model). GFPGAN re-synthesizes each face from a StyleGAN2 prior, so composited face pixels are GAN-generated (no watermark, no pixel-copy) -- oracle-clean at weight 0.5 with identity preserved. - InvisibleEngine.remove_watermark: restore_faces / restore_faces_weight, best-effort, auto-skips when the extra is absent or no face is detected. - CLI --restore-faces/--no-restore-faces + --restore-faces-weight on invisible/all/batch (on by default). - restore extra (gfpgan/facexlib/basicsr), numpy<2-pinned (scipy<1.18, numba<0.60) and kept out of `all`; basicsr needs Python <3.13 + setuptools<69 to build, so pin .python-version 3.12. Commercial-safe: GFPGAN Apache-2.0, RetinaFace MIT. The CodeFormer alternative is non-commercial and is not shipped. The earlier IP-Adapter FaceID layer was removed (footgun: needs high strength, corrupts faces at the low removal strength). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 16:59:28 -07:00
Victor Kuznetsov	d90d5d886a	feat: controlnet pipeline for text/face-structure preservation Add `--pipeline controlnet` (SDXL base + xinsir canny ControlNet via StableDiffusionXLControlNetImg2ImgPipeline): the canny edge map conditions the img2img regeneration so text and face STRUCTURE stay sharp, while the watermark is still removed by the regeneration (`strength`) -- no original pixels are copied or frozen, so SynthID does not survive. Oracle-verified clean on OpenAI with better text/structure fidelity than plain img2img at equal strength. `--controlnet-scale` tunes structure preservation; fp32 on mps/cpu (fp16-fixed VAE on cuda/xpu). Shares the img2img runner (live progress + MPS->CPU fallback) and the fp16-VAE-fix / device-move helpers with the default pipeline. Remove the superseded subsystems -- ctrlregen (SD1.5 clean-noise), text-protection (differential / region-hires) and face-protection: they either destroyed real content or shielded the watermark by re-using original pixels. controlnet replaces them by regenerating everything under edge conditioning. Canny preserves face structure but not identity; face IDENTITY is a separate face-restoration post-pass (CodeFormer/GFPGAN), researched + prototyped but not yet shipped. An IP-Adapter FaceID attempt was built and removed (footgun: needs high strength, corrupts faces at removal strength). Docs: docs/controlnet-removal-pipeline-research.md, scripts/controlnet_sweep.py. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 16:59:28 -07:00
Victor Kuznetsov	35116d5e97	chore(release): v0.8.9 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 19:04:32 -07:00
Victor Kuznetsov	9cb66992bd	chore(release): v0.8.8 Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-02 09:18:02 -07:00
Victor Kuznetsov	b25276c4f2	chore(release): v0.8.7	2026-06-01 19:33:08 -07:00
Victor Kuznetsov	1708857772	fix(gemini): expand sparkle search area 256 -> 512px from corner The 256px limit caused misses when Gemini places the sparkle further from the corner than the standard 160px (margin 64 + logo 96). Observed variant at ~300px reported in issue #30. 512px covers all known Gemini margin variations with room to spare; matchTemplate on a 512x512 region is still fast on CPU. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 10:42:04 -07:00
Victor Kuznetsov	25cc4750df	chore(release): v0.8.5	2026-06-01 10:31:59 -07:00
Victor Kuznetsov	72812de03c	chore(release): v0.8.4	2026-05-31 20:46:52 -07:00
Victor Kuznetsov	2e23cf9c4b	fix(build): pin hatchling<1.28 to keep Metadata-Version 2.4 (PyPI upload rejected 2.5) hatchling 1.28+ emits Metadata-Version 2.5 (PEP 639); the twine in pypa/gh-action-pypi-publish@release/v1 rejects it, which failed the v0.8.3 PyPI upload (build + tag-match passed, upload step failed, nothing uploaded). 1.27.x emits 2.4, which uploads fine (0.8.2). Pin the build backend; lift once the action twine is 2.5-aware or the workflow uses uv publish. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-31 17:49:19 -07:00
Victor Kuznetsov	c155f81078	chore(release): v0.8.3 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-31 17:41:10 -07:00
Victor Kuznetsov	729f5f2ecd	chore(release): v0.8.2 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-31 15:46:47 -07:00
Victor Kuznetsov	e42b7e9d6a	refactor(cli): plain-text console output; drop rich; quiet transformers cli.py now emits plain ASCII through a small click.echo shim (_Console / _Table / _Progress) instead of rich: no colors, markup tags, panels, progress bar, or Unicode glyphs (Warning: / -> / ... and dropped checkmark/cross marks). identify and metadata tables render as indented plain lines. - drop rich from dependencies (pyproject.toml + uv.lock) - __init__: set TRANSFORMERS_VERBOSITY=error (setdefault) plus a warnings filter so the transformers Siglip2ImageProcessorFast deprecation no longer prints at CLI startup (it fires from the eager noai import) - TestGpuHintMarkup: the [gpu] hint is now printed verbatim; docstring updated - CLAUDE.md: replace the obsolete rich-markup lesson, note the verbosity fix Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-31 15:21:29 -07:00
Victor Kuznetsov	e7c57e3892	chore(release): v0.8.1 — exclude data/ from sdist The 0.8.0 PyPI publish uploaded the wheel but the sdist was rejected (400 File too large): hatchling's default sdist bundled the committed data/ test corpora (synthid_corpus images + the new visible-mark captures), pushing it past PyPI's per-project file-size limit. Add a sdist target that excludes /data, dropping it ~85 MB -> 9.8 MB. The wheel already ships only src/ and is unaffected. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-31 12:57:46 -07:00
Victor Kuznetsov	315320056b	chore(release): v0.8.0 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-31 12:25:02 -07:00
Victor Kuznetsov	5298dcc6a3	chore(release): v0.7.2 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-30 14:35:04 -07:00
Victor Kuznetsov	9be66752c5	chore(release): v0.7.1 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-30 14:21:02 -07:00
Victor Kuznetsov	c928ee6e42	chore(release): v0.7.0 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-30 12:35:32 -07:00
xchacha20-poly1305	0c7ff1874e	feat(device): support xpu backend (#24 ) * feat(device): support xpu backend * Fall back to CPU seed generator when device RNG unsupported (xpu) Some torch-xpu builds have no device-side RNG, so torch.Generator(device="xpu") raises when --seed is used. _make_seed_generator tries the device generator and falls back to a backend-agnostic CPU generator. Adds a fallback unit test. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Victor Kuznetsov <kuznetsov.va@gmail.com> Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-29 11:13:23 -07:00
Victor Kuznetsov	a46268f6eb	Add cross-platform CI test matrix + PyPI classifiers (#25 ) * Add cross-platform CI test matrix, PyPI classifiers CI: new test.yml runs lint (ubuntu) + a test matrix (ubuntu/macos/windows x py3.10/3.12, core+dev, GPU tests skip) on push to main and PRs, closing the gap where only the release publish.yml ran (ubuntu, no tests). Add PyPI classifiers (OS/Python/topic). README Tests badge, CLAUDE.md CI note. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * Make availability tests reflect installed deps, not assume gpu extra The new core+dev CI matrix has no diffusers, so the invisible-engine availability tests (asserting is_available() is True unconditionally) and the two mocked invisible CLI tests (whose command gates on is_available before the mock) failed. Assert availability == actual importability of torch+diffusers, and patch the CLI availability gate so the mocked-engine tests run regardless of the gpu extra. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-05-29 11:04:12 -07:00
Victor Kuznetsov	96b3653b9e	chore(release): v0.6.12 Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 18:54:44 -07:00
Victor Kuznetsov	888c8c2556	chore(types): clear strict-pyright debt across src (0 errors) Make `pyright src/` strict-clean via a hybrid: pure-logic files are fully typed (piexif gets a local typings/ stub; PIL info-dict loops guard isinstance(key, str); progress returns Callable[..., None]; availability checks use importlib.util.find_spec instead of unused imports), while the irreducibly-untyped cv2/torch/diffusers boundary files carry a documented per-file `# pyright:` relax pragma (or a ctrlregen executionEnvironment) that disables only the unknown-type rules. Public ndarray-returning signatures on the relaxed engines are annotated NDArray[Any] so strict consumers (cli.py) stay clean. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 14:00:15 -07:00
Victor Kuznetsov	f326bab189	chore(release): v0.6.11 Ships the China TC260 AIGC PNG-chunk and HuggingFace hf-job-id provenance detectors (`223cbcf`). Also syncs src/__init__.__version__, which had drifted to 0.6.9 (not bumped in the 0.6.10 release). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 12:43:52 -07:00
Victor Kuznetsov	a0bf62e601	feat(invisible): preserve text/CJK via Differential Diffusion (--protect-text) (v0.6.10) SDXL img2img regenerates every pixel, so small text and CJK glyphs deform at the strengths that defeat SynthID (issue #21). With --protect-text a CJK-native PP-OCRv3 detector (2.4 MB ONNX, cv2.dnn, no torch, cached on first use) locates text regions and the pass switches to the SDXL Differential-Diffusion community pipeline: a per-pixel change map keeps text regions largely intact while the background is regenerated to strip the watermark. Gated to the SDXL default model; falls back to plain img2img with a warning when unavailable. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 11:59:15 -07:00
Victor Kuznetsov	7db4e231e8	fix(deps): require transformers 5.x with stable tokenizers for SDXL load diffusers 0.38's auto-pipeline registry imports a transformers 5.x-only symbol, so the gpu extra needs transformers>=5. Cap tokenizers to the stable 0.22 line so the global prerelease="allow" no longer drags in the 0.23.0rc0 whose CLIP tokenizer breaks SDXL loading. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 11:58:53 -07:00

1 2

84 Commits