remove-ai-watermarks

mirror of https://github.com/wiltodelta/remove-ai-watermarks.git synced 2026-07-24 16:40:50 +02:00

Author	SHA1	Message	Date
test-user	27539c0da9	docs: tidy roadmap to open items only (drop shipped v0.6.7-0.6.9) The Roadmap is the project TODO; shipped features (Integrity Clash, streaming-MP4 scan window, meta-box XMP blanking) no longer belong under "not yet implemented". Removed them and kept the still-open remainder as its own item (AVIF/HEIF Exif item inside the meta box). Net open TODO: SynthID v2 regression test, local SynthID pixel detector, grow the SynthID corpus, real non-PNG C2PA fixtures, pyright maintenance debt, meta-box Exif item, Canon/Samsung device signers, Resemble PerTh (dead end), video pipeline. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-27 18:18:28 -07:00
test-user	5bfed00553	feat(metadata): blank AI-label XMP inside the HEIF/AVIF meta box (v0.6.9) HEIF/AVIF store XMP as a meta-box `mime` item whose bytes live in mdat/idat, out of reach of the top-level uuid/jumb box stripper. An AI-label XMP packet there (TC260 AIGC, IPTC "Made with AI", IPTC 2025.1) was therefore left in place. isobmff.blank_ai_xmp_packets locates each XMP packet by its <?xpacket begin ... end?> delimiters and, if it carries an AI marker (_AI_LABEL_MARKERS), overwrites it with spaces of the SAME length. Equal length means no box size or iloc offset shifts -- the coded image stays bit-for-bit intact, the item stays structurally valid, only the AI label content is destroyed. Plain (non-AI) XMP is left alone, mirroring the top-level XMP-uuid content match. Wired into remove_ai_metadata's ISOBMFF branch after strip_c2pa_boxes. Chosen over exiftool (a non-bundled binary dep) to stay pure-Python and droplet-compatible; over full iinf/iloc surgery to avoid offset-rewrite corruption risk. The AI labels we target are all XMP, so this closes the practical gap. An Exif item inside the meta box (rare) still needs iinf/iloc surgery or exiftool -- documented. 4 new tests (TestMetaBoxXmpBlanking): AI packet blanked (same length, marker gone, surrounding image bytes intact), plain XMP preserved, no-packet no-op, and end-to-end remove_ai_metadata on a .heic. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-27 18:15:48 -07:00
test-user	31f0a82906	feat(metadata): detect C2PA/AIGC/IPTC manifests after a large mdat in MP4 (v0.6.8) Provenance detection no longer relies on a fixed first-MB read. In a streaming / non-faststart MP4 the C2PA manifest sits AFTER a multi-megabyte mdat, beyond the 1 MB scan window, so it was missed. - isobmff.scan_c2pa_region(path): a file-seeking top-level box walker that returns the payloads of uuid/jumb (provenance) boxes, seeking past mdat by size without reading it -- works on multi-GB files. Returns b"" for non-ISOBMFF or on read error. Mirrors the box-size encoding of the existing in-memory _iter_top_level_boxes (largesize / size==0). - metadata.scan_head(path, size): the shared input for every C2PA/AIGC/IPTC byte scan -- first __TEXT __DATA __OBJC others dec hex bytes plus, for ISOBMFF, the late provenance-box payloads. Behavior-neutral (f.read(size)) for non-ISOBMFF inputs. - Routed all six metadata scan sites (has_ai_metadata, aigc_label, iptc_ai_system, synthid_source, exif_generator XMP, get_ai_metadata soft-binding) and identify's head read through scan_head. 6 new tests: late box found by scan_c2pa_region / scan_head, the fixed window provably misses it, non-ISOBMFF -> b"", front-placed (faststart) regression. The remaining gap stays documented: EXIF/XMP stored as items inside the meta box (AVIF/HEIF stills) still needs meta-box surgery or exiftool. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-27 13:42:29 -07:00
test-user	18160fe269	feat(identify): integrity-clash detection for contradictory provenance (v0.6.7) Surface contradictions between independent provenance signals instead of collapsing to a single verdict -- a strong tell of spoofed, transplanted, or laundered metadata. Inspired by arXiv:2603.02378. Two rules in the new _integrity_clashes helper: - Conflicting AI-origin attributions: two or more distinct AI vendors named by independent generator stamps (e.g. a C2PA OpenAI manifest on an image whose EXIF says Make="Ideogram AI"). - Camera + AI: a camera-capture C2PA device (Pixel/Leica/Sony/Nikon/Truepic) coexisting with an AI-generation marker -- a genuine capture is not AI. High-precision by design: only hard generator stamps feed it (C2PA issuer when the source is AI, SynthID proxy, EXIF/XMP generator, IPTC AISystemUsed, xAI, AIGC). The fuzzy visible sparkle and the open invisible watermark are excluded -- the latter can be a by-product of our own SDXL removal pass. Vendor normalization (_vendor_of over _AI_VENDOR_TOKENS) keeps consistent signals from clashing (C2PA "Google (Gemini)" + SynthID-Google agree); the C2PA vendor is read from the issuer attribution, not the resolved platform, so a camera label like "Google Pixel" cannot mis-normalize to an AI vendor. Surfaced as ProvenanceReport.integrity_clashes (red in the table view, included in --json). 19 new tests; all real single-origin fixtures (chatgpt/firefly/ doubao/grok/mj) verified to produce zero clashes (false-positive guard). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-27 13:27:25 -07:00
dependabot[bot]	6694a79514	chore(deps): bump ultralytics in the minor-and-patch group (#20 ) Bumps the minor-and-patch group with 1 update: [ultralytics](https://github.com/ultralytics/ultralytics). Updates `ultralytics` from 8.4.55 to 8.4.56 - [Release notes](https://github.com/ultralytics/ultralytics/releases) - [Commits](https://github.com/ultralytics/ultralytics/compare/v8.4.55...v8.4.56) --- updated-dependencies: - dependency-name: ultralytics dependency-version: 8.4.56 dependency-type: direct:production update-type: version-update:semver-patch dependency-group: minor-and-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-05-27 12:04:47 -07:00
test-user	7b47fa9f6a	fix(io): Unicode-safe cv2 image IO + un-eat the [gpu] install hint (v0.6.6) Two CLI/IO robustness bugs surfaced by issues #17 and #19. #17 -- non-ASCII image paths (Chinese/Cyrillic/accented) failed on Windows: cv2.imread/imwrite use the platform ANSI code-page API, so the decode came back empty with a "can't open/read file" warning. New image_io.imread/imwrite route through np.fromfile+cv2.imdecode / cv2.imencode+tofile (Unicode-safe, byte- identical output, cv2.imread None-semantics preserved); all 8 cv2 read/write call sites now go through it. Behavior-neutral on macOS/Linux (already accept UTF-8 paths), so the fix is correct-by-construction for the Windows-only bug. #19 (incidental) -- rich parsed the "[gpu]" in the GPU-extra install hint as a style tag and dropped it, so the printed command was the un-installable "pip install 'remove-ai-watermarks'". Escaped as \[gpu] at both call sites. Tests: test_image_io.py (non-ASCII round-trip, alpha, missing/empty/garbage semantics); test_cli.py::TestGpuHintMarkup (install hint keeps the extra). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-27 11:52:48 -07:00
test-user	d847b39292	docs: lawful-use disclaimer + primary-source Legal-table corrections README: - surface a lawful-use / no-liability disclaimer near the top - reword two feature bullets away from detection-evasion framing ("bypass AI image classifiers" -> neutral post-processing; drop the platform-targeting language from the "Made with AI" bullet) - Legal table, each corrected against the primary text: - CA AB 2655 was struck down on Section 230 ONLY (Kohls v. Bonta, E.D. Cal., Aug 2025); the court did not reach the First Amendment (the companion AB 2839 was separately enjoined on 1A grounds) - COPIED Act: add the bill number (S. 1396, 119th Cong.) - South Korea AI Framework Act: in force 22 January 2026 (exact date) CLAUDE.md: sync the South Korea date to 22 January 2026. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-27 11:50:20 -07:00
test-user	fee0e139af	docs: refresh README roadmap + metadata-strip coverage for v0.6.x - Metadata-strip feature now lists the audio/video container coverage shipped in v0.6.0-v0.6.4 (MP4/MOV/M4V/M4A via ISOBMFF box walker; WebM/MP3/WAV/FLAC/OGG losslessly via ffmpeg). - Roadmap updated: the AVIF/HEIF item now reflects that top-level XMP/C2PA boxes and non-ISOBMFF audio/video are handled, with only meta-box-item EXIF/XMP left (needs exiftool). Added the open backlog: multi-signal "Integrity Clash" reporting (arXiv:2603.02378), Canon/Samsung device signers pending a real sample, the streaming-MP4 scan-window limit, and Resemble PerTh audio as evaluated-but-infeasible. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-27 10:42:40 -07:00
test-user	e1c99b5937	fix(identify): gate C2PA issuer->generator attribution on AI source type (v0.6.5) Prevents an unmapped C2PA device whose manifest incidentally contains a mapped issuer substring (e.g. the "Adobe XMP" toolkit string in a Canon/Sony camera capture) from being mislabeled as that AI generator ("Adobe Firefly"). _attribute_platform now names a specific AI-generator platform only when the digital-source-type is trainedAlgorithmicMedia; otherwise it degrades to the neutral "C2PA signer: X" label. Real Firefly/OpenAI/Google output carries the AI source-type and is unaffected (verified: chatgpt-1.png->OpenAI, firefly-1.png->Adobe Firefly still attribute). Closes the only real downside of leaving Canon/Samsung/Bria device signers unmapped: detection and removal were already unaffected; now the platform label degrades gracefully too. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> v0.6.5	2026-05-27 10:29:12 -07:00
test-user	f9cf14c372	feat(metadata): strip container metadata from WebM/MP3/WAV/FLAC/OGG via ffmpeg (v0.6.4) remove_ai_metadata now handles non-ISOBMFF audio/video (which the box walker can't reach) by shelling out to ffmpeg with a lossless stream copy (`-map_metadata -1 -map_chapters -1 -c copy`): codec data is untouched, only container tags/chapters (ID3 / RIFF / Vorbis comments / EBML tags) are dropped. Requires ffmpeg on PATH; raises a clear RuntimeError if absent or if ffmpeg can't parse the input (instead of crashing in the image path). Verified end-to-end: a real ffmpeg-made WAV/MP3 with a "Suno AI" title tag -> tag gone, audio bytes preserved. NOT built (evaluated, deliberate): Resemble PerTh audio detection -- `get_watermark()` returns a raw bit array with no presence/confidence flag, so reliably telling watermarked from clean needs Resemble's fixed payload or a confidence API (neither public; no real sample to calibrate). Same wall as the SynthID pixel detector. AVIF/HEIF meta-box EXIF/XMP stripping also stays a gap (needs exiftool, a non-installed binary). Both documented in CLAUDE.md. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> v0.6.4	2026-05-26 21:39:42 -07:00
test-user	bc3228d387	feat(visible): Doubao text-mark removal + universal region eraser Add deterministic, CPU-only removal of the visible Doubao "豆包AI生成" mark and a position-agnostic region eraser for any other visible watermark/logo. - doubao_engine.py: locate (geometry, scales with width) + polarity-aware white-top-hat glyph mask + cv2 inpaint; coverage-gated detection and a dense-text safety guard. No GPU, ~30ms. - region_eraser.py + `erase` command: inpaint arbitrary --region box(es). Default cv2 backend (no deps); optional big-LaMa via onnxruntime (`lama` extra, Carve/LaMa-ONNX, model downloaded on first use, never bundled). - cli `visible --mark auto\|gemini\|doubao`: auto routes by detector confidence. - tests for both engines; seed previously-unseeded CLI image fixtures to stop the Doubao detector flaking on random corners. - .gitignore: doubao_capture/{seeds,captures} scratch (alpha-map calibration). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-26 21:31:51 -07:00
test-user	9f93d9c0c5	feat(identify): add Sony C2PA device attribution, verified (v0.6.3) Adds Sony to _DEVICE_C2PA_PLATFORM, matching Sony's own `sony.sig` / `sony.cert` C2PA assertion namespace (NOT bare "Sony", which is a common EXIF Make). Verified against a real Sony-signed file (Sony PXW-Z300, signer "Sony Corporation") found in the Security4Media/c2pa-video-player repo. The sample is video (MP4) -- our ISOBMFF C2PA path detects it; Sony Alpha stills likely share the namespace. Verified device set is now Leica, Nikon, Google Pixel, Sony, Truepic. Canon / Samsung / Bria still have no public direct-download C2PA sample to verify. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> v0.6.3	2026-05-26 21:13:49 -07:00
test-user	64be9598f2	fix(identify): device-token-first C2PA attribution; add verified Pixel (v0.6.2) Replaces the claim-generator-string match with a distinctive device-token scan of the manifest bytes (_device_platform / _DEVICE_C2PA_PLATFORM), which is more robust: it catches devices where the generator name lives under a non-standard CBOR key (Pixel uses `claim_generator_info`, so it has no `claim_generator`). - Adds Google Pixel, verified against a real Pixel 10 Pro C2PA file (attached to c2pa-rs issue #1609/#1554): cert CN "Pixel Camera", digitalSourceType `computationalCapture` -> capture authenticity, not AI (is_ai stays None). - Token distinctiveness is load-bearing: bare "Truepic" matched the OpenAI chatgpt-1.png fixture (Truepic is a trust-chain signing authority), so the token is the specific "Truepic_Lens"; "Pixel Camera" (cert CN) not "Pixel". - Verified Leica/Nikon/Truepic/Pixel attribute correctly and OpenAI/Adobe/MJ do not regress. Sony/Canon/Samsung/Bria stay unmapped: no public direct- download C2PA sample exists to verify their in-manifest string. - Regression tests: device token beats incidental issuer mentions (Leica, Pixel-vs-Google). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> v0.6.2	2026-05-26 20:43:40 -07:00
test-user	dda2ee7fbb	fix(identify): attribute C2PA by claim_generator, not incidental issuer tokens (v0.6.1) Verified on real signed files that the issuer byte-scan mis-attributes multi-entity manifests: Leica read as "Truepic" (timestamp authority in the chain), Nikon as "Adobe Firefly" (XMP-toolkit "Adobe" + the sample's "Adobe_MAX" name), Truepic as "Google". Platform attribution now prefers the claim generator (what produced the asset) and falls back to the issuer scan. - New _CLAIM_GENERATOR_PLATFORM map + _platform_from_generator; claim generator read for non-PNG via the now-public c2pa.cbor_text_after. - Device tokens listed only where verified against a real C2PA file (Leica lc_c2pa, Nikon, Truepic Lens); Pixel/Samsung/Sony/Canon/Bria deferred until a real sample confirms the in-manifest string. Camera C2PA marks capture authenticity, so these never set is_ai. - cbor_text_after made public (was _cbor_text_after); call sites + tests updated. - Regression test: claim_generator beats incidental Adobe/Google/Truepic tokens. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> v0.6.1	2026-05-26 20:10:07 -07:00
test-user	2676325184	feat(c2pa): expand soft-binding vendor map with registry-verified algs Adds Trufo, Overlai, MarkAny, Mentaport, LumaTrace, VerdaAI, ContentLens, ISCC (io.iscc content code), and Adobe ICN fingerprint to C2PA_SOFT_BINDINGS, and notes AIWatermark wraps Meta PixelSeal. All `alg` prefixes verified against the official c2pa-org/softbinding-algorithm-list registry. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-26 18:00:16 -07:00
test-user	c196a16900	feat: detect soft-binding vendors, IPTC 2025.1, video/audio C2PA, TrustMark (v0.6.0) Broadens metadata provenance coverage at the detection and container-strip level. Detection: - C2PA soft-binding `alg` -> forensic-watermark vendor (Adobe TrustMark, Digimarc, Imatag, Steg.AI, Microsoft, ...) via C2PA_SOFT_BINDINGS + soft_binding_vendors_in(); names the watermark vendor even when the watermark itself can't be decoded. - IPTC Photo Metadata 2025.1 AI-disclosure XMP fields (AISystemUsed etc.) via iptc_ai_system() + IPTC_AI_FIELD_MARKERS. - Adobe TrustMark open keyless decoder (trustmark_detector.py, optional extra `trustmark`) -- the watermark behind Adobe Durable Content Credentials. Detects provenance, not AI origin, so it does not assert is_ai. Removal / containers: - isobmff.strip_c2pa_boxes now also drops a top-level XMP uuid box that carries an AI label (matched by AI-marker content, byte-order-robust; plain XMP kept). - remove_ai_metadata routes MP4/MOV/M4V/M4A (and any ftyp-sniffed ISOBMFF) through the box stripper; raises a clear error for non-ISOBMFF audio/video (WebM/MP3/WAV) instead of crashing in the image path. Tests: soft-binding scan, IPTC element/attribute/presence, MP4 + M4A detect/ strip, ISOBMFF XMP surgical strip, content-sniff, unsupported-container guard, TrustMark absent-safety + identify integration. ruff clean; pyright clean on all new modules. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> v0.6.0	2026-05-26 17:56:48 -07:00
test-user	ba94de8275	feat: strip AI-provenance EXIF tags on removal (v0.5.6) remove_ai_metadata now scrubs AI tags from the JPEG EXIF instead of passing the block through wholesale. Closes the v0.5.5 follow-up: the xAI/Grok Signature + UUID-Artist pair was detected but not removed. - metadata._scrub_ai_exif(): deletes the xAI signature pair and any Software/Make/Artist/ImageDescription tag carrying an AI_GENERATOR_TOKENS token (so Ideogram's Make="Ideogram AI" is scrubbed too), keeping genuine camera/editor EXIF intact. - Shared _is_xai_signature_pair / _exif_text helpers (module-level compiled regexes) are now the single source of truth, used by both xai_signature and _scrub_ai_exif. - Tests: Grok signature stripped on JPEG output, Ideogram Make stripped, real-camera Make ("Apple") preserved. 325 passing. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> v0.5.6	2026-05-26 14:26:20 -07:00
test-user	74618b91a7	feat: detect xAI/Grok EXIF signature; refresh watermarking landscape (v0.5.5) xAI Grok (Aurora) images carry no C2PA/SynthID/IPTC -- their only provenance signal is an EXIF pair: ImageDescription "Signature: <base64>" + a UUID Artist. Verified stable across 3 genuine generations (a real download previously read as unknown / "no AI metadata"). - metadata.xai_signature(): matches the Signature blob + UUID Artist pair; wired into has_ai_metadata, get_ai_metadata, and identify (platform "xAI (Grok / Aurora)"). - data/samples/grok-1.jpg: real Grok fixture (neutral content; the Artist UUID is the public image id, not PII). - Tests: synthetic-fixture unit tests, real-sample assertion, identify integration (322 passing). Docs (research refresh, May 2026): - C2PA 2.4 Durable Content Credentials (soft-binding re-discovery after the embedded manifest is stripped). - New AI-labeling laws, primary-source verified: EU AI Act Art 50 (2026-08-02), South Korea AI Framework Act Art 31(3), California AB 853. - Hedge removal claims: defeating the SynthID verifier is not forensic invisibility (arXiv:2605.09203); cite SynthID-Image (arXiv:2510.09263). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> v0.5.5	2026-05-26 14:14:35 -07:00
test-user	5e29c69e7b	README: lead with hosted raiw.cc CTA, honest free/paid split Move the raiw.cc call-to-action above the sponsor ask and drop the misleading "free web service" framing: visible-watermark and metadata removal are free, invisible removal runs on paid cloud GPUs. Also point no-GPU users to the hosted service from the invisible-removal feature bullet, where the GPU requirement is stated. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-26 12:47:28 -07:00
test-user	8ed4a754ff	Add GitHub Sponsors donation button (FUNDING.yml + README badge) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 16:09:52 -07:00
test-user	03fb460f77	Track the labeled SynthID corpus; complete metadata-source test coverage Corpus images were gitignored (local-only). The negatives were reviewed and cleared for publishing, so the labeled set is now committed (regular git, 65 MB across 25 files) -- making the removal regression set reproducible and CI-able. Corpus: - Track data/synthid_corpus/images/ (pos 9, neg 15, cleaned 1); keep only the synthetic refs/ calibration fills gitignored. - Reconcile manifest.csv to the on-disk files: 117 -> 25 rows (92 dangling rows for removed images pruned; dedup left one cleaned output, f6dd47a5). - Rewrite the corpus README layout/policy (images committed; review every image for private content before adding -- public repo, permanent history). Test fixtures: - Remove data/samples/not-ai-1/2/3 (personal iPhone photos, incl. GPS EXIF). - Add the clean_photo conftest fixture serving a verified-negative image from the corpus neg/ set; repoint the three "non-AI / clean photo" tests onto it (skips if the corpus is absent). Metadata-source coverage (close the last sub-variant gaps): - c2pa digitalSourceType: algorithmicMedia (procedural, not flagged AI) and compositeWithTrainedAlgorithmicMedia (AI + SynthID proxy). - exif_generator: EXIF Artist and ImageDescription fields (Software/Make/XMP CreatorTool were already covered). All 8 metadata-source kinds are now tested at both the unit and identify() level. 313 tests pass. CLAUDE.md updated (corpus tracked, clean_photo fixture). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 14:46:47 -07:00
test-user	3ebdee57b8	Test the untested pure logic: MPS fallback, tiling, isobmff/c2pa edges Coverage audit (pytest --cov) found real, non-model logic at 0%/low cover. Add unit tests that need no model download: - img2img_runner.py 0% -> 100%: the MPS->CPU fallback orchestration, mocked via injected load_pipeline/reload_on_cpu callables. Guards the production behavior hit this session (native-res SDXL OOMs on MPS, must retry on CPU; non-MPS errors must propagate; "mps"-worded error on a cpu device must not reload). - ctrlregen/tiling.py 0% -> 40%: the pure tile math (tile_positions, make_blend_weight, resize_center_crop) that decides how large images are split and blended. (run_tiled stays model-bound, untested.) - isobmff.py 93% -> 100%: size==0 (box-to-EOF) and truncated 64-bit largesize parsing branches for AVIF/HEIF/JXL C2PA stripping. - c2pa.py: non-PNG-signed .png reads as clean (has_c2pa_metadata / extract_c2pa_chunk) instead of mis-parsing. 309 tests pass (+23). Document in CLAUDE.md that these pure helpers are unit-tested without downloads so future sessions don't skip them as "ML". No src/ change, no release. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 14:21:32 -07:00
test-user	d24d8a4b14	Extract _target_size helper + regression-test native resolution (v0.5.4) The native-vs-downscale decision in InvisibleEngine.remove_watermark (the issue #10/#15 fix: max_resolution=0 must not pre-downscale, since any downscale both loses quality and lets SynthID survive) had no test. Extract it into a pure helper invisible_engine._target_size(w, h, max_resolution) and cover it with tests/test_invisible_engine.py::TestTargetSize so a re-introduced forced downscale fails CI instead of silently regressing #15. Also: - Clamp the short side to >=1 in _target_size: extreme aspect ratios (e.g. 5000x3 with --max-resolution 1024) truncated it to 0 and crashed image.resize(). Pre-existing in the inline math; fixed now that it is a named, tested function. - Consolidate the two duplicated temp-file save blocks into one unconditional save (behavior unchanged: the EXIF-transposed image is still always persisted before WatermarkRemover reloads it by path), and drop the now-redundant `_tmp_path is not None` guard in finally. - Bump version 0.5.3 -> 0.5.4 (pyproject, __init__, uv.lock); document the helper as the regression guard in CLAUDE.md. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> v0.5.4	2026-05-25 14:09:33 -07:00
test-user	28fe13db8f	Document native-res MPS OOM -> CPU-fallback behavior in limitations Concrete data point from the 2026-05-25 gpt-image SDXL run: native 1254x1254 fp32 OOMs at the UNet step (not just VAE) on a 20 GB MPS ceiling, and img2img_runner auto-falls back to CPU and completes (slow, weight-identical, still defeats SynthID). enable_vae_tiling() alone does not prevent it. Fast Mac workarounds: fp16 on MPS or --max-resolution; neither is the default. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 13:57:13 -07:00
test-user	59d72c5db7	Record verified gpt-image-2 SynthID-cleaned chain in corpus Add manifest row for the 4ef377bd -> f6dd47a5 chain: a gpt-image-2 sample (openai.com/verify: SynthID + C2PA detected) cleaned via v0.5.3 `all` at native 1254x1254 (prod-equivalent SDXL base, strength 0.05, 50 steps). openai.com/verify reports SynthID NOT detected after the run, re-confirming that the #10 native-resolution default defeats OpenAI SynthID and resolving the #15 root cause (older SD-1.5/768px downscale default did not). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 13:55:10 -07:00
test-user	e27f24f520	test(samples): commit real Doubao fixture + AIGC real-sample test data/samples/doubao-1.png is the real #13 sample: carries the China TC260 <TC260:AIGC> XMP label and a visible '豆包AI生成' text mark (bottom-right). Grounds the AIGC detection on a real file (alongside the synthetic tests) and serves as the fixture for visible-watermark removal work. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 12:37:15 -07:00
test-user	1afc1e60ef	test(samples): add real Doubao TC260 AIGC reference sample 2048x2048 PNG carrying China's TC260 <TC260:AIGC> label; identify reports it as a China AIGC-labeled generator (TC260). Reference fixture for manual re-verification of the TC260 detection path -- the automated tests use synthetic blobs, so nothing depends on this file being present. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 12:36:28 -07:00
test-user	d45f0806a0	chore(release): v0.5.3 — detect China TC260 AIGC label (Doubao) - feat(identify): detect the China TC260 <TC260:AIGC> XMP label (Doubao and other China-served generators); reports platform + ContentProducer. Removal already strips it via the existing metadata cleaner. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> v0.5.3	2026-05-25 12:30:40 -07:00
test-user	c7f0d71f90	feat(identify): detect China TC260 AIGC label (Doubao et al.) China-served generators embed an XMP <TC260:AIGC>{"Label":"1",...} block (China's mandatory AI-content labeling, TC260 standard). Doubao (ByteDance) uses it -- verified on the real #13 sample. It's none of C2PA / SynthID / imwatermark / IPTC, so identify() previously returned unknown. - metadata: AIGC_MARKERS + aigc_label() (json-decodes the HTML-entity-encoded block); has_ai_metadata + get_ai_metadata now surface it. - identify: new 'aigc' signal -> is_ai True, platform 'China AIGC-labeled generator (TC260; e.g. Doubao)', carries the ContentProducer code. - Container-agnostic raw-byte scan, so it covers the whole China-AIGC ecosystem (Jimeng/Kling/Qwen/Ernie share the standard). - Tests: synthetic TC260 block (metadata + identify). Docs updated. Addresses #13. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 12:29:51 -07:00
test-user	768d997ef0	docs: scope SynthID provenance claims to source-verified facts Threat model: replace the unverified deployment list (Gemini 3 Pro / Nano Banana Pro / Imagen 4 / Veo) with the source-verified scope -- SynthID across Imagen / Veo / Lyria plus Gemini app outputs (>10B items by Dec 2025), and attribute the 136-bit payload to the paper's SynthID-O variant. openai-images-2 sample: note the file predates the 19 May 2026 SynthID rollout across ChatGPT / Codex / API, and that openai.com/verify is now the public oracle (still no local decoder). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 12:18:13 -07:00
test-user	37769453a9	chore(release): v0.5.2 — native-resolution invisible removal (fixes #10 ) - fix(invisible): process at native resolution by default; the forced downscale-to-1024 -> upscale-back round-trip was the main quality loss (#10). Matches the raiw.cc backend (fal fast-sdxl = sdxl-base-1.0). New --max-resolution opt-in cap for GPU/MPS memory. - docs: verified fal checkpoint, native-res, gpt-image-2 SynthID. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> v0.5.2	2026-05-25 10:00:25 -07:00
test-user	f3ecea348a	docs: gpt-image-2 carries SynthID (no local decoder; openai.com/verify oracle) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 09:59:16 -07:00
test-user	fb42295b3a	docs: record verified fal fast-sdxl checkpoint + native-resolution updates - fal's llms.txt confirms fast-sdxl is stabilityai/stable-diffusion-xl-base-1.0, the exact checkpoint the local CLI defaults to -> local == prod weights. Recorded in CLAUDE.md and README. - README How it works + sample README: replace the old downscale->upscale description with native-resolution processing (matches the #10 fix); document --max-resolution as an opt-in OOM cap. - README roadmap: idna already bumped (uv-secure clean). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 09:57:03 -07:00
test-user	18740969ae	fix(invisible): process at native resolution by default The invisible pipeline force-downscaled inputs >1024px to 1024 before diffusion, then upscaled the result back -- a lossy round-trip that was the main cause of the quality loss reported in #10. The hosted raiw.cc backend (fal fast-sdxl) does no pre-downscale, and at strength ~0.05 SDXL img2img doesn't need it. Default is now native resolution (max_resolution=0). New --max-resolution flag (invisible / all / batch) re-introduces an opt-in long-side cap only to bound GPU/MPS memory on very large inputs. Addresses #10. End-to-end quality/removal not re-verified locally (no GPU here); matches raiw-app's proven production config. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 09:50:06 -07:00
test-user	93c664f7fb	docs: sync README + corpus map with v0.5.x detection coverage - README Features: add the identify / provenance-detection capability. - README Supported models: add FLUX, Stability AI, Microsoft/Bing (MAI-Image), Meta AI rows; note SD/SDXL/FLUX imwatermark is locally detectable; add a detection note pointing at identify. - corpus README per-platform map: add Stability / Ideogram / Recraft / Krea-FLUX rows + an imwatermark column; correct Bing (MAI-Image, signs 'Microsoft'); note imwatermark fires only on pristine pipeline output, not re-hosts/exports. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 09:32:03 -07:00
test-user	20a754ef98	chore(release): v0.5.1 — security + bug fixes - security: bump idna 3.11 -> 3.16 (GHSA-65pc-fj4g-8rjx) - fix(ctrlregen): correct module import paths (#11, @neosun100) - fix(cli): preserve alpha channel through visible/all/batch (#8, @rlorenzo) - fix(cli): safer re-exec via -m instead of repr(sys.argv) -c string (#9, @eskibars) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> v0.5.1	2026-05-25 09:21:53 -07:00
test-user	e60f183d29	style(ctrlregen): sort imports (follow-up to #11 ) #11 left the import block un-sorted (ruff I001); reorder so diffusers precedes the local ctrlregen import. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 09:21:53 -07:00
Rex Lorenzo	d091b9f822	fix(cli): preserve alpha channel in visible-watermark pipeline `cv2.imread(..., IMREAD_COLOR)` was silently stripping the alpha channel on RGBA inputs, and `cv2.imwrite` then wrote opaque 3-channel PNGs — so images with transparent backgrounds came back with an opaque-black (or white) background and the sparkle area baked in as a solid blob. Read the source with `IMREAD_UNCHANGED`, keep the alpha plane out of the detection/inpaint path (those still operate on BGR), and rejoin alpha at save time. The detected watermark bbox is also zeroed in the alpha plane so the sparkle region becomes transparent rather than an opaque artifact. Applies to `visible`, `all`, and `batch` modes. RGB-only inputs and JPEG outputs are unaffected.	2026-05-25 09:18:39 -07:00
test-user	e8d698814a	fix(cli): re-exec via -m instead of a repr(sys.argv) -c string Based on #9 by @eskibars. Replaces the os.execl(..., "-c", repr-string) restart (used after the CUDA-torch auto-install) with os.execv -m, so we no longer build an exec string from repr(sys.argv). Forwards sys.argv[1:] only: under -m Python sets argv[0] to the module path, so passing the full argv would re-inject the program name as a spurious Click argument. Verified: python -m remove_ai_watermarks.cli --version works; test_cli green. Closes #9 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 09:15:55 -07:00
Neo 孫	663c8c64ca	fix(ctrlregen): correct module import paths (#11 ) The CtrlRegen engine module references a non-existent top-level package 'ctrlregen' in four locations: src/remove_ai_watermarks/noai/ctrlregen/engine.py L39: from ctrlregen.pipeline import CustomCtrlRegenPipeline L57: from ctrlregen.color import color_match L242: from ctrlregen.tiling import resize_center_crop, run_tiled L267: from ctrlregen.tiling import resize_center_crop These should be absolute imports of the package's own subpackage. As a result, the top-level try/except sets _HAS_DIFFUSERS=False and _HAS_COLOR_MATCHER=False even when the [gpu] extra is correctly installed, and is_ctrlregen_available() always returns False. Effect on users: invoking the ctrlregen profile crashes with ImportError: Failed to auto-install missing dependencies: controlnet-aux, color-matcher, safetensors regardless of whether those packages are installed. The auto-install fallback also fails in uv-managed venvs (uv does not ship pip in the venv by default), so the error path is unrecoverable. Reproduction (before fix): uv sync --all-extras uv run remove-ai-watermarks invisible <image> --pipeline ctrlregen # → ImportError as above Fix: change the four imports to use the package-qualified path (matching the absolute-import style used elsewhere in the codebase, e.g. watermark_remover.py). Verified post-fix on Linux/CUDA (NVIDIA L40S): - is_ctrlregen_available() returns True - CtrlRegen pipeline loads, downloads weights, and runs end-to-end - Tile-based path (image > 512px) processes 6 tiles cleanly - 142 existing pytest tests still pass	2026-05-25 09:11:20 -07:00
test-user	b45e2a5731	chore(deps): bump idna 3.11 -> 3.16 (GHSA-65pc-fj4g-8rjx) Fixes the uv-secure abort that stopped maintain.sh: idna 3.11 had GHSA-65pc-fj4g-8rjx (fix in 3.15). uv lock --upgrade-package idna pulls 3.16; uv-secure now reports no vulnerabilities. Lock-only change, 266 tests still pass. Updates the stale CLAUDE.md note. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 09:04:24 -07:00
test-user	0762807e42	chore(release): v0.5.0 — image provenance (identify) + SD/SDXL/FLUX + EXIF/XMP detection New since v0.4.1: - identify command: aggregate C2PA, IPTC, SD/ComfyUI params, SynthID proxy, visible sparkle, open invisible watermark into one provenance verdict (--json, --no-visible). - Open SD/SDXL/FLUX invisible-watermark detection (imwatermark, extra: detect). - EXIF Software / XMP CreatorTool generator-tag reading (incl. AVIF/HEIF). - Stability AI + Microsoft/Bing C2PA issuers; SynthID metadata detection. - SynthID reference corpus + experimental pixel-carrier probe. - Fix: __version__ was stuck at 0.3.4 (banner mismatch), now synced. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> v0.5.0	2026-05-25 08:58:45 -07:00
test-user	626f43aec9	docs: correct SynthID spectral-carrier understanding Deeper re-examination (2026-05-25) of github.com/aloshdenny/reverse-SynthID on our own data corrects the earlier over-stated dead-end: - The carrier IS real on solid fills -- measured via per-bin PHASE COHERENCE (the prior probe used spatial/FFT-magnitude NCC, which can't see a fixed-phase carrier). White gemini-2.5-flash fills: coherence 0.86 at carriers (0,+/-7..12,20..23) vs 0.31 random; single-image phase-match +0.83 vs -0.24 for real photos. - But it does not generalize: carriers are model-version/resolution/color specific (v4 codebook for 3.1-flash/nb-pro scores ~0.5 on 2.5-flash), and collapse on real content (coherence ~random; v4 content 0.518 vs neg 0.504, no separation). Net: a controlled-fill characterizer, not a real-content detector. Metadata proxy + visible sparkle + online oracles remain the ceiling. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 08:57:35 -07:00
test-user	ede35a3db5	feat(metadata): read EXIF Make tag; collect Ideogram/Recraft/Krea-FLUX Collected live samples from three popular generators we lacked: - Ideogram tags its downloads with EXIF Make="Ideogram AI" (no C2PA, no SynthID, no imwatermark) -- the Make tag is its only signal. exif_generator only read Software/Artist/ImageDescription, so it missed this; now reads Make too. Real cameras put "Apple"/"Canon" in Make (no AI token), so this stays low-false-positive. 4 originals ingested. - Recraft (PNG export) and Krea hosting FLUX 2: downloads carry NO detectable signal -- no C2PA/EXIF/IPTC, and notably no imwatermark despite Krea running FLUX. identify correctly reports 'unknown'. Both ingested as neg fixtures. Lesson recorded in CLAUDE.md: the imwatermark detector fires only on pristine output from a pipeline that runs the encoder (diffusers default, official BFL), not from re-hosts (Krea/Stability) or re-encoded exports (Recraft/Canva). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-24 18:38:56 -07:00
test-user	ad3b8ee248	feat(identify): read EXIF Software / XMP CreatorTool generator tags Closes the documented gap where EXIF/XMP fields inside AVIF/HEIF/JXL went unparsed. metadata.exif_generator extracts the EXIF Software/Artist tag (via PIL+piexif, which opens AVIF natively) and the XMP CreatorTool (via a container-agnostic raw-byte scan that also covers HEIF/JXL that PIL can't open), and matches against AI_GENERATOR_TOKENS so only generator names (Firefly, DALL-E, Midjourney, ComfyUI, ...) fire -- a plain 'Adobe Photoshop' or 'GIMP' tag is not flagged. identify() surfaces it as a high-confidence signal and uses it for platform attribution when no C2PA names a platform, so an AVIF/HEIF whose only AI signal is an EXIF/XMP generator tag is now caught. Validated with synthesized fixtures (the 'no positive fixtures' blocker was self-imposed): real AVIF and JPEG written with EXIF Software via PIL, plus an XMP CreatorTool raw-scan fixture. Zero false positives across the 109-image corpus (real iPhone photos carry no AI generator token). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-24 17:56:39 -07:00
test-user	3a1c5427c8	feat(c2pa): recognize Stability AI issuer; fix Microsoft platform label Collected live C2PA positives from Bing Image Creator and Stability Brand Studio (DreamStudio successor) and learned two things our scan got wrong: - Bing now runs Microsoft's own MAI-Image model, not DALL-E, and signs C2PA as 'Microsoft'. The scan caught it, but the platform label claimed 'Microsoft Designer (DALL-E / OpenAI backend)'. Relabeled model-neutral: 'Microsoft (Bing Image Creator / Designer)'. - Stability signs C2PA as 'Stability AI' (cert 'Stability AI Ltd'), which was not in C2PA_ISSUERS, so it read as 'unknown signer'. Added the issuer and a platform mapping. Stability uses no SynthID and (on its current Stable Image model) no imwatermark watermark -- verified, both negative. Both ingested as SynthID-negative corpus fixtures (they are AI but not SynthID) for issuer-coverage. Canva skipped: its downloads are re-encoded design exports that strip C2PA, so a Canva sample would be inconclusive. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-24 17:12:42 -07:00
test-user	27ad5b7645	feat(identify): detect open SD/SDXL/FLUX invisible watermark Research found one locally-fillable detection gap: Stable Diffusion, SDXL, and FLUX all embed an open DWT-DCT watermark via the invisible-watermark (imwatermark) library -- a PUBLIC decoder, no secret key, unlike SynthID. New invisible_watermark.py decodes the known fixed patterns (verified against upstream source: diffusers SDXL WATERMARK_MESSAGE, FLUX.2 src/flux2/watermark.py, and the 'StableDiffusionV1' default string) and identify() reports the scheme as a high-confidence signal. Verified locally end-to-end: embedding SDXL's exact 48-bit message and decoding it back recovers 48/48 bits; a clean image and our own fal-SDXL outputs decode to ~21/48 (no match). Caveat baked into the report: the watermark is fragile -- gone after JPEG q90 -- so it confirms origin only on pristine files; absence is never proof. imwatermark is an optional dep (extra 'detect'; pulls non-headless opencv), so the import is guarded and the signal is skipped when absent. CLI --no-visible now means metadata-only (skips both pixel-domain detectors). Also records the broader watermarking landscape in CLAUDE.md: which services are locally detectable (SD/SDXL/FLUX), C2PA-covered (Bing/Canva/ Getty/Shutterstock unsampled), or proprietary-only like SynthID (Amazon Titan/Nova, Kakao). Midjourney embeds neither C2PA nor an invisible mark. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-24 16:53:59 -07:00
test-user	7dcc922617	feat(probe): solid-fill SynthID carrier probe; corpus reconfirms no pixel detector scripts/synthid_pixel_probe.py is an experimental/diagnostic tool for the one pixel-domain question that isn't a dead-end: on solid-color fills the zero-mean residual IS essentially the watermark carrier. Two modes: 'consistency' (mean pairwise NCC of carriers across fills vs random baseline) and 'removal' (does the pipeline drop the carrier toward baseline?). Logic validated synthetically (injected carrier correlates, random noise doesn't, simulated removal collapses it) -- no real fills or GPU needed. Running its metric on the corpus independently re-confirms the documented dead-end for real content: at matched resolution SynthID positives do not cluster apart from negatives (within-Gemini 0.07; at 1024 px pos-vs-neg >= pos-vs-pos). An apparent 0.62 among 1254px ChatGPT positives turned out to be near-duplicate content (5 renders of one prompt at ~0.92; a distinct ChatGPT image scored ~0 against them), not a shared carrier. The probe is solid-fills-only; do not use on real content. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-24 16:35:39 -07:00
test-user	144b98cf0b	docs: record external AI-detector models as out of scope Generic HuggingFace AI-vs-real classifiers are per-generator, degrade off-distribution, are untested on the metadata-stripped surfaces we care about (gpt-image, Gemini Nano Banana), and our own SDXL pass would likely defeat them as it does SynthID. Detection stays local + signal-based. Decision 2026-05-24. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-24 16:27:00 -07:00
test-user	1a9f3e4fe5	test(identify): cover provenance branches, CLI, sparkle helper Adds 20 tests around the new provenance path: - identify(): local SD/ComfyUI params -> local-pipeline attribution; visible-sparkle gating at the 0.5 threshold (mocked detector: above, below, unavailable, opt-out); metadata verdict not downgraded by a sparkle hit; OpenAI/SynthID caveats + dedup; ProvenanceReport is JSON-serializable (the CLI --json path); and the honest edge where a C2PA manifest without an AI source marker stays 'unknown'. - CLI 'identify': help, clean PNG, AI PNG platform, valid --json, missing file. - gemini_engine.detect_sparkle_confidence: float in range for a real image, None for an unreadable file. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-24 16:27:00 -07:00

1 2

100 Commits