Shadowbroker

mirror of https://github.com/BigBodyCobain/Shadowbroker.git synced 2026-06-23 06:20:00 +02:00

Author	SHA1	Message	Date
BigBodyCobain	5ede669a12	Ship ShadowBroker v0.9.83 with live Infonet gate messaging and DM protocols. Gate hashchain replication, Tor/SOCKS transport hardening, terminal session teardown, v0.9.83 UI/changelog, and release digest pins for seamless updater verification.	2026-06-15 15:37:29 -06:00
BigBodyCobain	7151563a41	Fix Extra-participant E2E: live contact send and Tor prekey cache path. Adds connect-contact HTTP endpoint with cached-bundle support, subprocess contact send via docker cp bundle file, and direct Tor prekey fetch to avoid wedging single-worker uvicorn. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-14 17:00:13 -06:00
BigBodyCobain	f1cd9eb4b9	Pass Tor E2E shared DM flow and harden mesh relay for fleet participants. MLS export/reset and accept use live HTTP so uvicorn privacy-core state stays consistent; relay persistence and sender_seal fixes enable invite-accept-shared decrypt across onion peers. Adds participant/e2e compose overlays and harness recovery with optional Tor-only replicate mode. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-14 11:26:57 -06:00
BigBodyCobain	c266c5ff5e	Close v1 swarm: fresh-participant smoke test, join retries, README fleet note. Retry announce/manifest while Tor circuits warm on NODE and startup bootstrap. Add verify_swarm_fresh_participant.py for empty-volume GHCR smoke tests. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-12 03:09:02 -06:00
BigBodyCobain	89d6bb8fb9	Ship DM connect delivery, fleet pubkey lookup, OpenClaw Infonet agent, and relay auto-wormhole. Auto-relay connect DMs with End Contact severing, signed fleet prekey lookup, OpenClaw private Infonet channel intents, headless relay Tor bootstrap on redeploy, and swarm/DM live verification scripts. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-12 02:15:56 -06:00
BigBodyCobain	df76f6f147	Enable zero-config Infonet fleet join for all participant nodes. Ship sb-testnet fleet defaults, swarm/join API, NODE launcher registration step, and meshnode script defaults so users discover peers via the signed seed manifest without manual peer lists. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-11 10:25:48 -06:00
BigBodyCobain	776c89bfcf	Add private Infonet swarm discovery and gate propagation. Signed peer manifest pull/announce on the seed, immediate hashchain push for gate messages, seed-only Docker defaults, and stale-genesis sync diagnostics. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-11 03:15:25 -06:00
BigBodyCobain	e78e4d186d	Ship Meshtastic Chat UX, embedded Infonet/SHELL panels, and Docker dev polish. Rename Mesh Chat to Meshtastic Chat, embed the Infonet terminal with Arti/Tor warmup, improve the agent shell PTY (git in the backend image, operator PATH), and add docker-compose.override for local image builds. Gitignore Hermes Agent runtime installs. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-11 00:55:38 -06:00
BigBodyCobain	d1e1be4016	Replace mock Agent Shell overlay with inline xterm PTY and dock/expand UX. Uses a local-operator WebSocket bash session, keeps the map interactive, and SNAP docks the shell back into Mesh Chat instead of a floating blurred panel. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-10 11:30:50 -06:00
BigBodyCobain	a8fd33a758	Add OpenClaw fast-path routing with playbooks and expensive-command gate. Move intent routing into route_query/ask, short-circuit find_entity fuzzy search, and document the thin three-tool agent surface so Hermes avoids multi-second search_telemetry by default. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-09 21:32:08 -06:00
BigBodyCobain	1583fd5715	Expose new telemetry and recon toolkit to OpenClaw agents. Wire telegram_osint, malware, cyber, and SCM into search/slow-tier helpers; add osint_lookup, entity_expand, and osint_sweep commands; update README and skill docs. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-08 22:44:16 -06:00
BigBodyCobain	af9b3d08cc	feat: Telegram OSINT map layer, Osiris intel ports, and maritime settings Add Telegram OSINT with hourly incremental t.me scraping, metro geocoding separate from news centroids, threat-intercept popup UI with inline media, and HTML markers above alert boxes so pins stay clickable. Expose GFW_API_TOKEN in onboarding and Settings Maritime; harden GFW/CCTV/geo fetchers. Port Osiris- derived recon, SCM, entity graph, malware/cyber feeds, sanctions, and submarine cable layers with tests and documentation. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-08 21:04:08 -06:00
BigBodyCobain	b64b9e0962	Add Sentinel-2 road freight trends with Analyze Here UI. Port DrishX truck-motion detection as an opt-in slow layer: on-demand map-center analysis, preset corridors, layer panel toggle, and Docker road-corridor extras. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-07 23:39:13 -06:00
BigBodyCobain	079ff7b737	Harden production checklist: dedupe live-data routes and align serializers. Pin Mathieu's data-path checklist in docs and PR template, remove dead main.py fast/slow handlers, unify orjson via _live_data_json_bytes, and bound LiveUAMap Playwright defaults. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-06 20:16:18 -06:00
BigBodyCobain	bd81a940ff	Follow up on #375 review: dedupe live-data route and harden serializers. Align full /api/live-data with slow-tier orjson options, remove dead main.py duplicate, cap slow batches to pool size, cancel queued work on timeout, and stop retrying HTTP 4xx/5xx. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-06 20:10:59 -06:00
BigBodyCobain	9a0a9a116a	Address #375 production-readiness: dev bind, live-data lock, heavy fetch pool. Default python main.py to loopback, deep-copy dashboard snapshots outside the store lock with ETag on full live-data, and route GDELT/LiveUAMap/CCTV/slow-tier work through an isolated executor so Playwright jobs cannot starve fast-tier workers. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-04 17:29:04 -06:00
BigBodyCobain	80a01275ff	Add MKT opt-in on threat intercept, jittered market fetches, and Sentinel multi-scene dossier. Operators enable Polymarket/Kalshi correlation from Global Threat Intercept with a consent dialog; polls use a jittered schedule separate from the slow tier. Right-click Sentinel imagery returns up to three signed scenes again. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-04 09:01:21 -06:00
BigBodyCobain	3ac8442e4b	fix(uap): weekly live NUFORC refresh with 7-day cache for operators Each install pulls ~60-day sightings from nuforc.org every Monday; disk cache matches weekly cadence so users keep current pins between restarts. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-03 18:41:28 -06:00
BigBodyCobain	5f322b0a79	fix(uap): enforce 60-day window, refresh daily, live NUFORC on Windows Filter stale rows out of nuforc_recent_sightings.json on load; add requests-based live scrape when curl is disabled; daily scheduler rebuild instead of weekly-only. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-03 18:27:30 -06:00
BigBodyCobain	363b5a49c8	Close tg12 outbound audit (#348-#366): operator UA, opt-ins, docs - User-Agent is per-install handle only (no Shadowbroker product token) - LiveUAMap: Windows UI consent when enabling Global Incidents; env override - Meshtastic callsign upstream header off by default (opt-in true) - Expanded docs/OUTBOUND_DATA.md and README link for CCTV, basemap, Broadcastify Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-03 15:01:32 -06:00
BigBodyCobain	a3e5c98cd0	test(cctv): Madrid KML HTTPS-first fallback; clarify KiwiSDR #364 docs Adds unit coverage for MadridCityIngestor catalog fetch order. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-03 14:33:01 -06:00
BigBodyCobain	6a098e1c5f	Pin DeepState mirror, prefer HTTPS for Madrid/KiwiSDR, document outbound data (#362–#364). Operators can set DEEPSTATE_MIRROR_COMMIT for immutable frontline ingest; Madrid KML tries HTTPS then HTTP without changing camera image URLs or proxy Referers. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-03 14:31:31 -06:00
BigBodyCobain	f08781bdc9	Route dossier, geocode, and Wikimedia through the backend (#351 , #352 , #360 ) Proxy region dossier, Sentinel search, Wikipedia, and Wikidata via self-hosted APIs; remove LocateBar client-side Nominatim fallback; migrate legacy shadow- operator handles to operator- prefix. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-02 15:20:44 -06:00
BigBodyCobain	c3dd95f6a9	Address remaining safe security hardening	2026-06-02 13:34:11 -06:00
BigBodyCobain	017f383096	Fix BadHost path handling	2026-05-28 01:24:33 -06:00
Shadowbroker	1d7fa5185a	feat(infonet): private gate + DM hashchain spool with hardened propagation (#326 ) Private gate messages and offline DMs now ride the Infonet hashchain as ciphertext-only events, replicated across nodes via private transports (Tor onion / RNS / loopback) and decrypted only by parties holding the gate or recipient keys. Hashchain core (mesh_hashchain.py) ---------------------------------- * New ``append_private_gate_message`` and ``append_private_dm_message`` append paths with full signature verification, public-key binding, revocation check, and replay protection in a dedicated sequence domain (so a gate post does not consume the author's public broadcast sequence, and a DM cannot replay-block a public message at sequence=1). * Fork validation and full-chain validation now accept the gate signature compatibility variants — older signatures that canonicalize with/without epoch or reply_to still verify, so a re-sync from an older peer doesn't reject still-valid history. * DM hashchain spool: capped at 2 active sealed offline DMs per recipient mailbox, plus a per-(sender, recipient) cap so one prolific sender can't consume both slots. 1-hour TTL on the cap counter. Spool intentionally small — it's an offline bootstrap channel, not a persistent mailbox. * Rebuild-state preserves the gate sequence domain across reloads so a chain reload doesn't accidentally let an old gate sequence replay-collide on next append. Schema enforcement (mesh_schema.py) ----------------------------------- * Private gate + DM payloads have closed allowlists of fields. Plaintext keys (``message``, ``plaintext``, ``_local_plaintext``, ``_local_reply_to``) are explicit rejection-bait — they raise before the event ever touches the chain. * DM ciphertext + nonce must look like base64-ish sealed bytes; obvious base64-encoded plaintext shapes are rejected. * ``transport_lock`` required: DM hashchain spool requires ``private_strong``; gate accepts ``private``/``private_strong``/ ``rns``/``onion``. Defense-in-depth at the network layer (main.py + mesh_public.py) ---------------------------------------------------------------- * ``_infonet_sync_response_events`` now silently redacts private events (gate_message + dm_message) unless the request looks like a loopback / onion / RNS / private transport caller. If an operator accidentally exposes :8000 to the public internet, an external puller gets public events only — never ciphertext. * ``_sync_from_peer`` raises ``PeerSyncRateLimited`` for 429 (handled as 4-tuple return with retry_after_s) and ``PeerSyncHTTPError`` for other non-200 statuses (handled by ``_run_public_sync_cycle`` to honor server cooldown hints even outside the 429 path). DM relay hydration (main.py) ----------------------------- * New ``_hydrate_dm_relay_from_chain``: when accepted dm_message chain events arrive on a node, they get deposited into the local DM relay store with a deterministic sender_token_hash so re-sync of the same event is idempotent. Recipients see the ciphertext as a normal DM on their next poll and decrypt with their existing recipient key. Other surfaces -------------- * meshnode.bat / meshnode.sh now set ``MESH_INFONET_ALLOW_CLEARNET_SYNC= false`` and the participant runtime flags by default so a freshly spun-up node defaults to private-only sync. * InfonetTerminal/InfonetShell.tsx adds a gate directory renderer for the new private-gate workflow. * docker-compose.relay.yml binds the relay backend to 127.0.0.1:8000 only; Tor's hidden service forwards onion traffic into 127.0.0.1. Public clearnet :8000 stays off the network edge. Tests ----- * 7 new tests in test_private_gate_hashchain.py + test_private_dm_ hashchain.py covering: gate fork accepts ciphertext propagation, gate fork rejects plaintext, append rejects plaintext before normalize, append requires private_strong, append rejects non-sealed ciphertext shape, DM spool 2-per-recipient + 1-per-pair cap, DM hydration delivers to poll/claim. * Updated test_mesh_node_bootstrap_runtime.py covers 429 backoff via PeerSyncRateLimited 4-tuple AND PeerSyncHTTPError exception. * Updated test_s14b_public_sync_gate_filter.py + test_s9b_gate_store_ hydration.py + test_gate_write_cutover.py cover the new private redaction on public sync responses. * test_private_gate_hashchain.py + test_private_dm_hashchain.py: 10 passed locally. * Combined mesh-relevant suite (the 5 modified existing tests + 2 new): 17 passed. Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-24 21:25:18 -06:00
BigBodyCobain	03b8053617	feat(flights): cumulative fuel burned + CO2 emitted per flight Pre-fix the emissions tooltip only showed the per-hour rate — what most users actually want is the cumulative amount burned. This adds running totals computed by multiplying the model-based rate by the elapsed observation time since we first saw the airframe. New module ``flight_observations.py``: * Tracks first_seen_at + last_seen_at per icao24 hex. * Re-opens a fresh session when an aircraft is unseen for > 15 min (treated as a new flight — landed and took off, or transited a dead zone). Prevents the cumulative counter from resetting mid-flight if the trail-rendering cache prunes the trail. * Clamps elapsed time to 24h max so clock skew can't produce comically large numbers. * Pruned every 5 min via a new scheduler job (mirrors ais_prune cadence). flights.py + military.py emission enrichment now also attaches: * observed_seconds — how long we've been tracking this airframe. * fuel_gallons_burned — rate * elapsed_h. * co2_kg_emitted — rate * elapsed_h. The existing per-hour rate fields stay in the dict for backward compat and are shown as small secondary context in the tooltip. Frontend EmissionsEstimateBlock (NewsFeed.tsx) now prominently shows the cumulative totals with the rate as smaller context underneath plus "Observed in flight for Xh Ym". When observed_seconds is 0 (first refresh) it renders "Just observed · totals will appear on next refresh" instead of a misleading "0 gal". 12 backend tests cover record/accumulate/reset, the 24h clamp, prune, case-insensitive key normalization, and end-to-end emission integration in _classify_and_publish. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-23 07:56:23 -06:00
Shadowbroker	20807a2d62	Merge pull request #316 from BigBodyCobain/feat/aishub-fallback feat(ais): AISHub REST fallback when AISStream is offline (20-min polling)	2026-05-23 07:42:56 -06:00
Shadowbroker	79fbf9741b	Merge pull request #314 from BigBodyCobain/feat/ais-upstream-health feat(ais): surface AISStream upstream outage instead of failing silently	2026-05-23 07:12:37 -06:00
BigBodyCobain	a2f5d62926	feat(ais): AISHub REST fallback when AISStream WebSocket is offline When stream.aisstream.io is unreachable (cert outage, server down — see 2026-05-20 and 2026-05-23 events) the ships layer goes empty. This adds a slow REST fallback to data.aishub.net so the layer stays populated in degraded mode. Behavior: * Opt-in via AISHUB_USERNAME (free registration at aishub.net/api). Without the env var the fetcher is a no-op. * Default poll cadence 20 min — well inside their free-tier limits, gives ships time to move enough to look "alive". Configurable via AISHUB_POLL_INTERVAL_MINUTES, clamped to [1, 360]. * Internal gate: skips the poll entirely when the WebSocket primary is currently connected. Stomping fresh live data with 20-min-old REST data would be worse than leaving it alone. * Vessels merge into the shared _vessels dict with source="aishub" so the existing UI / health tooling can attribute the provider. * Live data wins races: if a WebSocket update for the same MMSI lands in the last 1s, we don't overwrite with the slower REST record. Scheduler job runs every AISHUB_POLL_INTERVAL_MINUTES minutes alongside the existing ais_prune job in data_fetcher.py. 24 tests cover gating (no-username, primary-connected), response parsing (success / error / empty / malformed / unexpected shape), record normalization (sentinels, missing fields, range checks, AIS @ padding), poll interval clamping, and end-to-end merge with live-data-wins. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-23 07:00:32 -06:00
BigBodyCobain	5e0b2c037e	feat(ais): surface upstream outage instead of failing silently On 2026-05-23, stream.aisstream.io went fully offline (TCP timeouts on port 443). The backend kept respawning the node WebSocket proxy every few seconds with nothing arriving. From the operator's POV the ships layer silently went empty — no banner, no log surfacing, no way to tell whether it was their config / network / viewport filter / upstream. Backend: * ais_proxy_status() now also returns: - connected (bool): true when a vessel message arrived in last 60s - last_msg_age_seconds (int \| None) - proxy_spawn_count (int): proxy respawns — sustained growth without connected means upstream is dead * /api/health escalates top status to "degraded" when AIS_API_KEY is set but the proxy is currently disconnected. Existing degraded_tls signal preserved. Frontend: * useAisUpstreamHealth hook polls /api/health every 30s, derives the outage state. Defensively only reports outage once spawn_count > 0 so operators who haven't opted in don't see the banner. * AisUpstreamBanner component renders a dismissible amber notice "Ship data temporarily unavailable — AISStream upstream is offline" mounted on the main app shell. 7 backend tests pin the status-shape contract and the /api/health escalation behavior in both with-key and without-key configurations. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-23 06:38:05 -06:00
Shadowbroker	69ef231e5a	Merge pull request #313 from BigBodyCobain/feat/flight-source-attribution feat(flights): stamp source attribution on every flight record	2026-05-23 06:29:31 -06:00
Shadowbroker	7a5f47ca9e	Merge pull request #312 from BigBodyCobain/fix/gps-jamming-thresholds fix(gps-jamming): count nac_p=0 + lower thresholds so layer actually fires	2026-05-23 06:29:20 -06:00
BigBodyCobain	f14d4feb6d	feat(flights): stamp source attribution on every flight record Pre-fix, adsb.lol records (the primary source for most flights) carried no source marker. OpenSky records got is_opensky: True and supplementals got supplemental_source, so any UI inspecting source labels saw OpenSky/airplanes.live records as explicitly tagged and adsb.lol records as "unlabeled" — making it look like adsb.lol wasn't being used at all even though it's the primary source. Changes: * _fetch_adsb_lol_regions stamps source="adsb.lol" on each aircraft before returning, so the tag survives the OpenSky dedupe-by-hex merge. * OpenSky records get source="OpenSky" (alongside is_opensky=True for back-compat). * military fetcher tags source on both adsb.lol and airplanes.live records before they're merged, and propagates source into the military_flights and uavs output dicts. * _classify_and_publish promotes the explicit source field into the published flight dict. Falls back to legacy supplemental_source if source is absent. Final fallback "adsb.lol" preserves prior behavior for any caller synthesizing records without going through a fetcher. 8 new tests cover the published-dict propagation, OpenSky tagging, supplemental fallback, explicit-wins precedence, default behavior, the adsb.lol regional fetcher tagging, and the military output dict. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-23 06:14:39 -06:00
BigBodyCobain	19a8560a80	fix(gps-jamming): count nac_p=0 + lower thresholds so the layer actually fires Three stacked filters meant the gps_jamming layer almost never lit up: 1. nac_p == 0 aircraft were dropped on the theory that "0 = old transponder." That's only half right — modern Mode-S Enhanced Surveillance transponders also fall back to nac_p=0 when they lose GPS lock entirely, which IS the jamming signature we want to catch. Discarding them was discarding the strongest signal. None (no field at all — typical for OpenSky-sourced records) is still skipped because absence-of-data isn't evidence. 2. GPS_JAMMING_MIN_AIRCRAFT was 5 per 1°x1° cell. Jamming hotspots (eastern Med, Russia/Ukraine border, Iran/Iraq) tend to have sparser traffic because pilots avoid them. Lowered to 3. 3. GPS_JAMMING_MIN_RATIO was 0.30. Combined with the (preserved) -1 noise cushion that made the effective bar high. Lowered to 0.20. The 1-aircraft noise cushion is intact so a single quirky transponder still can't flag a zone alone. Also extracted the detector loop into a pure ``detect_gps_jamming_zones()`` function at module scope so it's testable in isolation (was previously inlined inside ``_classify_and_publish``). The public signature accepts threshold overrides for ad-hoc re-tuning without code edits. 16 new tests cover nac_p=0 inclusion, None-skip preservation, MIN_AIRCRAFT lowering, MIN_RATIO lowering, noise cushion preservation, constant pinning, override behavior, lon/lng key compatibility, and robustness to empty/None inputs. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-22 23:40:18 -06:00
BigBodyCobain	0d0e009867	fix(uap): stop HF fallback from serving 3-year-old NUFORC sightings The UAP sightings layer is sourced from a live scrape of nuforc.org with a static Hugging Face CSV mirror (kcimc/NUFORC) as a fallback. The fallback parsed every row, sorted by occurred-desc, and took the top 250 — with no date cutoff. The HF mirror is a third-party snapshot that hasn't been refreshed in years, so the "newest 250" rows it returns are from ~2022-23. When the live path fails (Cloudflare 403, curl disabled on Windows, wdtNonce regex stale, etc.) users see a map full of sightings from 3 years ago, labeled as the "last 60 days" layer. Changes: * HF fallback now applies the same 60-day cutoff the live path uses. Rows outside the window are dropped before take-top-N. If the mirror has nothing inside the window the fallback returns [] (don't serve stale). * When the HF mirror is fully stale a loud ERROR log fires with the count of dropped rows so the operator can tell the mirror's the problem, not a network issue. * When BOTH live AND HF fallback produce 0 rows, fetch_uap_sightings now trips assert_canary("uap_sightings", 0) so the health registry shows the layer as broken instead of "fresh and empty for days." * Scheduler moved from daily 12:00 UTC to weekly Mondays 12:00 UTC. The layer is a rolling 60-day digest; refreshing once a week is enough cadence for human-readable map exploration and keeps nuforc.org load light. 6 new tests cover the cutoff filter, the doomsday-log path, the mixed-age path, the both-paths-empty health failure, the positive fallback path, and the scheduler cadence. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-22 23:27:12 -06:00
BigBodyCobain	31ebcb5cd9	Infonet sync: honor HTTP 429 Retry-After + exponential backoff Fixes the retry-storm that's been keeping the local node 429'd out of the seed peer (the diagnosis we ran earlier in the session). Pre-fix: 1. Sync hits the seed peer, gets HTTP 429 (Too Many Requests) 2. _peer_sync_response stringifies the status into a ValueError 3. _sync_from_peer catches it, error becomes the str() of the exc 4. _run_public_sync_cycle calls finish_sync(error=..., failure_backoff_s=60) 5. next_sync_due_at = now + 60s 6. After 60s, sync runs again, hits same upstream that hasn't reset its rate-limit bucket, 429 again. Loop indefinitely. Net effect: a node that hit one transient 429 would hammer the seed every 60s forever, keeping the bucket full and never recovering. We saw this in the live status dump: consecutive_failures=49, last_sync_ok_at=0, retry storm sustained over the entire uptime. What changed ------------ services/mesh/mesh_infonet_sync_support.py * New typed exception PeerSyncRateLimited carries the parsed Retry-After value out of the HTTP layer instead of stringifying everything into a generic ValueError. * New parse_retry_after_header() handles both RFC 7231 §7.1.3 forms (delay-seconds and HTTP-date). Clamped at 1 hour so a hostile peer can't silence us for days. * New _failure_backoff_seconds() helper computes the next delay as max(exponential, retry_after_s). Schedule with default base=60s, cap=1800s: failure 1 -> 60s (preserves pre-fix for transient blips) failure 2 -> 120s failure 3 -> 240s failure 4 -> 480s failure 5 -> 960s failure 6+ -> 1800s (capped at 30 min) cap_s=0 explicitly disables exponential entirely — operators who want pure-Retry-After behavior have that option. * finish_sync now accepts retry_after_s and failure_backoff_cap_s kwargs. Backward-compatible: existing callers that don't pass retry_after_s get the same first-failure delay as before (the base value), only repeat failures grow. main.py * _peer_sync_response detects 429 specifically, parses the Retry-After header, raises PeerSyncRateLimited(retry_after_s=N). Includes the response body prefix in the message so the operator's last_error finally shows something useful. * _sync_from_peer extended to return (ok, error, forked, retry_after_s) — the 4th tuple element is non-zero only when the upstream sent a parseable Retry-After. Existing call shape preserved: the lone caller in _run_public_sync_cycle was updated in the same commit. * _run_public_sync_cycle forwards retry_after_s into finish_sync. Tests ----- backend/tests/mesh/test_infonet_sync_429_backoff.py — 17 new tests: TestParseRetryAfter (7): - integer seconds form - HTTP-date form (computed as seconds-from-now) - HTTP-date in the past returns 0 - empty / whitespace returns 0 - malformed returns 0 - clamps to 1 hour (hostile-peer cap) - negative returns 0 TestFailureBackoffSeconds (5): - exponential growth schedule pins each level - retry_after wins when larger than exponential - exponential wins when larger than retry_after - cap_s=0 disables exponential entirely - zero inputs return zero TestFinishSyncBackoff (5): - first failure uses base unchanged (pre-fix back-compat) - consecutive_failures actually grow the delay - retry_after honored at low failure count - success resets consecutive_failures - last_error carries the HTTP status / Retry-After detail All 24 existing sync-support / status-gate tests still pass. Other failures in tests/mesh/ are pre-existing on origin/main and unrelated to this change (verified by running the same tests against the user's main worktree without these edits). What the operator sees after this lands + a docker rebuild ---------------------------------------------------------- With the live 429 storm we diagnosed: Pre-fix: consecutive_failures keeps climbing 1/min forever, last_error empty or generic Post-fix: consecutive_failures grows, next_sync_due_at backs off exponentially (max 30 min), last_error explicitly carries "HTTP 429 from <peer> (retry_after=Ns): <body>" so the operator can see what's actually wrong. Once the upstream bucket drains and a sync succeeds, consecutive_failures resets to 0 and the schedule returns to the normal 300s interval.	2026-05-22 22:55:05 -06:00
Shadowbroker	b3fca3dc18	Merge pull request #309 from BigBodyCobain/feat/cross-node-dm-mailbox-replication DM mailbox: per-(sender, recipient) anti-spam cap + replication primitives	2026-05-22 22:43:26 -06:00
BigBodyCobain	401f114e4f	DM mailbox: outbound replication + receiving endpoint Second commit on this branch (first added the per-sender cap + accept_replica primitive). This commit wires the actual cross-node propagation: Outbound (sender side) ---------------------- * New ``DMRelay._replicate_envelope_to_peers_async()`` — fire-and-forget thread that POSTs the envelope to every authenticated relay peer via the same per-peer HMAC pattern gate-message replication uses (#256 ``X-Peer-Url`` + ``X-Peer-HMAC`` headers, ``resolve_peer_key_for_url``). * ``deposit()`` now calls the replication helper after a successful local accept. Per-peer errors are swallowed — slow Tor peers must not block the sender's UX, and the recipient polling from a healthy peer works fine even if some peers are down. * Metrics: dm_replication_push_ok / _rejected / _error. Inbound (receiving side) ------------------------ * New endpoint ``POST /api/mesh/dm/replicate-envelope`` in routers/mesh_peer_sync.py. * Same HMAC auth gate (``_verify_peer_push_hmac``) as the existing infonet/gate peer-push endpoints. Unauthenticated requests get 403. * Body cap of 64 KB (DM envelope is bounded by MESH_DM_MAX_MSG_BYTES). * Calls DMRelay.accept_replica which enforces the per-sender cap as a network rule — hostile sender's relay can hold extras locally but honest peers reject them on inbound replication. End-to-end flow now works ------------------------- 1. Alice's node accepts a deposit to Bob's mailbox (local cap check). 2. Alice's node spawns a background thread that POSTs the envelope to MESH_RELAY_PEERS with per-peer HMAC. 3. Each peer's /api/mesh/dm/replicate-envelope verifies the HMAC and calls accept_replica, which re-enforces the per-sender cap. 4. Bob (offline at the time of send) eventually logs into ANY node in MESH_RELAY_PEERS, his existing pollDmMailboxes pulls from the local mailbox there, finds Alice's envelope, decrypts. Tests ----- backend/tests/test_dm_replicate_envelope_endpoint.py — 4 tests: TestReplicateEndpointAuth: - rejects requests without peer HMAC (403) - rejects requests with WRONG peer HMAC (403) — confirms the HMAC is actually verified, not just present - rejects oversize bodies (>64 KB) with 400/413 TestReplicateEndpointRegistered: - static check that POST /api/mesh/dm/replicate-envelope is registered on app.routes — catches future refactor that drops the router include All 38 backend tests touching the new code paths still pass: test_dm_relay_per_sender_cap.py (14) test_dm_replicate_envelope_endpoint.py (4) test_no_new_duplicate_routes.py (1) — new route is unique test_per_peer_secret_resolver.py (19) — HMAC primitive unaffected What's still ahead (PR-3+) -------------------------- * ack propagation: when recipient pulls a message on node X, peers Y/Z should prune their copies to free the sender's quota network-wide. Without this, the sender's quota frees only on the node the recipient actually polled — other peers still see N pending until TTL expiry. Workable but suboptimal. PR-3 will add a /api/mesh/dm/ack endpoint with the same HMAC pattern. * recipient pull-from-peers: today the recipient's poll only hits their own node's relay. If they log into a peer they didn't deposit with, they need a way to fetch envelopes from other peers in MESH_RELAY_PEERS. Today this works as long as the recipient's current node is one of the peers Alice's node pushed to — which is true in a fully-meshed deployment but not guaranteed for partial meshes. PR-4 if telemetry shows this matters.	2026-05-22 19:23:09 -06:00
BigBodyCobain	79b39e8985	DM mailbox: per-(sender, recipient) anti-spam cap + replication primitives Foundation work for cross-node DM mailbox replication. Adds the network rule that makes the replication safe to ship next, plus the primitives the outbound replication PR will call. The rule -------- A single sender can have at most N UNACKED messages parked in a single recipient's mailbox at any one time. Default N=2, tunable via ``MESH_DM_PENDING_PER_SENDER_LIMIT``. Once the recipient pulls (acks) a message, the sender's quota for that (sender, recipient) pair frees up. Network rule, not local rule ---------------------------- The cap is enforced TWICE: 1. ``DMRelay.deposit(...)`` — local check on the sender's own node. Refuses to spool the (N+1)th message before it can be replicated. 2. ``DMRelay.accept_replica(...)`` — replication-acceptance check on every receiving peer. Refuses to accept an inbound replica that would put the local mailbox over the cap. The second half is what makes the rule a NETWORK rule. A hostile sender could patch out the deposit check on their own relay and continue to spool extras locally — but those extras can never propagate, because every honest peer enforces the same cap on the way in. A recipient who polls from honest peers therefore never sees more than N pending from any one sender, regardless of how many spam attempts the hostile sender's relay accepted. New API surface on ``DMRelay`` ------------------------------ _per_sender_pending_limit() — reads MESH_DM_PENDING_PER_SENDER_LIMIT _per_sender_pending_count(...) — counts unacked from a sender for a mailbox accept_replica(envelope=...) — peer-push receive entry point envelope_for_replication(...) — helper to extract a wire-form envelope ``accept_replica`` is idempotent on duplicate ``msg_id`` (replication round-trips and multi-path delivery don't double-spool). ``envelope_for_replication`` exposes the exact shape ``accept_replica`` expects, so the follow-up PR (outbound replication wiring) just has to fetch the envelope and POST it to authenticated peer URLs with the existing per-peer HMAC pattern from #256. Why this is PR-1 of two ----------------------- The full cross-node mailbox replication needs three pieces: A. cap enforcement on deposit (in this PR) B. cap enforcement on replica acceptance (in this PR) C. outbound: push envelope to MESH_RELAY_PEERS after deposit (NEXT PR) (A) + (B) shipped together close the cap-bypass attack surface BEFORE (C) introduces the actual cross-node propagation. Shipping them in the other order would briefly let extras propagate during the window between "outbound push lands" and "accept_replica cap lands." Tests ----- backend/tests/test_dm_relay_per_sender_cap.py — 14 tests: TestDepositCap: - first 2 deposits succeed (UX baseline) - 3rd from same sender rejected with friendly message - different senders have independent quotas - different recipients have independent quotas - ack frees the quota (after recipient pulls, sender can deposit again) - cap is env-tunable TestAcceptReplicaCap: - replica accepted under cap - idempotent on duplicate msg_id (no double-spool, no rejection) - rejected at cap with structured ``cap_violation`` marker so sender's relay can stop retrying - per-sender, not per-mailbox: different sender_block_ref passes even when another sender at the same mailbox is capped - malformed envelope shapes rejected without crash TestEnvelopeForReplication: - returns the envelope for stored messages - returns None for unknown msg_id - round-trips through accept_replica end-to-end (proves the wire shape matches across the two sides)	2026-05-22 19:18:01 -06:00
BigBodyCobain	b041b5e97c	Fix #298 : move Sentinel credentials from browser storage to backend .env Reported by @tg12. Pre-fix, the Settings panel stored real third-party Copernicus CDSE client_id + client_secret in browser localStorage / sessionStorage via the privacy storage helper, and the proxy routes required those values to come back in every tile/token request body. Any same-origin script (XSS, malicious browser extension, dev-tools HAR export) had read access to the credentials. This change moves them server-side, behind the same .env-backed admin flow every other third-party API key (OpenSky, AIS Stream, Finnhub, Shodan, …) already uses. Backend ------- backend/services/api_settings.py * Added SENTINEL_CLIENT_ID and SENTINEL_CLIENT_SECRET entries to API_REGISTRY. The existing GET/PUT /api/settings/api-keys flow (already require_local_operator-gated, .env-backed) now manages them — no new route surface. backend/routers/tools.py * /api/sentinel/token and /api/sentinel/tile resolve credentials via a new _resolve_sentinel_credentials() helper: body fields win for back-compat with any legacy callers, otherwise the helper reads SENTINEL_CLIENT_ID / SENTINEL_CLIENT_SECRET from os.environ. * When neither source has a value, the route returns 400 with a friendly pointer ("Set SENTINEL_CLIENT_ID and SENTINEL_CLIENT_SECRET in the API Keys panel") instead of the curt "required" message. The user's standing rule against hostile errors applies. * Function bodies only — decorator lines untouched, so this PR does not conflict with #303 (which adds Depends(require_local_operator) to the same routes). Frontend -------- frontend/src/lib/sentinelHub.ts — rewritten * Removed: getSentinelCredentials / setSentinelCredentials / clearSentinelCredentials / getSentinelCredentialStorageMode. These were the browser-storage read/write helpers; their existence was the bug. * Added: checkBackendSentinelStatus(), refreshSentinelStatus(), getCachedSentinelStatus(), and a kept-for-back-compat hasSentinelCredentials() shim. Status is sourced from /api/settings/api-keys (the same endpoint the API Keys panel already uses), so we don't add a new route just for this read. * Added: migrateLegacySentinelBrowserKeys() — one-shot, idempotent helper that clears sb_sentinel_client_id / _secret / _instance_id from BOTH localStorage and sessionStorage. We deliberately do NOT auto-POST those legacy browser values to the backend; doing so would silently migrate a secret across a trust boundary without operator consent. Operators re-enter once in the API Keys panel and the legacy keys get wiped here. * fetchSentinelTile and getSentinelToken no longer send client_id / client_secret in the request body. The backend uses .env. frontend/src/components/SettingsPanel.tsx * Dropped sb_sentinel_client_id / _secret / _instance_id from PRIVACY_SENSITIVE_BROWSER_KEYS — they're no longer written. * SentinelTab rewritten: removed the inline Client ID / Client Secret inputs + Save / Clear / Test buttons. Replaced with a status panel that calls checkBackendSentinelStatus() on mount, a one-click "Open API Keys Panel" button, and a migration banner that appears only when migrateLegacySentinelBrowserKeys() actually cleared something. * Setup guide STEP 3 now points to the API Keys panel instead of the local form. frontend/src/app/page.tsx * Added a one-time useEffect that fires checkBackendSentinelStatus() on mount so the cached value (which the synchronous hasSentinelCredentials() shim reads) is populated before MaplibreViewer's tile-URL memo runs. Tests ----- backend/tests/test_sentinel_credentials_server_side.py (new) * API_REGISTRY surface — sentinel_client_id / sentinel_client_secret are registered with the right env_keys, ALLOWED_ENV_KEYS lets /api/settings/api-keys PUT them. * Resolution order — body wins, env is fallback, neither → 400 with the friendly pointer message, and NO upstream HTTP call when neither source has credentials (asserted via MagicMock(side_effect=AssertionError)). * /api/sentinel/tile same shape. frontend/src/__tests__/utils/sentinelHub.test.ts (new) * migrateLegacySentinelBrowserKeys clears localStorage AND sessionStorage, reports what it cleared, idempotent. * fetchSentinelTile + getSentinelToken POST WITHOUT client_id / client_secret in the body (plants leaked credentials in browser storage first to prove they are NOT picked up). * checkBackendSentinelStatus parses /api/settings/api-keys correctly: true only when both keys is_set, false on partial config or network errors. All 7 backend tests + 8 frontend tests pass locally. The test_no_new_duplicate_routes guard and the api-settings test suite still pass. Credit: @tg12 for the audit report.	2026-05-22 10:44:50 -06:00
Shadowbroker	49621824b1	Use USNI Fleet Tracker as the primary carrier source + small UI fixes (#293 ) Background ========== PR #285 set up the seed -> cache -> GDELT model for the carrier tracker to address audit issues #244/#245/#246. The GDELT half of that pipeline hits api.gdeltproject.org's doc API for headline-region keyword matching -- low precision (false centroid positions per #245) AND unreliable (the host times out from some networks, including Docker Desktop on Windows). USNI publishes a weekly Fleet & Marine Tracker with explicit prose like: "The Gerald R. Ford Carrier Strike Group is operating in the Red Sea" "Aircraft carrier USS George Washington (CVN-73) is in port in Yokosuka, Japan" That is a strictly better source for U.S. Navy carrier positions: authoritative, deterministically parseable, weekly cadence. What this PR does ================= New module: backend/services/fetchers/usni_fleet_tracker.py - Pulls USNI's WordPress RSS feeds (site-wide + category, unioned). - Picks the most recent fleet-tracker post by parsed pubDate. - For each carrier in the registry, scans the article body for "is operating in / is in port in / returned to / transiting" near the carrier's name, hull code, or "<name> Carrier Strike Group" variant. Captures the region/port phrase that follows. - Maps the region phrase to coordinates via the existing REGION_COORDS table, with a USNI-phrase alias table for the specific wording USNI uses ("Yokosuka, Japan", "Norfolk, Va.", "Naval Station San Diego", "5th Fleet AOR", etc.). - Returns {hull: position_entry} with position_confidence="recent" and position_source_at = the article's actual publication timestamp (not now()). Politeness ---------- Uses outbound_user_agent("usni-fleet-tracker") so USNI sees a per-install Shadowbroker identifier (Round 7a / PR #292). The article body pages return 403 to non-browser UAs; the WordPress RSS feed serves the full <content:encoded> body and is the supported aggregator path. No browser UA spoofing. carrier_tracker.update_carrier_positions() now runs three phases: 1. Bootstrap from cache (or seed on first run). 2. USNI fleet tracker -- PRIMARY high-confidence source. 3. GDELT -- SECONDARY backfill; can NOT demote a "recent" USNI position to an "approximate" GDELT headline match. Verified live: 6 of 11 carriers picked up real May 18, 2026 positions on first refresh (Eisenhower, Ford, Bush, Roosevelt, Lincoln, Washington). The other 5 weren't mentioned in this week's article (they're in port at homeports with no deployment changes) and kept their cache entries -- which is the correct seed/cache contract from PR #285. Other small fixes bundled in ============================ docker-compose.yml: add the 6 third-party-fetcher opt-in env vars (PREDICTION_MARKETS_ENABLED, FINANCIAL_ENABLED, FIMI_ENABLED, NUFORC_ENABLED, NEWS_ENABLED, CROWDTHREAT_ENABLED). They were documented in .env.example but never wired through compose, so setting them in .env had no effect. frontend/src/components/TopRightControls.tsx: fix 6 broken i18n keys that were showing as raw "terminal.term1" / "terminal.cleanupDetail" / "node.soloReady" placeholders in the INFONET TERMINAL modal. The translation files have these strings under different key names; the component now calls the right ones. Full-file sweep confirmed every other t('...') key in the whole frontend resolves cleanly.	2026-05-21 20:39:23 -06:00
Shadowbroker	76750caa92	Round 7a: per-operator outbound attribution + GDELT GCS-direct fix (#292 ) == Per-install operator handle for every third-party API call == Before this PR, every Shadowbroker install identified itself to Wikipedia, Wikidata, Nominatim, GDELT, OpenMHz, Broadcastify, weather.gov, NUFORC, Sentinel/Planetary Computer, TinyGS / CelesTrak, Shodan, Finnhub, and others with a single project-wide User-Agent ("Shadowbroker/1.0" or "ShadowBroker-OSINT/1.0"). From the upstream's perspective every install in the world looked like one giant scraper. If one install misbehaved, the upstream's only recourse was to block "Shadowbroker" as a whole. PR #284 inadvertently doubled down on this in the frontend by introducing a shared `WIKIMEDIA_API_USER_AGENT` constant. This PR retrofits both backends to per-operator attribution. New setting: OPERATOR_HANDLE (env var / settings UI / auto-gen) New helper: network_utils.outbound_user_agent("purpose") The handle is auto-generated as "operator-XXXXXX" on first call (the "shadow-" prefix from earlier drafts was deliberately dropped — too suspicious-looking for abuse-detection systems). Operators can override via OPERATOR_HANDLE; the value is sanitized to lowercase alphanumeric+dash+underscore and capped at 48 chars. Persisted to backend/data/operator_handle.json so it survives container restarts. Retrofitted call sites (every previously-MONSTER User-Agent): - services/region_dossier.py (Wikipedia + Wikidata + Nominatim) - services/geocode.py (Nominatim) - services/sentinel_search.py (Microsoft Planetary Computer) - services/feed_ingester.py (operator-curated RSS feeds) - services/fetchers/earth_observation.py (weather.gov, NUFORC) - services/fetchers/infrastructure.py - services/fetchers/aircraft_database.py - services/fetchers/route_database.py - services/fetchers/trains.py - services/fetchers/meshtastic_map.py - services/shodan_connector.py - services/unusual_whales_connector.py (Finnhub) - services/tinygs_fetcher.py (CelesTrak + TinyGS) - services/sar/sar_products_client.py - services/geopolitics.py (GDELT) - services/radio_intercept.py (Broadcastify + OpenMHz) - routers/cctv.py + main.py (CCTV proxy) - routers/ai_intel.py - scripts/convert_power_plants.py (release-time data refresh) Spoofed browser UAs removed (issues #289 / #290 / #291 — tg12 audit): - cloudscraper-based Chrome impersonation against api.openmhz.com -> replaced with honest requests + per-install UA - Mozilla/5.0 spoofed UA on Broadcastify scrape -> replaced with honest UA - Mozilla/5.0 + fake first-party Referer on OpenMHz audio relay -> replaced with honest UA - cloudscraper dependency dropped from pyproject.toml + uv.lock Frontend retrofit: - new GET /api/settings/operator-handle endpoint (local-operator gated) returns the install's handle - frontend/src/lib/wikimediaClient.ts fetches the handle once on first use, caches it for page lifetime, embeds it in the Api-User-Agent for every Wikipedia / Wikidata browser-direct call == GDELT GCS-direct fix == GDELT's data.gdeltproject.org is a CNAME to a Google Cloud Storage bucket. GCS responds with the wildcard *.storage.googleapis.com cert which legitimately does NOT cover the GDELT custom domain, so Python's TLS verification correctly refuses the connection. Some networks happen to route through a path where this works; many (notably Docker Desktop's outbound NAT on local installs) do not. Verified on the maintainer's local install: GDELT was unreachable; 1610 geopolitical events / 48 export files were dropping silently. Fix: services/geopolitics._gcs_direct_gdelt_url() rewrites any data.gdeltproject.org URL to its GCS-direct equivalent (storage.googleapis.com/data.gdeltproject.org/...) where the standard GCS cert is genuinely valid. api.gdeltproject.org and every other host are left untouched. Confirmed live: backend log goes from GDELT lastupdate failed: 500 to Downloading 48 GDELT export files... Downloaded 48/48 GDELT exports GDELT parsed: 1610 conflict locations from 48 files == Tests == backend/tests/test_per_operator_outbound_attribution.py (12 tests) backend/tests/test_gdelt_gcs_direct_rewrite.py (6 tests) backend/tests/test_region_dossier_wikimedia_ua.py (updated to pin the helper + per-operator handle, not the old constant) frontend/src/__tests__/utils/wikimediaClient.test.ts (rewritten to mock /api/settings/operator-handle and assert per-operator UA) Local: backend 114/114 security+audit+round7a suite green; frontend 718/718 vitest suite green. Credit: tg12 (external security audit, issues #289/#290/#291 relating to spoofed UAs); BigBodyCobain (operator-prefix call, GDELT cloud-vs-local diagnosis).	2026-05-21 15:11:28 -06:00
Shadowbroker	5e6bb8511a	Fix #244/#245/#246: carrier tracker seed/cache/freshness model (#285 ) Replace the dated editorial fallback positions baked into the registry with a one-shot seed file + persistent observation cache. The user's runtime cache now reflects what THIS install has actually observed, not what USNI published on March 9, 2026. A year from now, the cache holds a year of observations and the seed is irrelevant. == #244: dated editorial coordinates out of the registry == CARRIER_REGISTRY no longer carries fallback_lat/lng/heading/desc. Those fields are deleted. The registry is now identity + homeport only. New file: backend/data/carrier_seed.json - Read-only, shipped with every release. - Used ONCE on first-ever startup to bootstrap carrier_cache.json. - Each entry stamped with position_confidence="seed" and the actual as-of date (2026-03-09), NOT now(). == #245: approximate confidence for headline-derived positions == _parse_carrier_positions_from_news() now stamps every GDELT-derived entry with position_confidence="approximate" so the UI knows the coordinate is a region-centroid match, not a precise observation. After the freshness window the label rolls over to "stale_approximate" so old-and-imprecise is distinguishable from recent-and-imprecise. The article's actual seendate is used as position_source_at instead of now(), so the "last reported X days ago" badge is honest. == #246: freshness is labelling, not eviction == The cache always preserves the last position the system observed, forever. What changes is the position_confidence label: - within configurable window (default 14d, env-overridable via SHADOWBROKER_CARRIER_FRESHNESS_DAYS) -> "recent" - older -> "stale" - seed-bootstrap entries that were never refreshed -> "seed" - homeport defaults (carrier added post-install) -> "homeport_default" - headline-derived (any age, fresh) -> "approximate" - headline-derived (older than window) -> "stale_approximate" The position itself never reverts to the seed or the registry. The user always sees the last position the system observed. Per the user's explicit guidance: "from there have it be the last position the user has logged the carriers that way a year from now it doesnt revert to where the ships are today". == Other improvements == - CACHE_FILE moved to backend/data/carrier_cache.json so it lives in the volume-mounted dir under Docker compose. Previously it was at /app/carrier_cache.json which got wiped on every container restart (pre-existing bug). - Atomic cache write (temp + os.replace) so a crash mid-write does not leave a truncated cache file. == Public API shape == Every carrier object the API emits now includes: - position_confidence: seed \| recent \| stale \| approximate \| stale_approximate \| homeport_default - position_source_at: ISO timestamp of when the underlying source was observed (NOT now()) - is_fallback: convenience boolean for the UI; true when the confidence is seed/stale/stale_approximate/ homeport_default Existing fields (estimated, source, source_url, last_osint_update, name, type, lat, lng, country, desc, wiki) are preserved exactly so the current ShipPopup frontend renders unchanged. last_osint_update now reflects position_source_at instead of now(), which is what the existing "last reported MM/DD" badge always meant to show. Tests: backend/tests/test_carrier_tracker_quality.py — 17 tests covering seed bootstrap, subsequent-startup ignoring seed, no-seed/ no-cache homeport fallback, registry no longer has fallback fields, freshness window labelling + env override, "year-old cache entry keeps its position, only the label flips" regression, approximate confidence for headline matches, GDELT seendate ISO parser, public response shape backward compat. Credit: tg12 (external security audit, three P1/P2 issues).	2026-05-21 11:15:52 -06:00
Shadowbroker	0fee36e8f7	Fix #218/#219/#220: identify ShadowBroker on Wikipedia + Wikidata calls (#284 ) Wikimedia's User-Agent policy asks API clients to identify themselves with a stable, contactable identifier so their operators can rate-limit or coordinate. Before this change, ShadowBroker was sending: - Backend (region_dossier.py): generic project default UA only; no Api-User-Agent. - Frontend (useRegionDossier.ts, WikiImage.tsx, NewsFeed.tsx): zero identifying header at all; three separate copy-pasted anonymous fetches with their own module-local caches. Three separate components doing the same broken thing meant policy fixes had to happen in three places, with no shared cache or kill switch. Fix (no UX change, zero hostility): == Backend == `backend/services/region_dossier.py` now sets explicit `User-Agent` + `Api-User-Agent` headers on every outbound Wikidata and Wikipedia request via a new `_WIKIMEDIA_REQUEST_HEADERS` constant. The identifier includes a contact path (issues page on the public GitHub repo). == Frontend == New shared helper `frontend/src/lib/wikimediaClient.ts`: - `fetchWikipediaSummary(title)` — single source of truth for Wikipedia REST summary lookups, with one shared LRU cache (in-flight requests deduplicated, 512-entry cap), `Api-User-Agent` on every fetch. - `fetchWikidataSparql(query)` — same shape for Wikidata SPARQL. - `WIKIMEDIA_API_USER_AGENT` — exported constant; one place to update if Wikimedia ever asks us to back off. Refactored three components to use the shared client: - `frontend/src/hooks/useRegionDossier.ts` — fetchLeader() and fetchLocalWikiSummary() now route through the shared helpers. - `frontend/src/components/WikiImage.tsx` — uses fetchWikipediaSummary, proper React state instead of module-mutation + forceUpdate trick. - `frontend/src/components/NewsFeed.tsx` — same shape. UX: byte-for-byte identical. Same thumbnails, same dossier content, same load behavior. The only observable difference is the outgoing request header. Note on #239 (route duplication): an audit-grade inventory shows 166 main.py routes are shadowed by router modules. That cleanup is too large to land safely in this PR; it will be staged as a separate ladder of small PRs grouped by router module. Tests: - `backend/tests/test_region_dossier_wikimedia_ua.py` — 3 tests asserting backend headers are present. - `frontend/src/__tests__/utils/wikimediaClient.test.ts` — 9 tests covering Api-User-Agent presence, shared cache, concurrent deduplication, disambiguation/HTTP-error/network-error fallthroughs, empty-input safety. Local: backend 76/76 security suite green, frontend 716/716 vitest suite green. Credit: tg12 (external security audit).	2026-05-21 10:48:05 -06:00
Shadowbroker	2e14e75a0e	Fix #256 : per-peer HMAC secrets defeat cross-peer impersonation (#281 ) Before this change, every peer-push HMAC was derived from the single fleet-shared MESH_PEER_PUSH_SECRET. The receiver could prove "this request was signed by someone who knows the fleet secret" but it could NOT prove which peer signed it. Any peer that knew the global secret could compute the expected HMAC for any other peer URL and forge a push pretending to be that peer. Fix: introduce MESH_PEER_SECRETS, an optional comma-separated url=secret map. When a peer URL appears in the map, only the listed per-peer secret is accepted for it -- the global secret is ignored for that specific URL. Peer A no longer knows peer B's secret, so peer A cannot forge a push claiming to be peer B. The new helper resolve_peer_key_for_url() in mesh_crypto.py wraps the lookup and is called from every existing peer-push call site: - backend/auth.py:_verify_peer_push_hmac (receiver) - backend/main.py:_http_peer_push_loop (Infonet event push) - backend/main.py:_http_gate_pull_loop (gate event pull) - backend/main.py:_http_gate_push_loop (gate event push) - backend/services/mesh/mesh_router.py (two transports, push) - backend/services/mesh/mesh_hashchain.py (gate wire ref key) - backend/services/mesh/mesh_wormhole_prekey.py (peer prekey lookup) Zero hostility, by design: - Single-peer installs leave MESH_PEER_SECRETS empty -> resolver falls back to MESH_PEER_PUSH_SECRET -> behavior is byte-for-byte unchanged. - Multi-peer installs that haven't migrated yet behave exactly as before. - Multi-peer installs that DO migrate set MESH_PEER_SECRETS on both ends of each peering and immediately close the impersonation surface for those URLs. Migration is incremental: unlisted peers keep using the global secret. Tests in backend/tests/test_per_peer_secret_resolver.py: - env parsing (default, override, whitespace, malformed entries, cache) - precedence: per-peer beats global - migration window: unlisted peer falls back to global - IMPERSONATION REFUSAL: peer A with global-secret-only cannot forge HMAC for peer B that has a per-peer secret configured - IMPERSONATION REFUSAL: peer A with its OWN per-peer secret cannot forge HMAC for peer B - positive control: legitimate peer B request verifies - zero-behavior-change: single-peer install produces the same key bytes as before the change Credit: tg12 (external security audit, P1/High/High confidence)	2026-05-21 10:05:29 -06:00
Shadowbroker	fb11e0881f	Fix #251 : refuse symlink/hardlink members during Tor bundle extraction (#277 ) External audit (@tg12) flagged that the Tor Expert Bundle extractor checked tarinfo.name against path traversal but never inspected tarinfo.linkname for symlink or hardlink members. Python 3.11's tarfile.extractall() honors symlinks, so a malicious archive could ship a member like:: name = "innocent.txt" (passes the path-traversal check) type = SYMTYPE linkname = "C:\Windows\System32\config\system" After extraction, subsequent reads of innocent.txt dereference to that arbitrary filesystem location; subsequent writes corrupt it. On Windows (where Tor Expert Bundle extraction actually runs), this is a host-compromise path of essentially the same severity as the supply-chain RCE in #231 — gated only by the integrity check we just hardened in PR #261/#265. Python 3.12+ added tarfile.extract / extractall filter='data' as a built-in mitigation; we're on Python 3.11 in production, so we implement the same idea manually. Fix in backend/services/tor_hidden_service.py: Extract the existing path-traversal-only check into a new _extract_tor_bundle_safely() helper that: 1. Refuses any member with member.issym() or member.islnk() True. Tor bundles never legitimately contain symlinks or hardlinks so this is non-disruptive. Logs the linkname so an operator can see what the malicious archive was trying to alias. 2. Refuses any member that isn't isfile() or isdir() — no FIFOs, no character or block devices, no contiguous-file-type entries. None of those belong in a Tor Expert Bundle and accepting them is a class of bug we don't need to debug later. 3. Preserves the original path-traversal guard (member.name must resolve under install_dir). 4. Catches tarfile.TarError so a corrupt archive returns False gracefully instead of bubbling out an exception. Tests: backend/tests/test_tor_bundle_symlink_filter.py (8 tests) - Clean archive with only regular files extracts successfully - Symlink member is rejected (the core regression) - Hardlink member is rejected - Symlink with relative target inside install_dir is still rejected (we don't allow symlinks at all, not just absolute-target ones) - FIFO/device-style member is rejected - Path-traversal guard still works under the new shape - Malformed/non-tar file is rejected gracefully (no crash) - Failure on one member rejects the whole bundle (no half-extract) Validation: pytest backend/tests/test_tor_bundle_symlink_filter.py backend/tests/test_tor_bundle_verification.py -> 14 passed UX impact: zero for legitimate Tor releases. Operators installing a real Tor Expert Bundle continue to see "Tor installed at:" exactly as before. Only malicious archives are refused, with a clear log message identifying the rejected linkname. Credit: @tg12 — the original report was specific enough that the fix design was immediate. Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-21 01:41:13 -06:00
Shadowbroker	7f96151e56	Fix #231 : multi-source SHA-256 verification for the self-updater (#265 ) External audit (@tg12, May 18) found that backend/services/updater.py silently skipped all SHA-256 integrity verification whenever the MESH_UPDATE_SHA256 env var was unset — which is the default. Nothing in any install doc tells operators to set it, so practically every deployment was running the auto-updater with zero integrity check. That made GitHub release pipeline compromise a single-step path to arbitrary code execution on every node that auto-updates. Investigation surfaced a deeper bug too: the updater downloads zipball_url (GitHub's auto-generated source archive) but the maintainer's release process publishes SHA256SUMS.txt for a separate named asset (ShadowBroker_v.zip). So even if MESH_UPDATE_SHA256 WERE set, operators had no published digest to compare against — the file they were downloading wasn't the file the maintainer had signed. This PR fixes both issues with the same multi-source verification chain we shipped for the Tor bundle in PR #261: backend/services/updater.py _download_release() now prefers a maintainer-signed release asset matching ShadowBroker_v.zip over zipball_url. Captures the SHA256SUMS.txt asset URL when present. _validate_zip_hash() rewritten as a four-source chain: 1. MESH_UPDATE_SHA256 env var (operator override, preserved) 2. SHA256SUMS.txt asset published with the release (primary — the maintainer's release process already publishes this) 3. Baked-in backend/data/release_digests.json (second line of defense for releases that lack the SHA256SUMS asset, or when the asset can't be fetched at update time) 4. HTTPS-only fallback with a loud warning (preserves the auto- update flow during transient outages) Mismatch from any source that DID respond is fatal — the update is refused and the existing install keeps running. Only the "no source reachable at all" case falls back to HTTPS-only. _fetch_sha256sums() new — fetches and parses a standard SHA256SUMS.txt asset. Handles both "<digest> <name>" and binary- marker "<digest> *<name>" formats. Tolerant to comments, blank lines, and malformed entries. backend/data/release_digests.json (new) Baked-in digest list keyed by release tag. Seeded with the v0.9.79 entries copied from the published SHA256SUMS.txt: ShadowBroker_v0.9.79.zip = f6877c1d6661... ShadowBroker_0.9.79_x64-setup.exe = f7b676ada45c... ShadowBroker_0.9.79_x64_en-US.msi = e0713c3cdda1... Whitelisted in .gitignore alongside the other static reference data files (kiwisdr_directory.json, tor_bundle_digests.json, aisstream_spki_pins.json). backend/tests/test_update_integrity_chain.py (new, 16 tests) - Each source matches → success, identifies which source verified - Each source mismatches → RuntimeError "mismatch" - No source reachable → https-only fallback with loud warning - Env override beats all other sources (preserved precedence) - SHA256SUMS.txt parser handles standard, binary-marker, comments, and network-failure cases Validation: pytest backend/tests/test_update_integrity_chain.py → 16 passed pytest (all 15 security test files together) → 105 passed UX impact: zero. Normal auto-update flow is unchanged for legitimate releases (path 2 catches everything because the release publishes SHA256SUMS.txt). Transient network failures during update gracefully fall through to path 3 then path 4 — no operator intervention needed. The only user-visible behavior change is in the compromised-release case, where the update is now refused instead of silently applied. Credit: @tg12 for the original bug report and the specific call-out that MESH_UPDATE_SHA256 was unreachable by default operators. Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-21 01:31:20 -06:00
Shadowbroker	729ea78cb2	Fix #258 : AIS proxy SPKI pinning fallback for expired upstream cert (#262 ) External report from @jmleclercq: AISStream's Let's Encrypt cert expired on 2026-05-20 (verified — their renewal pipeline failed), so the AIS WebSocket connection dies with CERT_HAS_EXPIRED and the maritime layer empties out. The reporter worked around it locally by passing { rejectUnauthorized: false } to the WebSocket constructor and asked whether we should add an env var for that. That fix is the wrong fix. Disabling TLS validation entirely lets any network attacker MITM the WebSocket and inject fake ship positions — same class as the GDELT plaintext-HTTP MITM we just closed in #199. Adding an env var for it would be an attractive nuisance: operators set it once during a bad cert week and then forget, leaving themselves open to MITM forever. Right fix: SPKI pinning, same pattern as the Tor bundle digest pinning in #201. The insight is that Let's Encrypt renewals keep the SAME public key by default, so the SPKI hash survives normal cert rotation. We can relax the date check while keeping the identity check. Mechanics: backend/data/aisstream_spki_pins.json (new) Pinned SHA-256 hashes of the DER-encoded SPKI bytes for stream.aisstream.io. Captured 2026-05-20 from the live cert. Format is base64(sha256(pubkey_der)), matching the canonical openssl pipeline. Whitelisted in .gitignore alongside the other static reference data files (KiwiSDR directory, Tor bundle digests). backend/ais_proxy.js Path A (99.9% of the time): normal TLS validation. Untouched. Path B (on CERT_HAS_EXPIRED only): re-handshake with rejectUnauthorized=false JUST to read the leaf cert, compute its SPKI hash, compare against the pinned list. If match → upstream is still the genuine AISStream → re-open the WebSocket with rejectUnauthorized=false and log DEGRADED MODE. If no match → refuse the connection, log loudly: this would be a real MITM. Pin file is looked up in three locations so the same code works in the Docker backend, the Tauri desktop runtime, and any operator-relocated layout (SHADOWBROKER_AIS_PINS env var). Embedded fallback list inside the JS so portable installs that haven't shipped the JSON still work. backend/services/ais_stream.py Captures the proxy's status markers from stdout ({"__ais_proxy_status": {"degraded_tls": true}}) into a module- level snapshot. Exposes ais_proxy_status() for the health endpoint. Doesn't touch the data plane — degraded mode keeps receiving vessel data, just with weaker MITM protection. backend/routers/health.py + backend/services/schemas.py /api/health now includes an ais_proxy block with degraded_tls. Top-level status escalates ok -> degraded when AIS is in degraded TLS mode (but won't downgrade a worse SLO status). Operators get a visible signal that they're in degraded mode without needing to grep logs. Tests: backend/tests/test_ais_spki_pinning.py (7 tests) - Pin file structure validation (JSON, host entry, base64 SHA-256) - ais_proxy_status() snapshot semantics (starts empty, defensive copy) - /api/health surfaces ais_proxy.degraded_tls when set - /api/health returns empty ais_proxy when proxy hasn't reported Node.js syntax check passes (node --check) on both backend/ais_proxy.js and the Tauri runtime mirror. When AISStream renews their cert (likely within hours-to-days), the normal-TLS path succeeds on next reconnect and degraded_tls clears automatically. No operator action needed. If they instead rotate their server key, the SPKI check will fail and we'll need to add the new hash to backend/data/aisstream_spki_pins.json before removing the old one. Credit: @jmleclercq for the clear report and the careful workaround verification (Node version, ws version, manual probe). Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-20 20:31:56 -06:00
@aaronjmars	8e27658157	fix(security): use defusedxml for untrusted XML parsing (#259 ) Detected by Aeon + Semgrep (5x use-defused-xml ERROR). Severity: medium CWE-776 (billion laughs) / CWE-611 (XML external entity) Five XML parse sites pass response bodies into the Python stdlib xml.etree.ElementTree without protection against entity expansion attacks. Python's ElementTree still permits internal entity references by default (per the docs vulnerabilities table), so a malicious or compromised upstream can ship a "billion laughs"-style payload that expands to gigabytes in memory. The user-controllable site is sb_monitor._parse_rss: the OpenClaw skill exposes add_custom_feed(name, url, ...) to the agent, then poll_custom_feeds fetches feed.url and passes the body to xml.etree.ElementTree.fromstring with no host allowlist or entity-bomb defence. The other four sites (psk_reporter_fetcher, aircraft_database, cctv_pipeline x2) parse XML from hard-coded upstreams (pskreporter.info, s3.opensky-network.org, datos.madrid.es); defence-in-depth for upstream-compromise/MITM. Switch all five call sites to defusedxml.ElementTree. Same fromstring/find/findall/iter/findtext API, but rejects entity references by default (raises defusedxml.EntitiesForbidden). Confirmed locally that a 4-deep billion-laughs payload that expands to 3000 chars under stdlib ET is rejected by defusedxml. Added defusedxml>=0.7.1 to backend/pyproject.toml dependencies. Co-authored-by: aeonframework <aeon-bot@aaronjmars.com>	2026-05-20 20:01:25 -06:00

1 2 3

116 Commits