Shadowbroker

mirror of https://github.com/BigBodyCobain/Shadowbroker.git synced 2026-05-28 18:11:31 +02:00

Author	SHA1	Message	Date
Shadowbroker	76750caa92	Round 7a: per-operator outbound attribution + GDELT GCS-direct fix (#292 ) == Per-install operator handle for every third-party API call == Before this PR, every Shadowbroker install identified itself to Wikipedia, Wikidata, Nominatim, GDELT, OpenMHz, Broadcastify, weather.gov, NUFORC, Sentinel/Planetary Computer, TinyGS / CelesTrak, Shodan, Finnhub, and others with a single project-wide User-Agent ("Shadowbroker/1.0" or "ShadowBroker-OSINT/1.0"). From the upstream's perspective every install in the world looked like one giant scraper. If one install misbehaved, the upstream's only recourse was to block "Shadowbroker" as a whole. PR #284 inadvertently doubled down on this in the frontend by introducing a shared `WIKIMEDIA_API_USER_AGENT` constant. This PR retrofits both backends to per-operator attribution. New setting: OPERATOR_HANDLE (env var / settings UI / auto-gen) New helper: network_utils.outbound_user_agent("purpose") The handle is auto-generated as "operator-XXXXXX" on first call (the "shadow-" prefix from earlier drafts was deliberately dropped — too suspicious-looking for abuse-detection systems). Operators can override via OPERATOR_HANDLE; the value is sanitized to lowercase alphanumeric+dash+underscore and capped at 48 chars. Persisted to backend/data/operator_handle.json so it survives container restarts. Retrofitted call sites (every previously-MONSTER User-Agent): - services/region_dossier.py (Wikipedia + Wikidata + Nominatim) - services/geocode.py (Nominatim) - services/sentinel_search.py (Microsoft Planetary Computer) - services/feed_ingester.py (operator-curated RSS feeds) - services/fetchers/earth_observation.py (weather.gov, NUFORC) - services/fetchers/infrastructure.py - services/fetchers/aircraft_database.py - services/fetchers/route_database.py - services/fetchers/trains.py - services/fetchers/meshtastic_map.py - services/shodan_connector.py - services/unusual_whales_connector.py (Finnhub) - services/tinygs_fetcher.py (CelesTrak + TinyGS) - services/sar/sar_products_client.py - services/geopolitics.py (GDELT) - services/radio_intercept.py (Broadcastify + OpenMHz) - routers/cctv.py + main.py (CCTV proxy) - routers/ai_intel.py - scripts/convert_power_plants.py (release-time data refresh) Spoofed browser UAs removed (issues #289 / #290 / #291 — tg12 audit): - cloudscraper-based Chrome impersonation against api.openmhz.com -> replaced with honest requests + per-install UA - Mozilla/5.0 spoofed UA on Broadcastify scrape -> replaced with honest UA - Mozilla/5.0 + fake first-party Referer on OpenMHz audio relay -> replaced with honest UA - cloudscraper dependency dropped from pyproject.toml + uv.lock Frontend retrofit: - new GET /api/settings/operator-handle endpoint (local-operator gated) returns the install's handle - frontend/src/lib/wikimediaClient.ts fetches the handle once on first use, caches it for page lifetime, embeds it in the Api-User-Agent for every Wikipedia / Wikidata browser-direct call == GDELT GCS-direct fix == GDELT's data.gdeltproject.org is a CNAME to a Google Cloud Storage bucket. GCS responds with the wildcard *.storage.googleapis.com cert which legitimately does NOT cover the GDELT custom domain, so Python's TLS verification correctly refuses the connection. Some networks happen to route through a path where this works; many (notably Docker Desktop's outbound NAT on local installs) do not. Verified on the maintainer's local install: GDELT was unreachable; 1610 geopolitical events / 48 export files were dropping silently. Fix: services/geopolitics._gcs_direct_gdelt_url() rewrites any data.gdeltproject.org URL to its GCS-direct equivalent (storage.googleapis.com/data.gdeltproject.org/...) where the standard GCS cert is genuinely valid. api.gdeltproject.org and every other host are left untouched. Confirmed live: backend log goes from GDELT lastupdate failed: 500 to Downloading 48 GDELT export files... Downloaded 48/48 GDELT exports GDELT parsed: 1610 conflict locations from 48 files == Tests == backend/tests/test_per_operator_outbound_attribution.py (12 tests) backend/tests/test_gdelt_gcs_direct_rewrite.py (6 tests) backend/tests/test_region_dossier_wikimedia_ua.py (updated to pin the helper + per-operator handle, not the old constant) frontend/src/__tests__/utils/wikimediaClient.test.ts (rewritten to mock /api/settings/operator-handle and assert per-operator UA) Local: backend 114/114 security+audit+round7a suite green; frontend 718/718 vitest suite green. Credit: tg12 (external security audit, issues #289/#290/#291 relating to spoofed UAs); BigBodyCobain (operator-prefix call, GDELT cloud-vs-local diagnosis).	2026-05-21 15:11:28 -06:00
@aaronjmars	8e27658157	fix(security): use defusedxml for untrusted XML parsing (#259 ) Detected by Aeon + Semgrep (5x use-defused-xml ERROR). Severity: medium CWE-776 (billion laughs) / CWE-611 (XML external entity) Five XML parse sites pass response bodies into the Python stdlib xml.etree.ElementTree without protection against entity expansion attacks. Python's ElementTree still permits internal entity references by default (per the docs vulnerabilities table), so a malicious or compromised upstream can ship a "billion laughs"-style payload that expands to gigabytes in memory. The user-controllable site is sb_monitor._parse_rss: the OpenClaw skill exposes add_custom_feed(name, url, ...) to the agent, then poll_custom_feeds fetches feed.url and passes the body to xml.etree.ElementTree.fromstring with no host allowlist or entity-bomb defence. The other four sites (psk_reporter_fetcher, aircraft_database, cctv_pipeline x2) parse XML from hard-coded upstreams (pskreporter.info, s3.opensky-network.org, datos.madrid.es); defence-in-depth for upstream-compromise/MITM. Switch all five call sites to defusedxml.ElementTree. Same fromstring/find/findall/iter/findtext API, but rejects entity references by default (raises defusedxml.EntitiesForbidden). Confirmed locally that a 4-deep billion-laughs payload that expands to 3000 chars under stdlib ET is rejected by defusedxml. Added defusedxml>=0.7.1 to backend/pyproject.toml dependencies. Co-authored-by: aeonframework <aeon-bot@aaronjmars.com>	2026-05-20 20:01:25 -06:00
BigBodyCobain	b86a258535	Release v0.9.79 runtime and messaging update Ship the v0.9.79 runtime refresh with transport lane isolation, Infonet secure-message address management, MeshChat MQTT controls, selected asset trail behavior, telemetry panel refinements, onboarding updates, and desktop/package metadata alignment. Also ignore local graphify work products so analysis folders do not leak into future commits.	2026-05-12 11:49:46 -06:00
BigBodyCobain	b8ac0fb9e7	Harden v0.9.75 wormhole node sync and telemetry panels Add Tor/onion runtime wiring and faster Infonet node status refresh. Keep node bootstrap state clearer across Docker and local runtimes. Use selected aircraft trail history for cumulative tracked-aircraft emissions.	2026-05-06 14:04:16 -06:00
BigBodyCobain	6ffd54931c	Release v0.9.75 runtime and onboarding update Ship the 0.9.75 source update with improved startup/runtime hardening, operator API key onboarding, Meshtastic MQTT controls, Infonet/MeshChat separation, desktop package versioning, and aircraft telemetry refinements. Also updates focused backend/frontend tests for node settings, Meshtastic MQTT settings, and desktop runtime behavior.	2026-05-06 01:15:54 -06:00
Shadowbroker	38bcc976a4	Merge pull request #140 from BigBodyCobain/dependabot/pip/backend/yfinance-1.3.0 Upgrades yfinance from 0.2.54 to 1.3.0 in /backend	2026-05-02 00:26:10 -06:00
Shadowbroker	77b4361ad6	Merge pull request #141 from BigBodyCobain/dependabot/pip/backend/playwright-1.59.0 Bump playwright from 1.50.0 to 1.59.0 in /backend	2026-05-02 00:25:23 -06:00
Shadowbroker	c5819d40d1	Merge pull request #138 from BigBodyCobain/dependabot/pip/backend/pydantic-2.13.3 Gets pydantic from 2.11.1 to 2.13.3 in /backend	2026-05-02 00:24:54 -06:00
dependabot[bot]	da2a27f92a	chore(deps): bump sgp4 from 2.23 to 2.25 in /backend Bumps [sgp4](https://github.com/brandon-rhodes/python-sgp4) from 2.23 to 2.25. - [Commits](https://github.com/brandon-rhodes/python-sgp4/compare/2.23...2.25) --- updated-dependencies: - dependency-name: sgp4 dependency-version: '2.25' dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2026-05-02 05:49:04 +00:00
dependabot[bot]	e6bea9dad3	chore(deps): bump playwright from 1.50.0 to 1.59.0 in /backend Bumps [playwright](https://github.com/microsoft/playwright-python) from 1.50.0 to 1.59.0. - [Release notes](https://github.com/microsoft/playwright-python/releases) - [Commits](https://github.com/microsoft/playwright-python/compare/v1.50.0...v1.59.0) --- updated-dependencies: - dependency-name: playwright dependency-version: 1.59.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2026-05-02 05:49:00 +00:00
dependabot[bot]	aebd5f0198	chore(deps): bump yfinance from 0.2.54 to 1.3.0 in /backend Bumps [yfinance](https://github.com/ranaroussi/yfinance) from 0.2.54 to 1.3.0. - [Release notes](https://github.com/ranaroussi/yfinance/releases) - [Changelog](https://github.com/ranaroussi/yfinance/blob/main/CHANGELOG.rst) - [Commits](https://github.com/ranaroussi/yfinance/compare/0.2.54...1.3.0) --- updated-dependencies: - dependency-name: yfinance dependency-version: 1.3.0 dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com>	2026-05-02 05:48:56 +00:00
dependabot[bot]	2f70b50f65	chore(deps): bump pydantic from 2.11.1 to 2.13.3 in /backend Bumps [pydantic](https://github.com/pydantic/pydantic) from 2.11.1 to 2.13.3. - [Release notes](https://github.com/pydantic/pydantic/releases) - [Changelog](https://github.com/pydantic/pydantic/blob/main/HISTORY.md) - [Commits](https://github.com/pydantic/pydantic/compare/v2.11.1...v2.13.3) --- updated-dependencies: - dependency-name: pydantic dependency-version: 2.13.3 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2026-05-02 05:48:49 +00:00
BigBodyCobain	4ec1fce53d	ci: unblock v0.9.7 release checks	2026-05-01 23:24:46 -06:00
BigBodyCobain	28b3bd5ebf	release: prepare v0.9.7	2026-05-01 22:56:50 -06:00
anoracleofra-code	81b99c0571	fix: add meshtastic, PyNaCl, vaderSentiment to dependencies Full import audit found these packages used but missing from pyproject.toml — all silently broken in Docker: - meshtastic: MQTT protobuf decode (why US/LongFast chat was empty) - PyNaCl: DM sealed-box encryption - vaderSentiment: oracle sentiment analysis (unguarded, would crash)	2026-03-26 16:19:24 -06:00
anoracleofra-code	6140e9b7da	fix: pin paho-mqtt to v1.x (v2 broke callback API) paho-mqtt v2 changed Client constructor and on_connect callback signatures, breaking the Meshtastic MQTT bridge. Pin to <2.0.0 so the existing v1 code works correctly in Docker.	2026-03-26 15:57:14 -06:00
anoracleofra-code	12cf5c0824	fix: add paho-mqtt dependency + improve Infonet sync status labels paho-mqtt was missing from pyproject.toml, causing the Meshtastic MQTT bridge to silently disable itself in Docker — no live chat messages could be received. Also improve Infonet node status labels: show RETRYING when sync fails instead of misleading SYNCING, and WAITING when node is enabled but no sync has run yet.	2026-03-26 15:45:11 -06:00
anoracleofra-code	fb6d098adf	fix: add missing orjson, beautifulsoup4, cryptography deps to pyproject.toml Docker image was crash-looping with `ModuleNotFoundError: No module named 'orjson'` because these packages were imported but not declared as dependencies. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-26 08:03:17 -06:00
anoracleofra-code	09e39de4ef	fix: add dev dependency group to pyproject.toml for CI CI runs `uv sync --group dev` but only a `test` group existed. Renamed to `dev` and added ruff + black so Docker Publish can pass. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-26 06:33:35 -06:00
dependabot[bot]	f3946d9b0d	chore(deps): bump python-dotenv from 1.0.1 to 1.2.2 in /backend Bumps [python-dotenv](https://github.com/theskumar/python-dotenv) from 1.0.1 to 1.2.2. - [Release notes](https://github.com/theskumar/python-dotenv/releases) - [Changelog](https://github.com/theskumar/python-dotenv/blob/main/CHANGELOG.md) - [Commits](https://github.com/theskumar/python-dotenv/compare/v1.0.1...v1.2.2) --- updated-dependencies: - dependency-name: python-dotenv dependency-version: 1.2.2 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com>	2026-03-26 11:59:51 +00:00
Orfeo Terkuci	fa2d47ca66	Refactor project structure: separate backend dependencies into pyproject.toml	2026-03-24 20:03:51 +01:00

21 Commits