gstack/browse/test at 372a5aa382bb931dcfb9efaee7bed24f3d5bddfa - gstack - MS-GitHub-Backup (Gitea)

CalvinBackup/gstack

mirror of https://github.com/garrytan/gstack.git synced 2026-06-18 15:50:11 +02:00

Files

T

History

Garry Tan b515f31400 feat(security): always run Haiku on tool outputs (drop the L4 gate)

Tool-result scan previously short-circuited when L4 (TestSavantAI)
scored below WARN, and further gated Haiku on any layer firing at >=
LOG_ONLY. On BrowseSafe-Bench that meant Haiku almost never ran,
because TestSavantAI has ~15% recall on browser-agent-specific
attacks (social engineering, indirect injection). We were gating our
best signal on our weakest.

Run all three classifiers (L4 + L4c + Haiku) in parallel. Cost:
~$0.002 + ~8s Haiku wall time per tool result, bounded by the 15s
Haiku timeout. Haiku also runs in parallel with the content scans
so it's additive only against the stream handler budget, not
against the session wall time.

User-input pre-spawn path unchanged — shouldRunTranscriptCheck still
gates there. The Stack Overflow FP mitigation that original gate was
built for still applies to direct user input; tool outputs have
different characteristics.

Source-contract test updated to pin the new parallel-three shape.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-04-20 21:15:57 +08:00

..

test(security): mock-claude scenario for tool-result injection path

2026-04-20 20:55:25 +08:00

activity.test.ts

feat: headed mode + sidebar agent + Chrome extension (v0.12.0) (#517 )

2026-03-26 11:15:24 -06:00

adversarial-security.test.ts

fix: security audit remediation — 12 fixes, 20 tests (v0.13.1.0) (#595 )

2026-03-28 08:35:24 -06:00

batch.test.ts

refactor: extract TabSession for per-tab state isolation (v0.15.16.0) (#873 )

2026-04-07 00:23:36 -07:00

browser-manager-unit.test.ts

feat: headed mode + sidebar agent + Chrome extension (v0.12.0) (#517 )

2026-03-26 11:15:24 -06:00

build.test.ts

fix: ngrok Windows build + close CI error-swallowing gap (v0.18.0.1) (#1024 )

2026-04-16 13:49:04 -07:00

bun-polyfill.test.ts

fix: Windows support — Node.js server fallback for Playwright (#255 )

2026-03-20 12:22:11 -07:00

commands.test.ts

feat(browse): Puppeteer parity — load-html, screenshot --selector, viewport --scale, file:// (v1.1.0.0) (#1062 )

2026-04-18 23:25:33 +08:00

compare-board.test.ts

refactor: extract TabSession for per-tab state isolation (v0.15.16.0) (#873 )

2026-04-07 00:23:36 -07:00

config.test.ts

fix: Windows browse — health-check-first ensureServer, detached startServer, Windows process mgmt (v0.11.11.0) (#431 )

2026-03-24 00:38:10 -07:00

content-security.test.ts

feat: content security — 4-layer prompt injection defense for pair-agent (#815 )

2026-04-06 14:41:06 -07:00

cookie-import-browser.test.ts

feat: Wave 3 — community bug fixes & platform support (v0.11.6.0) (#359 )

2026-03-23 22:15:23 -07:00

cookie-picker-routes.test.ts

community wave: 6 PRs + hardening (v0.18.1.0) (#1028 )

2026-04-17 00:45:13 -07:00

data-platform.test.ts

feat: browser data platform for AI agents (v0.16.0.0) (#907 )

2026-04-08 00:41:55 -07:00

dx-polish.test.ts

feat(browse): Puppeteer parity — load-html, screenshot --selector, viewport --scale, file:// (v1.1.0.0) (#1062 )

2026-04-18 23:25:33 +08:00

error-handling.test.ts

refactor: AI slop reduction with cross-model quality review (v0.16.3.0) (#941 )

2026-04-10 17:13:15 -10:00

file-drop.test.ts

feat: headed mode + sidebar agent + Chrome extension (v0.12.0) (#517 )

2026-03-26 11:15:24 -06:00

find-browse.test.ts

feat: multi-agent support — gstack works on Codex, Gemini CLI, and Cursor (v0.9.0) (#226 )

2026-03-19 18:20:50 -07:00

findport.test.ts

feat: community PRs — faster install, skill namespacing, uninstall, Codex fallback, Windows fix, Python patterns (v0.12.9.0) (#561 )

2026-03-27 00:44:37 -06:00

gstack-config.test.ts

feat: composable skills — INVOKE_SKILL resolver + factoring infrastructure (v0.13.7.0) (#644 )

2026-03-29 23:35:17 -06:00

gstack-update-check.test.ts

fix: community PRs + security hardening + E2E stability (v0.12.7.0) (#552 )

2026-03-26 23:21:27 -06:00

handoff.test.ts

refactor: extract TabSession for per-tab state isolation (v0.15.16.0) (#873 )

2026-04-07 00:23:36 -07:00

learnings-injection.test.ts

fix: community security wave — 8 PRs, 4 contributors (v0.15.13.0) (#847 )

2026-04-06 00:47:04 -07:00

path-validation.test.ts

fix: community security wave — 8 PRs, 4 contributors (v0.15.13.0) (#847 )

2026-04-06 00:47:04 -07:00

pdf-flags.test.ts

feat(v1.4.0.0): /make-pdf — markdown to publication-quality PDFs (#1086 )

2026-04-20 13:20:30 +08:00

platform.test.ts

fix: Windows support — Node.js server fallback for Playwright (#255 )

2026-03-20 12:22:11 -07:00

security-adversarial-fixes.test.ts

test(security): regression tests for 4 adversarial-review fixes

2026-04-20 11:07:27 +08:00

security-adversarial.test.ts

test(security): adversarial suite for canary + ensemble combiner

2026-04-20 04:18:48 +08:00

security-audit-r2.test.ts

feat(browse): Puppeteer parity — load-html, screenshot --selector, viewport --scale, file:// (v1.1.0.0) (#1062 )

2026-04-18 23:25:33 +08:00

security-bench.test.ts

test(security): add BrowseSafe-Bench smoke harness (v1 baseline)

2026-04-20 04:50:53 +08:00

security-bunnative.test.ts

test(security): bun-native tokenizer correctness + bench harness shape

2026-04-20 05:02:59 +08:00

security-classifier.test.ts

test(security): classifier gating + status contract (9 tests)

2026-04-20 04:21:17 +08:00

security-e2e-fullstack.test.ts

test(security): full-stack E2E — the security-contract anchor

2026-04-20 05:40:54 +08:00

security-integration.test.ts

test(security): integration suite — content-security.ts + security.ts coexistence

2026-04-20 04:20:14 +08:00

security-live-playwright.test.ts

test(security): live Playwright integration — defense-in-depth E5 contract

2026-04-20 04:44:07 +08:00

security-review-flow.test.ts

test(security): review-flow regression tests

2026-04-20 20:25:37 +08:00

security-review-fullstack.test.ts

test(security): full-stack review E2E — real classifier + mock-claude

2026-04-20 20:55:45 +08:00

security-review-sidepanel-e2e.test.ts

test(security): sidepanel review E2E — Playwright drives Allow/Block

2026-04-20 20:55:16 +08:00

security-sidepanel-dom.test.ts

test(security): sidepanel DOM tests via Playwright — shield + banner render

2026-04-20 05:40:54 +08:00

security-source-contracts.test.ts

feat(security): always run Haiku on tool outputs (drop the L4 gate)

2026-04-20 21:15:57 +08:00

security.test.ts

test(security): 4 new ensemble tests — 3-way agreement rule

2026-04-20 04:55:23 +08:00

server-auth.test.ts

fix: cookie picker auth token leak (v0.15.17.0) (#904 )

2026-04-08 10:10:13 -07:00

sidebar-agent-roundtrip.test.ts

fix: sidebar agent uses real tab URL instead of stale Playwright URL (v0.12.6.0) (#544 )

2026-03-26 22:07:03 -06:00

sidebar-agent.test.ts

Merge origin/main into garrytan/prompt-injection-guard

2026-04-20 14:09:09 +08:00

sidebar-integration.test.ts

fix: sidebar agent uses real tab URL instead of stale Playwright URL (v0.12.6.0) (#544 )

2026-03-26 22:07:03 -06:00

sidebar-security.test.ts

test(security): assert tool-result ML scan surface (Read/Glob/Grep ingress)

2026-04-20 04:42:20 +08:00

sidebar-unit.test.ts

fix: sidebar agent uses real tab URL instead of stale Playwright URL (v0.12.6.0) (#544 )

2026-03-26 22:07:03 -06:00

sidebar-ux.test.ts

fix: community security wave — 8 PRs, 4 contributors (v0.15.13.0) (#847 )

2026-04-06 00:47:04 -07:00

snapshot.test.ts

refactor: extract TabSession for per-tab state isolation (v0.15.16.0) (#873 )

2026-04-07 00:23:36 -07:00

state-ttl.test.ts

fix: security audit remediation — 12 fixes, 20 tests (v0.13.1.0) (#595 )

2026-03-28 08:35:24 -06:00

tab-isolation.test.ts

feat: browser data platform for AI agents (v0.16.0.0) (#907 )

2026-04-08 00:41:55 -07:00

test-server.ts

feat: Phase 3.5 — cookie import, QA testing, team retro (v0.3.1) (#29 )

2026-03-13 00:31:41 -07:00

token-registry.test.ts

feat: browser data platform for AI agents (v0.16.0.0) (#907 )

2026-04-08 00:41:55 -07:00

url-validation.test.ts

feat(browse): Puppeteer parity — load-html, screenshot --selector, viewport --scale, file:// (v1.1.0.0) (#1062 )

2026-04-18 23:25:33 +08:00

watch.test.ts

feat: headed mode + sidebar agent + Chrome extension (v0.12.0) (#517 )

2026-03-26 11:15:24 -06:00

watchdog.test.ts

community wave: 6 PRs + hardening (v0.18.1.0) (#1028 )

2026-04-17 00:45:13 -07:00

welcome-page.test.ts

feat: GStack Browser — double-click AI browser with anti-bot stealth (#695 )

2026-04-04 10:17:05 -07:00