mirror of
https://github.com/garrytan/gstack.git
synced 2026-05-02 03:35:09 +02:00
a1a933614c
* feat: CDP inspector module — persistent sessions, CSS cascade, style modification New browse/src/cdp-inspector.ts with full CDP inspection engine: - inspectElement() via CSS.getMatchedStylesForNode + DOM.getBoxModel - modifyStyle() via CSS.setStyleTexts with headless page.evaluate fallback - Persistent CDP session lifecycle (create, reuse, detach on nav, re-create) - Specificity sorting, overridden property detection, UA rule filtering - Modification history with undo support - formatInspectorResult() for CLI output Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: browse server inspector endpoints + inspect/style/cleanup/prettyscreenshot CLI Server endpoints: POST /inspector/pick, GET /inspector, POST /inspector/apply, POST /inspector/reset, GET /inspector/history, GET /inspector/events (SSE). CLI commands: inspect (CDP cascade), style (live CSS mod), cleanup (page clutter removal), prettyscreenshot (clean screenshot pipeline). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: sidebar CSS inspector — element picker, box model, rule cascade, quick edit Extension changes for the visual CSS inspector: - inspector.js: element picker with hover highlight, CSS selector generation, basic mode fallback (getComputedStyle + CSSOM), page alteration handlers - inspector.css: picker overlay styles (blue highlight + tooltip) - background.js: inspector message routing (picker <-> server <-> sidepanel) - sidepanel: Inspector tab with box model viz (gstack palette), matched rules with specificity badges, computed styles, click-to-edit quick edit, Send to Agent/Code button, empty/loading/error states Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: document inspect, style, cleanup, prettyscreenshot browse commands Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: auto-track user-created tabs and handle tab close browser-manager.ts changes: - context.on('page') listener: automatically tracks tabs opened by the user (Cmd+T, right-click open in new tab, window.open). Previously only programmatic newTab() was tracked, so user tabs were invisible. - page.on('close') handler in wirePageEvents: removes closed tabs from the pages map and switches activeTabId to the last remaining tab. - syncActiveTabByUrl: match Chrome extension's active tab URL to the correct Playwright page for accurate tab identity. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: per-tab agent isolation via BROWSE_TAB environment variable Prevents parallel sidebar agents from interfering with each other's tab context. Three-layer fix: - sidebar-agent.ts: passes BROWSE_TAB=<tabId> env var to each claude process, per-tab processing set allows concurrent agents across tabs - cli.ts: reads process.env.BROWSE_TAB and includes tabId in command request body - server.ts: handleCommand() temporarily switches activeTabId when tabId is present, restores after command completes (safe: Bun event loop is single-threaded) Also: per-tab agent state (TabAgentState map), per-tab message queuing, per-tab chat buffers, verbose streaming narration, stop button endpoint. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: sidebar per-tab chat context, tab bar sync, stop button, UX polish Extension changes: - sidepanel.js: per-tab chat history (tabChatHistories map), switchChatTab() swaps entire chat view, browserTabActivated handler for instant tab sync, stop button wired to /sidebar-agent/stop, pollTabs renders tab bar - sidepanel.html: updated banner text ("Browser co-pilot"), stop button markup, input placeholder "Ask about this page..." - sidepanel.css: tab bar styles, stop button styles, loading state fixes - background.js: chrome.tabs.onActivated sends browserTabActivated to sidepanel with tab URL for instant tab switch detection Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test: per-tab isolation, BROWSE_TAB pinning, tab tracking, sidebar UX sidebar-agent.test.ts (new tests): - BROWSE_TAB env var passed to claude process - CLI reads BROWSE_TAB and sends tabId in body - handleCommand accepts tabId, saves/restores activeTabId - Tab pinning only activates when tabId provided - Per-tab agent state, queue, concurrency - processingTabs set for parallel agents sidebar-ux.test.ts (new tests): - context.on('page') tracks user-created tabs - page.on('close') removes tabs from pages map - Tab isolation uses BROWSE_TAB not system prompt hack - Per-tab chat context in sidepanel - Tab bar rendering, stop button, banner text Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: resolve merge conflicts — keep security defenses + per-tab isolation Merged main's security improvements (XML escaping, prompt injection defense, allowed commands whitelist, --model opus, Write tool, stderr capture) with our branch's per-tab isolation (BROWSE_TAB env var, processingTabs set, no --resume). Updated test expectations for expanded system prompt. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * chore: bump version and changelog (v0.13.9.0) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: add inspector message types to background.js allowlist Pre-existing bug found by Codex: ALLOWED_TYPES in background.js was missing all inspector message types (startInspector, stopInspector, elementPicked, pickerCancelled, applyStyle, toggleClass, injectCSS, resetAll, inspectResult). Messages were silently rejected, making the inspector broken on ALL pages. Also: separate executeScript and insertCSS into individual try blocks in injectInspector(), store inspectorMode for routing, and add content.js fallback when script injection fails (CSP, chrome:// pages). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: basic element picker in content.js for CSP-restricted pages When inspector.js can't be injected (CSP, chrome:// pages), content.js provides a basic picker using getComputedStyle + CSSOM: - startBasicPicker/stopBasicPicker message handlers - captureBasicData() with ~30 key CSS properties, box model, matched rules - Hover highlight with outline save/restore (never leaves artifacts) - Click uses e.target directly (no re-querying by selector) - Sends inspectResult with mode:'basic' for sidebar rendering - Escape key cancels picker and restores outlines Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: cleanup + screenshot buttons in sidebar inspector toolbar Two action buttons in the inspector toolbar: - Cleanup (🧹): POSTs cleanup --all to server, shows spinner, chat notification on success, resets inspector state (element may be removed) - Screenshot (📸): POSTs screenshot to server, shows spinner, chat notification with saved file path Shared infrastructure: - .inspector-action-btn CSS with loading spinner via ::after pseudo-element - chat-notification type in addChatEntry() for system messages - package.json version bump to 0.13.9.0 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test: inspector allowlist, CSP fallback, cleanup/screenshot buttons 16 new tests in sidebar-ux.test.ts: - Inspector message allowlist includes all inspector types - content.js basic picker (startBasicPicker, captureBasicData, CSSOM, outline save/restore, inspectResult with mode basic, Escape cleanup) - background.js CSP fallback (separate try blocks, inspectorMode, fallback) - Cleanup button (POST /command, inspector reset after success) - Screenshot button (POST /command, notification rendering) - Chat notification type and CSS styles Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: update project documentation for v0.13.9.0 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: cleanup + screenshot buttons in chat toolbar (not just inspector) Quick actions toolbar (🧹 Cleanup, 📸 Screenshot) now appears above the chat input, always visible. Both inspector and chat buttons share runCleanup() and runScreenshot() helper functions. Clicking either set shows loading state on both simultaneously. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test: chat toolbar buttons, shared helpers, quick-action-btn styles Tests that chat toolbar exists (chat-cleanup-btn, chat-screenshot-btn, quick-actions container), CSS styles (.quick-action-btn, .quick-action-btn.loading), shared runCleanup/runScreenshot helper functions, and cleanup inspector reset. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: aggressive cleanup heuristics — overlays, scroll unlock, blur removal Massively expanded CLEANUP_SELECTORS with patterns from uBlock Origin and Readability.js research: - ads: 30+ selectors (Google, Amazon, Outbrain, Taboola, Criteo, etc.) - cookies: OneTrust, Cookiebot, TrustArc, Quantcast + generic patterns - overlays (NEW): paywalls, newsletter popups, interstitials, push prompts, app download banners, survey modals - social: follow prompts, share tools - Cleanup now defaults to --all when no args (sidebar button fix) - Uses !important on all display:none (overrides inline styles) - Unlocks body/html scroll (overflow:hidden from modal lockout) - Removes blur/filter effects (paywall content blur) - Removes max-height truncation (article teaser truncation) - Collapses empty ad placeholder whitespace (empty divs after ad removal) - Skips gstack-ctrl indicator in sticky removal Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: disable action buttons when disconnected, no error spam - setActionButtonsEnabled() toggles .disabled class on all cleanup/screenshot buttons (both chat toolbar and inspector toolbar) - Called with false in updateConnection when server URL is null - Called with true when connection established - runCleanup/runScreenshot silently return when disconnected instead of showing 'Not connected' error notifications - CSS .disabled style: pointer-events:none, opacity:0.3, cursor:not-allowed Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test: cleanup heuristics, button disabled state, overlay selectors 17 new tests: - cleanup defaults to --all on empty args - CLEANUP_SELECTORS overlays category (paywall, newsletter, interstitial) - Major ad networks in selectors (doubleclick, taboola, criteo, etc.) - Major consent frameworks (OneTrust, Cookiebot, TrustArc, Quantcast) - !important override for inline styles - Scroll unlock (body overflow:hidden) - Blur removal (paywall content blur) - Article truncation removal (max-height) - Empty placeholder collapse - gstack-ctrl indicator skip in sticky cleanup - setActionButtonsEnabled function - Buttons disabled when disconnected - No error spam from cleanup/screenshot when disconnected - CSS disabled styles for action buttons Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: LLM-based page cleanup — agent analyzes page semantically Instead of brittle CSS selectors, the cleanup button now sends a prompt to the sidebar agent (which IS an LLM). The agent: 1. Runs deterministic $B cleanup --all as a quick first pass 2. Takes a snapshot to see what's left 3. Analyzes the page semantically to identify remaining clutter 4. Removes elements intelligently, preserving site branding This means cleanup works correctly on any site without site-specific selectors. The LLM understands that "Your Daily Puzzles" is clutter, "ADVERTISEMENT" is junk, but the SF Chronicle masthead should stay. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: aggressive cleanup heuristics + preserve top nav bar Deterministic cleanup improvements (used as first pass before LLM analysis): - New 'clutter' category: audio players, podcast widgets, sidebar puzzles/games, recirculation widgets (taboola, outbrain, nativo), cross-promotion banners - Text-content detection: removes "ADVERTISEMENT", "Article continues below", "Sponsored", "Paid content" labels and their parent wrappers - Sticky fix: preserves the topmost full-width element near viewport top (site nav bar) instead of hiding all sticky/fixed elements. Sorts by vertical position, preserves the first one that spans >80% viewport width. Tests: clutter category, ad label removal, nav bar preservation logic. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test: LLM-based cleanup architecture, deterministic heuristics, sticky nav 22 new tests covering: - Cleanup button uses /sidebar-command (agent) not /command (deterministic) - Cleanup prompt includes deterministic first pass + agent snapshot analysis - Cleanup prompt lists specific clutter categories for agent guidance - Cleanup prompt preserves site identity (masthead, headline, body, byline) - Cleanup prompt instructs scroll unlock and $B eval removal - Loading state management (async agent, setTimeout) - Deterministic clutter: audio/podcast, games/puzzles, recirculation - Ad label text patterns (ADVERTISEMENT, Sponsored, Article continues) - Ad label parent wrapper hiding for small containers - Sticky nav preservation (sort by position, first full-width near top) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: prevent repeat chat message rendering on reconnect/replay Root cause: server persists chat to disk (chat.jsonl) and replays on restart. Client had no dedup, so every reconnect re-rendered the entire history. Messages from an old HN session would repeat endlessly on the SF Chronicle tab. Fix: renderedEntryIds Set tracks which entry IDs have been rendered. addChatEntry skips entries already in the set. Entries without an id (local notifications) bypass the check. Clear chat resets the set. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: agent stops when done, no focus stealing, opus for prompt injection safety Three fixes for sidebar agent UX: - System prompt: "Be CONCISE. STOP as soon as the task is done. Do NOT keep exploring or doing bonus work." Prevents agent from endlessly taking screenshots and highlighting elements after answering the question. - switchTab(id, opts): new bringToFront option. Internal tab pinning (BROWSE_TAB) uses bringToFront: false so agent commands never steal window focus from the user's active app. - Keep opus model (not sonnet) for prompt injection resistance on untrusted web pages. Remove Write from allowedTools (agent only needs Bash for $B). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test: agent conciseness, focus stealing, opus model, switchTab opts Tests for the three UX fixes: - System prompt contains STOP/CONCISE/Do NOT keep exploring - sidebar agent uses opus (not sonnet) for prompt injection resistance - switchTab has bringToFront option, defaults to true (opt-out) - handleCommand tab pinning uses bringToFront: false (no focus steal) - Updated stale tests: switchTab signature, allowedTools excludes Write, narration -> conciseness, tab pinning restore calls Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test: sidebar CSS interaction E2E — HN comment highlight round-trip New E2E test (periodic tier, ~$2/run) that exercises the full sidebar agent pipeline with CSS interaction: 1. Agent navigates to Hacker News 2. Clicks into the top story's comments 3. Reads comments and identifies the most insightful one 4. Highlights it with a 4px solid orange outline via style injection Tests: navigation, snapshot, text reading, LLM judgment, CSS modification. Requires real browser + real Claude (ANTHROPIC_API_KEY). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: sidebar CSS E2E test — correct idle timeout (ms not s), pipe stdio Root cause of test failure: BROWSE_IDLE_TIMEOUT is in milliseconds, not seconds. '600' = 0.6 seconds, server died immediately after health check. Fixed to '600000' (10 minutes). Also: use 'pipe' stdio instead of file descriptors (closing fds kills child on macOS/bun), catch ConnectionRefused on poll retry, 4 min poll timeout for the multi-step opus task. Test passes: agent navigates to HN, reads comments, identifies most insightful one, highlights it with orange CSS, stops. 114s, $0.00. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1195 lines
47 KiB
TypeScript
1195 lines
47 KiB
TypeScript
/**
|
|
* Tests for sidebar UX changes:
|
|
* - System prompt does not bake in page URL (navigation fix)
|
|
* - --resume is never used (stale context fix)
|
|
* - /sidebar-chat response includes agentStatus
|
|
* - Sidebar HTML has updated banner, placeholder, stop button
|
|
* - Narration instructions present in system prompt
|
|
*/
|
|
|
|
import { describe, test, expect } from 'bun:test';
|
|
import * as fs from 'fs';
|
|
import * as path from 'path';
|
|
|
|
const ROOT = path.resolve(__dirname, '..');
|
|
|
|
// ─── System prompt tests (server.ts spawnClaude) ─────────────────
|
|
|
|
describe('sidebar system prompt (server.ts)', () => {
|
|
const serverSrc = fs.readFileSync(path.join(ROOT, 'src', 'server.ts'), 'utf-8');
|
|
|
|
test('system prompt does not bake in page URL', () => {
|
|
// The old prompt had: `The user is currently viewing: ${pageUrl}`
|
|
// The new prompt should NOT contain this pattern
|
|
// Extract the systemPrompt array from spawnClaude
|
|
const promptSection = serverSrc.slice(
|
|
serverSrc.indexOf('const systemPrompt = ['),
|
|
serverSrc.indexOf("].join('\\n');", serverSrc.indexOf('const systemPrompt = [')) + 15,
|
|
);
|
|
expect(promptSection).not.toContain('currently viewing');
|
|
expect(promptSection).not.toContain('${pageUrl}');
|
|
});
|
|
|
|
test('system prompt tells agent to check URL before acting', () => {
|
|
const promptSection = serverSrc.slice(
|
|
serverSrc.indexOf('const systemPrompt = ['),
|
|
serverSrc.indexOf("].join('\\n');", serverSrc.indexOf('const systemPrompt = [')) + 15,
|
|
);
|
|
expect(promptSection).toContain('NEVER');
|
|
expect(promptSection).toContain('navigate back');
|
|
expect(promptSection).toContain('NEVER assume');
|
|
expect(promptSection).toContain('url`');
|
|
});
|
|
|
|
test('system prompt includes conciseness and stop instructions', () => {
|
|
const promptSection = serverSrc.slice(
|
|
serverSrc.indexOf('const systemPrompt = ['),
|
|
serverSrc.indexOf("].join('\\n');", serverSrc.indexOf('const systemPrompt = [')) + 15,
|
|
);
|
|
expect(promptSection).toContain('CONCISE');
|
|
expect(promptSection).toContain('STOP');
|
|
});
|
|
|
|
test('--resume is never used in spawnClaude args', () => {
|
|
// Extract the spawnClaude function
|
|
const fnStart = serverSrc.indexOf('function spawnClaude(');
|
|
const fnEnd = serverSrc.indexOf('\nfunction ', fnStart + 1);
|
|
const fnBody = serverSrc.slice(fnStart, fnEnd);
|
|
// Should not push --resume to args
|
|
expect(fnBody).not.toContain("'--resume'");
|
|
expect(fnBody).not.toContain('"--resume"');
|
|
});
|
|
|
|
test('system prompt includes inspect and style commands', () => {
|
|
const promptSection = serverSrc.slice(
|
|
serverSrc.indexOf('const systemPrompt = ['),
|
|
serverSrc.indexOf("].join('\\n');", serverSrc.indexOf('const systemPrompt = [')) + 15,
|
|
);
|
|
expect(promptSection).toContain('inspect');
|
|
expect(promptSection).toContain('style');
|
|
expect(promptSection).toContain('cleanup');
|
|
});
|
|
});
|
|
|
|
// ─── /sidebar-chat response includes agentStatus ─────────────────
|
|
|
|
describe('/sidebar-chat agentStatus', () => {
|
|
const serverSrc = fs.readFileSync(path.join(ROOT, 'src', 'server.ts'), 'utf-8');
|
|
|
|
test('sidebar-chat response includes agentStatus field', () => {
|
|
// Find the GET /sidebar-chat handler — look for the data response, not the auth error
|
|
const handlerStart = serverSrc.indexOf("url.pathname === '/sidebar-chat'");
|
|
// Find the response that returns entries + total (skip the auth error response)
|
|
const entriesResponse = serverSrc.indexOf('{ entries, total', handlerStart);
|
|
expect(entriesResponse).toBeGreaterThan(handlerStart);
|
|
const responseLine = serverSrc.slice(entriesResponse, entriesResponse + 100);
|
|
expect(responseLine).toContain('agentStatus');
|
|
});
|
|
});
|
|
|
|
// ─── Sidebar HTML tests ──────────────────────────────────────────
|
|
|
|
describe('sidebar HTML (sidepanel.html)', () => {
|
|
const html = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.html'), 'utf-8');
|
|
|
|
test('banner says "Browser co-pilot" not "Standalone mode"', () => {
|
|
expect(html).toContain('Browser co-pilot');
|
|
expect(html).not.toContain('Standalone mode');
|
|
});
|
|
|
|
test('input placeholder says "Ask about this page"', () => {
|
|
expect(html).toContain('Ask about this page');
|
|
expect(html).not.toContain('Message Claude Code');
|
|
});
|
|
|
|
test('stop button exists with id stop-agent-btn', () => {
|
|
expect(html).toContain('id="stop-agent-btn"');
|
|
expect(html).toContain('class="stop-btn"');
|
|
});
|
|
|
|
test('stop button is hidden by default', () => {
|
|
// The stop button should have style="display: none;" initially
|
|
const stopBtnMatch = html.match(/id="stop-agent-btn"[^>]*/);
|
|
expect(stopBtnMatch).not.toBeNull();
|
|
expect(stopBtnMatch![0]).toContain('display: none');
|
|
});
|
|
});
|
|
|
|
// ─── Sidebar JS tests ───────────────────────────────────────────
|
|
|
|
describe('sidebar JS (sidepanel.js)', () => {
|
|
const js = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.js'), 'utf-8');
|
|
|
|
test('stopAgent function exists', () => {
|
|
expect(js).toContain('async function stopAgent()');
|
|
});
|
|
|
|
test('stopAgent calls /sidebar-agent/stop endpoint', () => {
|
|
expect(js).toContain('/sidebar-agent/stop');
|
|
});
|
|
|
|
test('stop button click handler is wired up', () => {
|
|
expect(js).toContain("getElementById('stop-agent-btn')");
|
|
expect(js).toContain('stopAgent');
|
|
});
|
|
|
|
test('updateStopButton function exists', () => {
|
|
expect(js).toContain('function updateStopButton(');
|
|
});
|
|
|
|
test('agent_start shows stop button', () => {
|
|
// Find the agent_start handler and verify it calls updateStopButton(true)
|
|
const startHandler = js.slice(
|
|
js.indexOf("entry.type === 'agent_start'"),
|
|
js.indexOf("entry.type === 'agent_done'"),
|
|
);
|
|
expect(startHandler).toContain('updateStopButton(true)');
|
|
});
|
|
|
|
test('agent_done hides stop button', () => {
|
|
const doneHandler = js.slice(
|
|
js.indexOf("entry.type === 'agent_done'"),
|
|
js.indexOf("entry.type === 'agent_error'"),
|
|
);
|
|
expect(doneHandler).toContain('updateStopButton(false)');
|
|
});
|
|
|
|
test('agent_error hides stop button', () => {
|
|
const errorIdx = js.indexOf("entry.type === 'agent_error'");
|
|
const errorHandler = js.slice(errorIdx, errorIdx + 500);
|
|
expect(errorHandler).toContain('updateStopButton(false)');
|
|
});
|
|
|
|
test('orphaned thinking cleanup checks agentStatus from server', () => {
|
|
// After polling, if agentStatus !== processing, thinking dots are removed
|
|
expect(js).toContain("data.agentStatus !== 'processing'");
|
|
});
|
|
|
|
test('orphaned thinking cleanup adds (session ended) notice', () => {
|
|
expect(js).toContain('(session ended)');
|
|
});
|
|
|
|
test('sendMessage renders user bubble + thinking dots optimistically', () => {
|
|
// sendMessage should create user bubble and agent-thinking BEFORE the server responds
|
|
const sendFn = js.slice(js.indexOf('async function sendMessage()'), js.indexOf('async function sendMessage()') + 2000);
|
|
expect(sendFn).toContain('chat-bubble user');
|
|
expect(sendFn).toContain('agent-thinking');
|
|
expect(sendFn).toContain('lastOptimisticMsg');
|
|
});
|
|
|
|
test('fast polling during agent execution (300ms), slow when idle (1000ms)', () => {
|
|
expect(js).toContain('FAST_POLL_MS');
|
|
expect(js).toContain('SLOW_POLL_MS');
|
|
expect(js).toContain('startFastPoll');
|
|
expect(js).toContain('stopFastPoll');
|
|
// Fast = 300ms
|
|
expect(js).toContain('300');
|
|
// Slow = 1000ms
|
|
expect(js).toContain('1000');
|
|
});
|
|
|
|
test('agent_done calls stopFastPoll', () => {
|
|
const doneHandler = js.slice(
|
|
js.indexOf("entry.type === 'agent_done'"),
|
|
js.indexOf("entry.type === 'agent_error'"),
|
|
);
|
|
expect(doneHandler).toContain('stopFastPoll');
|
|
});
|
|
|
|
test('duplicate user bubble prevention via lastOptimisticMsg', () => {
|
|
expect(js).toContain('lastOptimisticMsg');
|
|
// When polled message matches optimistic, skip rendering
|
|
expect(js).toContain('lastOptimisticMsg === entry.message');
|
|
});
|
|
});
|
|
|
|
// ─── Sidebar agent queue poll (sidebar-agent.ts) ─────────────────
|
|
|
|
describe('sidebar agent queue poll (sidebar-agent.ts)', () => {
|
|
const agentSrc = fs.readFileSync(path.join(ROOT, 'src', 'sidebar-agent.ts'), 'utf-8');
|
|
|
|
test('queue poll interval is 200ms or less for fast TTFO', () => {
|
|
const match = agentSrc.match(/const POLL_MS\s*=\s*(\d+)/);
|
|
expect(match).not.toBeNull();
|
|
const pollMs = parseInt(match![1], 10);
|
|
expect(pollMs).toBeLessThanOrEqual(200);
|
|
});
|
|
});
|
|
|
|
// ─── System prompt size (TTFO optimization) ──────────────────────
|
|
|
|
describe('system prompt size', () => {
|
|
const serverSrc = fs.readFileSync(path.join(ROOT, 'src', 'server.ts'), 'utf-8');
|
|
|
|
test('system prompt is compact (under 30 lines)', () => {
|
|
const start = serverSrc.indexOf('const systemPrompt = [');
|
|
const end = serverSrc.indexOf("].join('\\n');", start);
|
|
const promptBlock = serverSrc.slice(start, end);
|
|
const lines = promptBlock.split('\n').length;
|
|
// Compact prompt = fewer input tokens = faster first response
|
|
// Higher limit accommodates security lines (prompt injection defense, allowed commands)
|
|
expect(lines).toBeLessThan(30);
|
|
});
|
|
|
|
test('system prompt does not contain verbose narration examples', () => {
|
|
// We trimmed examples to reduce token count. The agent gets the
|
|
// instruction to narrate, not 6 examples of how.
|
|
const start = serverSrc.indexOf('const systemPrompt = [');
|
|
const end = serverSrc.indexOf("].join('\\n');", start);
|
|
const promptBlock = serverSrc.slice(start, end);
|
|
expect(promptBlock).not.toContain('Examples of good narration');
|
|
expect(promptBlock).not.toContain('I can see a login form');
|
|
});
|
|
});
|
|
|
|
// ─── TTFO latency chain invariants ──────────────────────────────
|
|
|
|
describe('TTFO latency chain', () => {
|
|
const js = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.js'), 'utf-8');
|
|
const agentSrc = fs.readFileSync(path.join(ROOT, 'src', 'sidebar-agent.ts'), 'utf-8');
|
|
|
|
test('optimistic render happens BEFORE chrome.runtime.sendMessage', () => {
|
|
// In sendMessage(), the bubble + thinking dots must be created
|
|
// before the async POST to the server
|
|
const sendFn = js.slice(
|
|
js.indexOf('async function sendMessage()'),
|
|
js.indexOf('async function sendMessage()') + 3000,
|
|
);
|
|
const optimisticIdx = sendFn.indexOf('agent-thinking');
|
|
const sendIdx = sendFn.indexOf('chrome.runtime.sendMessage');
|
|
expect(optimisticIdx).toBeGreaterThan(0);
|
|
expect(sendIdx).toBeGreaterThan(0);
|
|
expect(optimisticIdx).toBeLessThan(sendIdx);
|
|
});
|
|
|
|
test('sendMessage calls startFastPoll before server request', () => {
|
|
const sendFn = js.slice(
|
|
js.indexOf('async function sendMessage()'),
|
|
js.indexOf('async function sendMessage()') + 3000,
|
|
);
|
|
const fastPollIdx = sendFn.indexOf('startFastPoll');
|
|
const sendIdx = sendFn.indexOf('chrome.runtime.sendMessage');
|
|
expect(fastPollIdx).toBeGreaterThan(0);
|
|
expect(fastPollIdx).toBeLessThan(sendIdx);
|
|
});
|
|
|
|
test('agent_start from server does not duplicate thinking dots', () => {
|
|
// When we already showed dots optimistically, agent_start from
|
|
// the poll should skip creating a second set
|
|
const startHandler = js.slice(
|
|
js.indexOf("entry.type === 'agent_start'"),
|
|
js.indexOf("entry.type === 'agent_done'"),
|
|
);
|
|
expect(startHandler).toContain('agent-thinking');
|
|
// Should check if thinking already exists and skip
|
|
expect(startHandler).toContain("getElementById('agent-thinking')");
|
|
});
|
|
|
|
test('FAST_POLL_MS is strictly less than SLOW_POLL_MS', () => {
|
|
const fastMatch = js.match(/FAST_POLL_MS\s*=\s*(\d+)/);
|
|
const slowMatch = js.match(/SLOW_POLL_MS\s*=\s*(\d+)/);
|
|
expect(fastMatch).not.toBeNull();
|
|
expect(slowMatch).not.toBeNull();
|
|
expect(parseInt(fastMatch![1], 10)).toBeLessThan(parseInt(slowMatch![1], 10));
|
|
});
|
|
|
|
test('stopAgent also calls stopFastPoll', () => {
|
|
const stopFn = js.slice(
|
|
js.indexOf('async function stopAgent()'),
|
|
js.indexOf('async function stopAgent()') + 800,
|
|
);
|
|
expect(stopFn).toContain('stopFastPoll');
|
|
});
|
|
});
|
|
|
|
// ─── Browser tab bar ────────────────────────────────────────────
|
|
|
|
describe('browser tab bar (server.ts)', () => {
|
|
const serverSrc = fs.readFileSync(path.join(ROOT, 'src', 'server.ts'), 'utf-8');
|
|
|
|
test('/sidebar-tabs endpoint exists', () => {
|
|
expect(serverSrc).toContain("/sidebar-tabs'");
|
|
expect(serverSrc).toContain('getTabListWithTitles');
|
|
});
|
|
|
|
test('/sidebar-tabs/switch endpoint exists', () => {
|
|
expect(serverSrc).toContain("/sidebar-tabs/switch'");
|
|
expect(serverSrc).toContain('switchTab');
|
|
});
|
|
|
|
test('/sidebar-tabs requires auth', () => {
|
|
// Find the handler and verify auth check
|
|
const handlerIdx = serverSrc.indexOf("/sidebar-tabs'");
|
|
const handlerBlock = serverSrc.slice(handlerIdx, handlerIdx + 300);
|
|
expect(handlerBlock).toContain('validateAuth');
|
|
});
|
|
});
|
|
|
|
describe('browser tab bar (sidepanel.js)', () => {
|
|
const js = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.js'), 'utf-8');
|
|
|
|
test('pollTabs function exists and calls /sidebar-tabs', () => {
|
|
expect(js).toContain('async function pollTabs()');
|
|
expect(js).toContain('/sidebar-tabs');
|
|
});
|
|
|
|
test('renderTabBar function exists', () => {
|
|
expect(js).toContain('function renderTabBar(tabs)');
|
|
});
|
|
|
|
test('tab bar hidden when only 1 tab', () => {
|
|
const renderFn = js.slice(
|
|
js.indexOf('function renderTabBar('),
|
|
js.indexOf('function renderTabBar(') + 600,
|
|
);
|
|
expect(renderFn).toContain('tabs.length <= 1');
|
|
expect(renderFn).toContain("display = 'none'");
|
|
});
|
|
|
|
test('switchBrowserTab calls /sidebar-tabs/switch', () => {
|
|
expect(js).toContain('async function switchBrowserTab(');
|
|
expect(js).toContain('/sidebar-tabs/switch');
|
|
});
|
|
|
|
test('tab polling interval is set on connection', () => {
|
|
expect(js).toContain('tabPollInterval');
|
|
expect(js).toContain('setInterval(pollTabs');
|
|
});
|
|
|
|
test('tab polling cleaned up on disconnect', () => {
|
|
expect(js).toContain('clearInterval(tabPollInterval)');
|
|
});
|
|
|
|
test('only re-renders when tabs change (diff check)', () => {
|
|
expect(js).toContain('lastTabJson');
|
|
expect(js).toContain('json === lastTabJson');
|
|
});
|
|
});
|
|
|
|
describe('browser tab bar (sidepanel.html)', () => {
|
|
const html = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.html'), 'utf-8');
|
|
|
|
test('browser-tabs container exists', () => {
|
|
expect(html).toContain('id="browser-tabs"');
|
|
});
|
|
|
|
test('browser-tabs hidden by default', () => {
|
|
const match = html.match(/id="browser-tabs"[^>]*/);
|
|
expect(match).not.toBeNull();
|
|
expect(match![0]).toContain('display:none');
|
|
});
|
|
});
|
|
|
|
// ─── Bidirectional tab sync ──────────────────────────────────────
|
|
|
|
describe('sidebar→browser tab switch', () => {
|
|
const bmSrc = fs.readFileSync(path.join(ROOT, 'src', 'browser-manager.ts'), 'utf-8');
|
|
|
|
test('switchTab supports bringToFront option', () => {
|
|
expect(bmSrc).toContain('switchTab(id: number, opts?');
|
|
expect(bmSrc).toContain('bringToFront');
|
|
// Default behavior still brings to front (opt-out, not opt-in)
|
|
expect(bmSrc).toContain('bringToFront !== false');
|
|
});
|
|
});
|
|
|
|
describe('browser→sidebar tab sync', () => {
|
|
const bmSrc = fs.readFileSync(path.join(ROOT, 'src', 'browser-manager.ts'), 'utf-8');
|
|
const serverSrc = fs.readFileSync(path.join(ROOT, 'src', 'server.ts'), 'utf-8');
|
|
const js = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.js'), 'utf-8');
|
|
|
|
test('syncActiveTabByUrl method exists on BrowserManager', () => {
|
|
expect(bmSrc).toContain('syncActiveTabByUrl(activeUrl: string)');
|
|
});
|
|
|
|
test('syncActiveTabByUrl updates activeTabId when URL matches a different tab', () => {
|
|
const fn = bmSrc.slice(
|
|
bmSrc.indexOf('syncActiveTabByUrl('),
|
|
bmSrc.indexOf('syncActiveTabByUrl(') + 1200,
|
|
);
|
|
expect(fn).toContain('this.activeTabId = id');
|
|
// Exact match
|
|
expect(fn).toContain('pageUrl === activeUrl');
|
|
// Fuzzy match (origin+pathname)
|
|
expect(fn).toContain('activeOriginPath');
|
|
expect(fn).toContain('fuzzyId');
|
|
});
|
|
|
|
test('context.on("page") tracks user-created tabs', () => {
|
|
expect(bmSrc).toContain("context.on('page'");
|
|
expect(bmSrc).toContain('this.pages.set(id, page)');
|
|
// Should log when new tab detected
|
|
expect(bmSrc).toContain('New tab detected');
|
|
});
|
|
|
|
test('page close handler removes tab from pages map', () => {
|
|
expect(bmSrc).toContain("page.on('close'");
|
|
expect(bmSrc).toContain('this.pages.delete(id)');
|
|
expect(bmSrc).toContain('Tab closed');
|
|
});
|
|
|
|
test('syncActiveTabByUrl skips when only 1 tab (no ambiguity)', () => {
|
|
const fn = bmSrc.slice(
|
|
bmSrc.indexOf('syncActiveTabByUrl('),
|
|
bmSrc.indexOf('syncActiveTabByUrl(') + 600,
|
|
);
|
|
expect(fn).toContain('this.pages.size <= 1');
|
|
});
|
|
|
|
test('/sidebar-tabs reads activeUrl param and calls syncActiveTabByUrl', () => {
|
|
const handler = serverSrc.slice(
|
|
serverSrc.indexOf("/sidebar-tabs'"),
|
|
serverSrc.indexOf("/sidebar-tabs'") + 500,
|
|
);
|
|
expect(handler).toContain("get('activeUrl')");
|
|
expect(handler).toContain('syncActiveTabByUrl');
|
|
});
|
|
|
|
test('/sidebar-command syncs activeTabUrl BEFORE reading tabId', () => {
|
|
// The server must call syncActiveTabByUrl before getActiveTabId
|
|
// so the agent targets the correct tab
|
|
const cmdIdx = serverSrc.indexOf("url.pathname === '/sidebar-command'");
|
|
const handler = serverSrc.slice(cmdIdx, cmdIdx + 1200);
|
|
const syncIdx = handler.indexOf('syncActiveTabByUrl');
|
|
const getIdIdx = handler.indexOf('getActiveTabId');
|
|
expect(syncIdx).toBeGreaterThan(0);
|
|
expect(getIdIdx).toBeGreaterThan(syncIdx); // sync happens BEFORE reading ID
|
|
});
|
|
|
|
test('background.js listens for chrome.tabs.onActivated', () => {
|
|
const bgSrc = fs.readFileSync(path.join(ROOT, '..', 'extension', 'background.js'), 'utf-8');
|
|
expect(bgSrc).toContain('chrome.tabs.onActivated.addListener');
|
|
expect(bgSrc).toContain('browserTabActivated');
|
|
});
|
|
|
|
test('sidepanel handles browserTabActivated message instantly', () => {
|
|
expect(js).toContain("msg.type === 'browserTabActivated'");
|
|
// Should call switchChatTab for instant context swap
|
|
expect(js).toContain('switchChatTab');
|
|
});
|
|
|
|
test('pollTabs sends Chrome active tab URL to server', () => {
|
|
const pollFn = js.slice(
|
|
js.indexOf('async function pollTabs()'),
|
|
js.indexOf('async function pollTabs()') + 800,
|
|
);
|
|
expect(pollFn).toContain('chrome.tabs.query');
|
|
expect(pollFn).toContain('activeUrl=');
|
|
});
|
|
});
|
|
|
|
describe('browser tab bar (sidepanel.css)', () => {
|
|
const css = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.css'), 'utf-8');
|
|
|
|
test('browser-tabs styles exist', () => {
|
|
expect(css).toContain('.browser-tabs');
|
|
expect(css).toContain('.browser-tab');
|
|
expect(css).toContain('.browser-tab.active');
|
|
});
|
|
|
|
test('tab bar is horizontally scrollable', () => {
|
|
const barStyle = css.slice(
|
|
css.indexOf('.browser-tabs {'),
|
|
css.indexOf('}', css.indexOf('.browser-tabs {')) + 1,
|
|
);
|
|
expect(barStyle).toContain('overflow-x: auto');
|
|
});
|
|
|
|
test('active tab is visually distinct', () => {
|
|
const activeStyle = css.slice(
|
|
css.indexOf('.browser-tab.active {'),
|
|
css.indexOf('}', css.indexOf('.browser-tab.active {')) + 1,
|
|
);
|
|
expect(activeStyle).toContain('--bg-surface');
|
|
expect(activeStyle).toContain('--text-body');
|
|
});
|
|
});
|
|
|
|
// ─── Event relay (processAgentEvent) ────────────────────────────
|
|
|
|
describe('processAgentEvent handles sidebar-agent event types', () => {
|
|
const serverSrc = fs.readFileSync(path.join(ROOT, 'src', 'server.ts'), 'utf-8');
|
|
|
|
// Extract processAgentEvent function body
|
|
const fnStart = serverSrc.indexOf('function processAgentEvent(');
|
|
const fnEnd = serverSrc.indexOf('\nfunction ', fnStart + 1);
|
|
const fnBody = serverSrc.slice(fnStart, fnEnd > fnStart ? fnEnd : fnStart + 2000);
|
|
|
|
test('handles tool_use events directly (not raw Claude stream format)', () => {
|
|
// Must handle { type: 'tool_use', tool, input } from sidebar-agent
|
|
expect(fnBody).toContain("event.type === 'tool_use'");
|
|
expect(fnBody).toContain('event.tool');
|
|
expect(fnBody).toContain('event.input');
|
|
});
|
|
|
|
test('handles text_delta events directly', () => {
|
|
expect(fnBody).toContain("event.type === 'text_delta'");
|
|
expect(fnBody).toContain('event.text');
|
|
});
|
|
|
|
test('handles text events directly', () => {
|
|
expect(fnBody).toContain("event.type === 'text'");
|
|
});
|
|
|
|
test('handles result events', () => {
|
|
expect(fnBody).toContain("event.type === 'result'");
|
|
});
|
|
|
|
test('handles agent_error events', () => {
|
|
expect(fnBody).toContain("event.type === 'agent_error'");
|
|
expect(fnBody).toContain('event.error');
|
|
});
|
|
|
|
test('does NOT re-parse raw Claude stream events (no content_block_start)', () => {
|
|
// sidebar-agent.ts already transforms these. Server should not duplicate.
|
|
expect(fnBody).not.toContain('content_block_start');
|
|
expect(fnBody).not.toContain('content_block_delta');
|
|
expect(fnBody).not.toContain("event.type === 'assistant'");
|
|
});
|
|
|
|
test('all event types call addChatEntry with role: agent', () => {
|
|
// Every addChatEntry in processAgentEvent should have role: 'agent'
|
|
const addCalls = fnBody.match(/addChatEntry\(\{[^}]+\}\)/g) || [];
|
|
for (const call of addCalls) {
|
|
expect(call).toContain("role: 'agent'");
|
|
}
|
|
});
|
|
});
|
|
|
|
// ─── Per-tab chat context ────────────────────────────────────────
|
|
|
|
describe('per-tab chat context (server.ts)', () => {
|
|
const serverSrc = fs.readFileSync(path.join(ROOT, 'src', 'server.ts'), 'utf-8');
|
|
|
|
test('/sidebar-chat accepts tabId query param', () => {
|
|
const handler = serverSrc.slice(
|
|
serverSrc.indexOf("/sidebar-chat'"),
|
|
serverSrc.indexOf("/sidebar-chat'") + 600,
|
|
);
|
|
expect(handler).toContain('tabId');
|
|
});
|
|
|
|
test('addChatEntry takes a tabId parameter', () => {
|
|
// addChatEntry should route entries to the correct tab's buffer
|
|
expect(serverSrc).toContain('tabId');
|
|
// Look for tabId in addChatEntry function
|
|
const fnIdx = serverSrc.indexOf('function addChatEntry(');
|
|
if (fnIdx > -1) {
|
|
const fnBody = serverSrc.slice(fnIdx, fnIdx + 300);
|
|
expect(fnBody).toContain('tabId');
|
|
}
|
|
});
|
|
|
|
test('spawnClaude passes active tab ID to queue entry', () => {
|
|
const spawnFn = serverSrc.slice(
|
|
serverSrc.indexOf('function spawnClaude('),
|
|
serverSrc.indexOf('\nfunction ', serverSrc.indexOf('function spawnClaude(') + 1),
|
|
);
|
|
expect(spawnFn).toContain('tabId');
|
|
});
|
|
|
|
test('tab isolation uses BROWSE_TAB env var instead of system prompt hack', () => {
|
|
const agentSrc = fs.readFileSync(path.join(ROOT, 'src', 'sidebar-agent.ts'), 'utf-8');
|
|
// Agent passes BROWSE_TAB env var to claude (not a system prompt instruction)
|
|
expect(agentSrc).toContain('BROWSE_TAB');
|
|
// Server handleCommand reads tabId from body and pins to that tab
|
|
expect(serverSrc).toContain('savedTabId');
|
|
expect(serverSrc).toContain('switchTab(tabId)');
|
|
});
|
|
});
|
|
|
|
describe('per-tab chat context (sidepanel.js)', () => {
|
|
const js = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.js'), 'utf-8');
|
|
|
|
test('tracks activeTabId for chat context', () => {
|
|
expect(js).toContain('activeTabId');
|
|
});
|
|
|
|
test('pollChat sends tabId to server', () => {
|
|
const pollFn = js.slice(
|
|
js.indexOf('async function pollChat()'),
|
|
js.indexOf('async function pollChat()') + 600,
|
|
);
|
|
expect(pollFn).toContain('tabId');
|
|
});
|
|
|
|
test('switching tabs swaps displayed chat', () => {
|
|
// When tab changes, old chat is saved and new tab's chat is shown
|
|
expect(js).toContain('switchChatTab');
|
|
});
|
|
|
|
test('switchChatTab saves current tab DOM and restores new tab', () => {
|
|
const fn = js.slice(
|
|
js.indexOf('function switchChatTab('),
|
|
js.indexOf('function switchChatTab(') + 800,
|
|
);
|
|
expect(fn).toContain('chatDomByTab');
|
|
expect(fn).toContain('innerHTML');
|
|
});
|
|
|
|
test('sendMessage includes tabId in message', () => {
|
|
const sendFn = js.slice(
|
|
js.indexOf('async function sendMessage()'),
|
|
js.indexOf('async function sendMessage()') + 2000,
|
|
);
|
|
expect(sendFn).toContain('tabId');
|
|
expect(sendFn).toContain('sidebarActiveTabId');
|
|
});
|
|
});
|
|
|
|
// ─── Sidebar CSS tests ──────────────────────────────────────────
|
|
|
|
describe('sidebar CSS (sidepanel.css)', () => {
|
|
const css = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.css'), 'utf-8');
|
|
|
|
test('stop button style exists', () => {
|
|
expect(css).toContain('.stop-btn');
|
|
});
|
|
|
|
test('stop button uses error color', () => {
|
|
const stopBtnSection = css.slice(
|
|
css.indexOf('.stop-btn {'),
|
|
css.indexOf('}', css.indexOf('.stop-btn {')) + 1,
|
|
);
|
|
expect(stopBtnSection).toContain('--error');
|
|
});
|
|
|
|
test('experimental-banner no longer uses amber warning colors', () => {
|
|
const bannerSection = css.slice(
|
|
css.indexOf('.experimental-banner {'),
|
|
css.indexOf('}', css.indexOf('.experimental-banner {')) + 1,
|
|
);
|
|
// Should not be amber/warning anymore
|
|
expect(bannerSection).not.toContain('245, 158, 11, 0.15');
|
|
expect(bannerSection).not.toContain('#F59E0B');
|
|
});
|
|
|
|
test('tool description uses system font not mono', () => {
|
|
const toolSection = css.slice(
|
|
css.indexOf('.agent-tool {'),
|
|
css.indexOf('}', css.indexOf('.agent-tool {')) + 1,
|
|
);
|
|
expect(toolSection).toContain('font-system');
|
|
expect(toolSection).not.toContain('font-mono');
|
|
});
|
|
});
|
|
|
|
// ─── Inspector message allowlist fix ────────────────────────────
|
|
|
|
describe('inspector message allowlist fix', () => {
|
|
const bgSrc = fs.readFileSync(path.join(ROOT, '..', 'extension', 'background.js'), 'utf-8');
|
|
|
|
test('ALLOWED_TYPES includes inspector message types', () => {
|
|
const allowListSection = bgSrc.slice(
|
|
bgSrc.indexOf('const ALLOWED_TYPES'),
|
|
bgSrc.indexOf(']);', bgSrc.indexOf('const ALLOWED_TYPES')) + 3,
|
|
);
|
|
expect(allowListSection).toContain('startInspector');
|
|
expect(allowListSection).toContain('stopInspector');
|
|
expect(allowListSection).toContain('elementPicked');
|
|
expect(allowListSection).toContain('pickerCancelled');
|
|
expect(allowListSection).toContain('applyStyle');
|
|
expect(allowListSection).toContain('inspectResult');
|
|
});
|
|
});
|
|
|
|
// ─── CSP fallback basic picker ──────────────────────────────────
|
|
|
|
describe('CSP fallback basic picker', () => {
|
|
const contentSrc = fs.readFileSync(path.join(ROOT, '..', 'extension', 'content.js'), 'utf-8');
|
|
const bgSrc = fs.readFileSync(path.join(ROOT, '..', 'extension', 'background.js'), 'utf-8');
|
|
|
|
test('content.js contains startBasicPicker message handler', () => {
|
|
expect(contentSrc).toContain("msg.type === 'startBasicPicker'");
|
|
expect(contentSrc).toContain('startBasicPicker()');
|
|
});
|
|
|
|
test('content.js contains captureBasicData function with getComputedStyle', () => {
|
|
expect(contentSrc).toContain('function captureBasicData(');
|
|
expect(contentSrc).toContain('getComputedStyle(');
|
|
expect(contentSrc).toContain('getBoundingClientRect()');
|
|
});
|
|
|
|
test('content.js contains CSSOM iteration with cross-origin try/catch', () => {
|
|
expect(contentSrc).toContain('document.styleSheets');
|
|
expect(contentSrc).toContain('cssRules');
|
|
expect(contentSrc).toContain('cross-origin');
|
|
});
|
|
|
|
test('content.js saves and restores outline on elements', () => {
|
|
expect(contentSrc).toContain('basicPickerSavedOutline');
|
|
// Outline is restored in cleanup and highlight functions
|
|
expect(contentSrc).toContain('.style.outline = basicPickerSavedOutline');
|
|
});
|
|
|
|
test('content.js basic picker sends inspectResult with mode basic', () => {
|
|
expect(contentSrc).toContain("mode: 'basic'");
|
|
expect(contentSrc).toContain("type: 'inspectResult'");
|
|
});
|
|
|
|
test('content.js basic picker cleans up on Escape', () => {
|
|
expect(contentSrc).toContain('onBasicKeydown');
|
|
expect(contentSrc).toContain("e.key === 'Escape'");
|
|
expect(contentSrc).toContain('basicPickerCleanup');
|
|
});
|
|
|
|
test('background.js injectInspector has separate try blocks for executeScript and insertCSS', () => {
|
|
const injectFn = bgSrc.slice(
|
|
bgSrc.indexOf('async function injectInspector('),
|
|
bgSrc.indexOf('\n}', bgSrc.indexOf('async function injectInspector(') + 1) + 2,
|
|
);
|
|
// executeScript and insertCSS should be in separate try blocks
|
|
expect(injectFn).toContain('executeScript');
|
|
expect(injectFn).toContain('insertCSS');
|
|
// Fallback sends startBasicPicker
|
|
expect(injectFn).toContain("type: 'startBasicPicker'");
|
|
expect(injectFn).toContain("mode: 'basic'");
|
|
});
|
|
|
|
test('background.js stores inspectorMode for routing', () => {
|
|
expect(bgSrc).toContain('inspectorMode');
|
|
});
|
|
});
|
|
|
|
// ─── Cleanup and screenshot buttons ─────────────────────────────
|
|
|
|
describe('cleanup and screenshot buttons', () => {
|
|
const html = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.html'), 'utf-8');
|
|
const js = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.js'), 'utf-8');
|
|
const css = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.css'), 'utf-8');
|
|
|
|
test('sidepanel.html contains cleanup and screenshot buttons in inspector', () => {
|
|
expect(html).toContain('inspector-cleanup-btn');
|
|
expect(html).toContain('inspector-screenshot-btn');
|
|
expect(html).toContain('inspector-action-btn');
|
|
});
|
|
|
|
test('sidepanel.html contains cleanup and screenshot buttons in chat toolbar', () => {
|
|
expect(html).toContain('chat-cleanup-btn');
|
|
expect(html).toContain('chat-screenshot-btn');
|
|
expect(html).toContain('quick-actions');
|
|
});
|
|
|
|
test('cleanup button sends smart prompt to sidebar agent (not just deterministic selectors)', () => {
|
|
// Should use /sidebar-command endpoint (agent-based) not just /command (deterministic)
|
|
const cleanupFn = js.slice(
|
|
js.indexOf('async function runCleanup('),
|
|
js.indexOf('async function runScreenshot('),
|
|
);
|
|
expect(cleanupFn).toContain('sidebar-command');
|
|
expect(cleanupFn).toContain('cleanupPrompt');
|
|
// Should include both deterministic first pass AND agent snapshot analysis
|
|
expect(cleanupFn).toContain('cleanup --all');
|
|
expect(cleanupFn).toContain('snapshot -i');
|
|
// Should instruct agent to KEEP site branding
|
|
expect(cleanupFn).toContain('KEEP');
|
|
expect(cleanupFn).toContain('header/masthead/logo');
|
|
});
|
|
|
|
test('sidepanel.js screenshot handler POSTs to /command with screenshot', () => {
|
|
expect(js).toContain("command: 'screenshot'");
|
|
});
|
|
|
|
test('sidepanel.js has notification rendering for type notification', () => {
|
|
expect(js).toContain("entry.type === 'notification'");
|
|
expect(js).toContain('chat-notification');
|
|
});
|
|
|
|
test('sidepanel.css contains inspector-action-btn styles', () => {
|
|
expect(css).toContain('.inspector-action-btn');
|
|
expect(css).toContain('.inspector-action-btn.loading');
|
|
});
|
|
|
|
test('sidepanel.css contains quick-action-btn styles for chat toolbar', () => {
|
|
expect(css).toContain('.quick-action-btn');
|
|
expect(css).toContain('.quick-action-btn.loading');
|
|
expect(css).toContain('.quick-actions');
|
|
});
|
|
|
|
test('cleanup and screenshot use shared helper functions', () => {
|
|
expect(js).toContain('async function runCleanup(');
|
|
expect(js).toContain('async function runScreenshot(');
|
|
// Both inspector and chat buttons are wired
|
|
expect(js).toContain('chatCleanupBtn');
|
|
expect(js).toContain('chatScreenshotBtn');
|
|
});
|
|
|
|
test('sidepanel.css contains chat-notification styles', () => {
|
|
expect(css).toContain('.chat-notification');
|
|
});
|
|
});
|
|
|
|
describe('cleanup heuristics (write-commands.ts)', () => {
|
|
const wcSrc = fs.readFileSync(path.join(ROOT, 'src', 'write-commands.ts'), 'utf-8');
|
|
|
|
test('cleanup defaults to --all when no args provided', () => {
|
|
// Should not throw on empty args, should default to doAll
|
|
expect(wcSrc).toContain('if (args.length === 0)');
|
|
expect(wcSrc).toContain('doAll = true');
|
|
});
|
|
|
|
test('CLEANUP_SELECTORS has overlays category', () => {
|
|
expect(wcSrc).toContain('overlays: [');
|
|
expect(wcSrc).toContain('paywall');
|
|
expect(wcSrc).toContain('newsletter');
|
|
expect(wcSrc).toContain('interstitial');
|
|
expect(wcSrc).toContain('push-notification');
|
|
expect(wcSrc).toContain('app-banner');
|
|
});
|
|
|
|
test('CLEANUP_SELECTORS ads has major ad networks', () => {
|
|
expect(wcSrc).toContain('doubleclick');
|
|
expect(wcSrc).toContain('googlesyndication');
|
|
expect(wcSrc).toContain('amazon-adsystem');
|
|
expect(wcSrc).toContain('outbrain');
|
|
expect(wcSrc).toContain('taboola');
|
|
expect(wcSrc).toContain('criteo');
|
|
});
|
|
|
|
test('CLEANUP_SELECTORS cookies has major consent frameworks', () => {
|
|
expect(wcSrc).toContain('onetrust');
|
|
expect(wcSrc).toContain('CybotCookiebot');
|
|
expect(wcSrc).toContain('truste');
|
|
expect(wcSrc).toContain('qc-cmp2');
|
|
expect(wcSrc).toContain('Quantcast');
|
|
});
|
|
|
|
test('cleanup uses !important to override inline styles', () => {
|
|
// Elements with inline style="display:block" need !important to hide
|
|
expect(wcSrc).toContain("setProperty('display', 'none', 'important')");
|
|
});
|
|
|
|
test('cleanup unlocks scroll (body overflow:hidden)', () => {
|
|
expect(wcSrc).toContain("overflow === 'hidden'");
|
|
expect(wcSrc).toContain("setProperty('overflow', 'auto', 'important')");
|
|
});
|
|
|
|
test('cleanup removes blur effects (paywall blur)', () => {
|
|
expect(wcSrc).toContain("filter?.includes('blur')");
|
|
expect(wcSrc).toContain("setProperty('filter', 'none', 'important')");
|
|
});
|
|
|
|
test('cleanup removes article truncation (max-height)', () => {
|
|
expect(wcSrc).toContain('truncat');
|
|
expect(wcSrc).toContain("setProperty('max-height', 'none', 'important')");
|
|
});
|
|
|
|
test('cleanup collapses empty ad placeholder whitespace', () => {
|
|
expect(wcSrc).toContain('empty placeholders');
|
|
// Should check text content length before collapsing
|
|
expect(wcSrc).toContain('text.length < 20');
|
|
});
|
|
|
|
test('sticky cleanup skips gstack control indicator', () => {
|
|
expect(wcSrc).toContain("gstack-ctrl");
|
|
});
|
|
|
|
test('CLEANUP_SELECTORS has clutter category', () => {
|
|
expect(wcSrc).toContain('clutter: [');
|
|
expect(wcSrc).toContain('audio-player');
|
|
expect(wcSrc).toContain('podcast-player');
|
|
expect(wcSrc).toContain('puzzle');
|
|
expect(wcSrc).toContain('recirculation');
|
|
expect(wcSrc).toContain('everlit');
|
|
});
|
|
|
|
test('cleanup removes "ADVERTISEMENT" text labels', () => {
|
|
expect(wcSrc).toContain('adTextPatterns');
|
|
expect(wcSrc).toContain('/^advertisement$/i');
|
|
expect(wcSrc).toContain('/article continues/i');
|
|
expect(wcSrc).toContain('ad labels');
|
|
});
|
|
|
|
test('sticky cleanup preserves topmost full-width nav bar', () => {
|
|
// Should preserve the first full-width element near the top
|
|
expect(wcSrc).toContain('preservedTopNav');
|
|
expect(wcSrc).toContain('viewportWidth * 0.8');
|
|
// Should sort sticky elements by vertical position
|
|
expect(wcSrc).toContain('sort((a, b) => a.top - b.top)');
|
|
});
|
|
});
|
|
|
|
describe('chat toolbar buttons disabled state', () => {
|
|
const js = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.js'), 'utf-8');
|
|
const css = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.css'), 'utf-8');
|
|
|
|
test('setActionButtonsEnabled function exists', () => {
|
|
expect(js).toContain('function setActionButtonsEnabled(enabled)');
|
|
});
|
|
|
|
test('buttons are disabled when disconnected', () => {
|
|
// updateConnection should call setActionButtonsEnabled(false) when no URL
|
|
expect(js).toContain('setActionButtonsEnabled(false)');
|
|
expect(js).toContain('setActionButtonsEnabled(true)');
|
|
});
|
|
|
|
test('runCleanup silently returns when disconnected (no error spam)', () => {
|
|
// Should NOT show "Not connected" notification, just return silently
|
|
const cleanupFn = js.slice(
|
|
js.indexOf('async function runCleanup('),
|
|
js.indexOf('\n}', js.indexOf('async function runCleanup(') + 1) + 2,
|
|
);
|
|
expect(cleanupFn).not.toContain('Not connected to browse server');
|
|
});
|
|
|
|
test('CSS has disabled style for action buttons', () => {
|
|
expect(css).toContain('.quick-action-btn.disabled');
|
|
expect(css).toContain('.inspector-action-btn.disabled');
|
|
expect(css).toContain('pointer-events: none');
|
|
});
|
|
});
|
|
|
|
// ─── Chat message dedup ─────────────────────────────────────────
|
|
|
|
describe('chat message dedup (prevents repeat rendering)', () => {
|
|
const js = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.js'), 'utf-8');
|
|
|
|
test('renderedEntryIds Set exists for dedup tracking', () => {
|
|
expect(js).toContain('const renderedEntryIds = new Set()');
|
|
});
|
|
|
|
test('addChatEntry checks entry.id against renderedEntryIds', () => {
|
|
const addFn = js.slice(
|
|
js.indexOf('function addChatEntry(entry)'),
|
|
js.indexOf('\n // User messages', js.indexOf('function addChatEntry(entry)')),
|
|
);
|
|
expect(addFn).toContain('renderedEntryIds.has(entry.id)');
|
|
expect(addFn).toContain('renderedEntryIds.add(entry.id)');
|
|
// Should return early (skip) if already rendered
|
|
expect(addFn).toContain('return');
|
|
});
|
|
|
|
test('addChatEntry skips dedup for entries without id (local notifications)', () => {
|
|
const addFn = js.slice(
|
|
js.indexOf('function addChatEntry(entry)'),
|
|
js.indexOf('\n // User messages', js.indexOf('function addChatEntry(entry)')),
|
|
);
|
|
// Should only check dedup when entry.id is defined
|
|
expect(addFn).toContain('entry.id !== undefined');
|
|
});
|
|
|
|
test('clear chat resets renderedEntryIds', () => {
|
|
expect(js).toContain('renderedEntryIds.clear()');
|
|
});
|
|
});
|
|
|
|
// ─── Agent conciseness and focus stealing ───────────────────────
|
|
|
|
describe('sidebar agent conciseness + no focus stealing', () => {
|
|
const serverSrc = fs.readFileSync(path.join(ROOT, 'src', 'server.ts'), 'utf-8');
|
|
const bmSrc = fs.readFileSync(path.join(ROOT, 'src', 'browser-manager.ts'), 'utf-8');
|
|
|
|
test('system prompt tells agent to STOP when task is done', () => {
|
|
const promptSection = serverSrc.slice(
|
|
serverSrc.indexOf('const systemPrompt = ['),
|
|
serverSrc.indexOf("].join('\\n');", serverSrc.indexOf('const systemPrompt = [')),
|
|
);
|
|
expect(promptSection).toContain('STOP');
|
|
expect(promptSection).toContain('CONCISE');
|
|
expect(promptSection).toContain('Do NOT keep exploring');
|
|
});
|
|
|
|
test('sidebar agent uses opus (not sonnet) for prompt injection resistance', () => {
|
|
const spawnFn = serverSrc.slice(
|
|
serverSrc.indexOf('function spawnClaude('),
|
|
serverSrc.indexOf('\nfunction ', serverSrc.indexOf('function spawnClaude(') + 1),
|
|
);
|
|
expect(spawnFn).toContain("'opus'");
|
|
});
|
|
|
|
test('switchTab has bringToFront option', () => {
|
|
expect(bmSrc).toContain('bringToFront?: boolean');
|
|
expect(bmSrc).toContain('bringToFront !== false');
|
|
});
|
|
|
|
test('handleCommand tab pinning does NOT steal focus', () => {
|
|
// All switchTab calls in handleCommand should use bringToFront: false
|
|
const handleFn = serverSrc.slice(
|
|
serverSrc.indexOf('async function handleCommand('),
|
|
serverSrc.indexOf('\n// ', serverSrc.indexOf('async function handleCommand(') + 200),
|
|
);
|
|
const switchCalls = handleFn.match(/switchTab\([^)]+\)/g) || [];
|
|
for (const call of switchCalls) {
|
|
expect(call).toContain('bringToFront: false');
|
|
}
|
|
});
|
|
});
|
|
|
|
// ─── LLM-based cleanup architecture ─────────────────────────────
|
|
|
|
describe('LLM-based cleanup (smart agent cleanup)', () => {
|
|
const js = fs.readFileSync(path.join(ROOT, '..', 'extension', 'sidepanel.js'), 'utf-8');
|
|
const wcSrc = fs.readFileSync(path.join(ROOT, 'src', 'write-commands.ts'), 'utf-8');
|
|
|
|
test('cleanup button uses /sidebar-command not /command', () => {
|
|
const cleanupFn = js.slice(
|
|
js.indexOf('async function runCleanup('),
|
|
js.indexOf('async function runScreenshot('),
|
|
);
|
|
// Should POST to sidebar-command (agent) not /command (deterministic)
|
|
expect(cleanupFn).toContain('/sidebar-command');
|
|
// Should NOT directly call the cleanup command endpoint
|
|
expect(cleanupFn).not.toMatch(/fetch.*\/command['"]/);
|
|
});
|
|
|
|
test('cleanup prompt includes deterministic first pass', () => {
|
|
const cleanupFn = js.slice(
|
|
js.indexOf('async function runCleanup('),
|
|
js.indexOf('async function runScreenshot('),
|
|
);
|
|
// First run the deterministic sweep
|
|
expect(cleanupFn).toContain('cleanup --all');
|
|
});
|
|
|
|
test('cleanup prompt instructs agent to snapshot and analyze', () => {
|
|
const cleanupFn = js.slice(
|
|
js.indexOf('async function runCleanup('),
|
|
js.indexOf('async function runScreenshot('),
|
|
);
|
|
// Agent should take a snapshot to see what deterministic pass missed
|
|
expect(cleanupFn).toContain('snapshot -i');
|
|
// Agent should analyze what remains
|
|
expect(cleanupFn).toContain('identify remaining non-content');
|
|
});
|
|
|
|
test('cleanup prompt lists specific clutter categories for agent', () => {
|
|
const cleanupFn = js.slice(
|
|
js.indexOf('async function runCleanup('),
|
|
js.indexOf('async function runScreenshot('),
|
|
);
|
|
// Should guide the agent on what to look for
|
|
expect(cleanupFn).toContain('Ad placeholder');
|
|
expect(cleanupFn).toContain('ADVERTISEMENT');
|
|
expect(cleanupFn).toContain('Cookie');
|
|
expect(cleanupFn).toContain('Audio/podcast');
|
|
expect(cleanupFn).toContain('Sidebar widget');
|
|
expect(cleanupFn).toContain('Social share');
|
|
expect(cleanupFn).toContain('Floating chat');
|
|
});
|
|
|
|
test('cleanup prompt instructs agent to preserve site identity', () => {
|
|
const cleanupFn = js.slice(
|
|
js.indexOf('async function runCleanup('),
|
|
js.indexOf('async function runScreenshot('),
|
|
);
|
|
// Must keep the site looking like itself
|
|
expect(cleanupFn).toContain('KEEP');
|
|
expect(cleanupFn).toContain('header/masthead/logo');
|
|
expect(cleanupFn).toContain('article headline');
|
|
expect(cleanupFn).toContain('article body');
|
|
expect(cleanupFn).toContain('author byline');
|
|
});
|
|
|
|
test('cleanup prompt instructs agent to unlock scrolling', () => {
|
|
const cleanupFn = js.slice(
|
|
js.indexOf('async function runCleanup('),
|
|
js.indexOf('async function runScreenshot('),
|
|
);
|
|
expect(cleanupFn).toContain('unlock scrolling');
|
|
expect(cleanupFn).toContain('overflow');
|
|
});
|
|
|
|
test('cleanup prompt instructs agent to use $B eval for removal', () => {
|
|
const cleanupFn = js.slice(
|
|
js.indexOf('async function runCleanup('),
|
|
js.indexOf('async function runScreenshot('),
|
|
);
|
|
// Agent should use $B eval to hide elements via JavaScript
|
|
expect(cleanupFn).toContain('$B eval');
|
|
expect(cleanupFn).toContain("display=");
|
|
});
|
|
|
|
test('cleanup shows notification while agent works', () => {
|
|
const cleanupFn = js.slice(
|
|
js.indexOf('async function runCleanup('),
|
|
js.indexOf('async function runScreenshot('),
|
|
);
|
|
expect(cleanupFn).toContain('agent is analyzing');
|
|
});
|
|
|
|
test('cleanup removes loading state after short delay (agent is async)', () => {
|
|
const cleanupFn = js.slice(
|
|
js.indexOf('async function runCleanup('),
|
|
js.indexOf('async function runScreenshot('),
|
|
);
|
|
// Should use setTimeout since agent runs asynchronously
|
|
expect(cleanupFn).toContain('setTimeout');
|
|
expect(cleanupFn).toContain("classList.remove('loading')");
|
|
});
|
|
|
|
test('deterministic cleanup still has comprehensive selectors as first pass', () => {
|
|
// The deterministic $B cleanup --all still needs good selectors for the quick pass
|
|
expect(wcSrc).toContain('ads: [');
|
|
expect(wcSrc).toContain('cookies: [');
|
|
expect(wcSrc).toContain('social: [');
|
|
expect(wcSrc).toContain('overlays: [');
|
|
expect(wcSrc).toContain('clutter: [');
|
|
});
|
|
|
|
test('deterministic cleanup clutter covers audio/podcast widgets', () => {
|
|
expect(wcSrc).toContain('audio-player');
|
|
expect(wcSrc).toContain('podcast-player');
|
|
expect(wcSrc).toContain('listen-widget');
|
|
expect(wcSrc).toContain('everlit');
|
|
expect(wcSrc).toContain("'audio'"); // bare audio elements
|
|
});
|
|
|
|
test('deterministic cleanup clutter covers sidebar recirculation', () => {
|
|
expect(wcSrc).toContain('most-popular');
|
|
expect(wcSrc).toContain('most-read');
|
|
expect(wcSrc).toContain('recommended');
|
|
expect(wcSrc).toContain('taboola');
|
|
expect(wcSrc).toContain('outbrain');
|
|
expect(wcSrc).toContain('nativo');
|
|
});
|
|
|
|
test('deterministic cleanup clutter covers games/puzzles', () => {
|
|
expect(wcSrc).toContain('puzzle');
|
|
expect(wcSrc).toContain('daily-game');
|
|
expect(wcSrc).toContain('crossword-promo');
|
|
});
|
|
|
|
test('ad label text detection catches common patterns', () => {
|
|
expect(wcSrc).toContain('/^advertisement$/i');
|
|
expect(wcSrc).toContain('/^sponsored$/i');
|
|
expect(wcSrc).toContain('/^promoted$/i');
|
|
expect(wcSrc).toContain('/article continues/i');
|
|
expect(wcSrc).toContain('/continues below/i');
|
|
expect(wcSrc).toContain('/^paid content$/i');
|
|
expect(wcSrc).toContain('/^partner content$/i');
|
|
});
|
|
|
|
test('ad label detection skips elements with too much text (not a label)', () => {
|
|
// Should skip elements with >50 chars (probably real content)
|
|
expect(wcSrc).toContain('text.length > 50');
|
|
});
|
|
|
|
test('ad label detection hides parent wrapper when small enough', () => {
|
|
// If parent has little content, hide the whole wrapper
|
|
expect(wcSrc).toContain('parent.textContent');
|
|
expect(wcSrc).toContain('trim().length < 80');
|
|
});
|
|
|
|
test('sticky removal sorts by vertical position (topmost first)', () => {
|
|
expect(wcSrc).toContain('sort((a, b) => a.top - b.top)');
|
|
});
|
|
|
|
test('sticky removal preserves first full-width element near top', () => {
|
|
expect(wcSrc).toContain('preservedTopNav');
|
|
// Should check element spans most of viewport
|
|
expect(wcSrc).toContain('viewportWidth * 0.8');
|
|
// Should only preserve the first one
|
|
expect(wcSrc).toContain('!preservedTopNav');
|
|
// Should check it's near the top
|
|
expect(wcSrc).toContain('top <= 50');
|
|
// Should check it's not too tall (it's a nav, not a hero)
|
|
expect(wcSrc).toContain('height < 120');
|
|
});
|
|
|
|
test('sticky removal still skips semantic nav/header elements', () => {
|
|
expect(wcSrc).toContain("tag === 'nav'");
|
|
expect(wcSrc).toContain("tag === 'header'");
|
|
expect(wcSrc).toContain("role') === 'navigation'");
|
|
});
|
|
});
|