mirror of
https://github.com/garrytan/gstack.git
synced 2026-05-01 19:25:10 +02:00
b805aa0113
* feat: add Confusion Protocol to preamble resolver Injects a high-stakes ambiguity gate at preamble tier >= 2 so all workflow skills get it. Fires when Claude encounters architectural decisions, data model changes, destructive operations, or contradictory requirements. Does NOT fire on routine coding. Addresses Karpathy failure mode #1 (wrong assumptions) with an inline STOP gate instead of relying on workflow skill invocation. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add Hermes and GBrain host configs Hermes: tool rewrites for terminal/read_file/patch/delegate_task, paths to ~/.hermes/skills/gstack, AGENTS.md config file. GBrain: coding skills become brain-aware when GBrain mod is installed. Same tool rewrites as OpenClaw (agents spawn Claude Code via ACP). GBRAIN_CONTEXT_LOAD and GBRAIN_SAVE_RESULTS NOT suppressed on gbrain host, enabling brain-first lookup and save-to-brain behavior. Both registered in hosts/index.ts with setup script redirect messages. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: GBrain resolver — brain-first lookup and save-to-brain New scripts/resolvers/gbrain.ts with two resolver functions: - GBRAIN_CONTEXT_LOAD: search brain for context before skill starts - GBRAIN_SAVE_RESULTS: save skill output to brain after completion Placeholders added to 4 thinking skill templates (office-hours, investigate, plan-ceo-review, retro). Resolves to empty string on all hosts except gbrain via suppressedResolvers. GBRAIN suppression added to all 9 non-gbrain host configs. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: wire slop:diff into /review as advisory diagnostic Adds Step 3.5 to the review template: runs bun run slop:diff against the base branch to catch AI code quality issues (empty catches, redundant return await, overcomplicated abstractions). Advisory only, never blocking. Skips silently if slop-scan is not installed. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: add Karpathy compatibility note to README Positions gstack as the workflow enforcement layer for Karpathy-style CLAUDE.md rules (17K stars). Links to forrestchang/andrej-karpathy-skills. Maps each Karpathy failure mode to the gstack skill that addresses it. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: improve native OpenClaw thinking skills office-hours: add design doc path visibility message after writing ceo-review: add HARD GATE reminder at review section transitions retro: add non-git context support (check memory for meeting notes) Mirrors template improvements to hand-crafted native skills. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * chore: update tests and golden fixtures for new hosts - Host count: 8 → 10 (hermes, gbrain) - OpenClaw adapter test: expects undefined (dead code removed) - Golden ship fixtures: updated with Confusion Protocol + vendoring Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * chore: regenerate all SKILL.md files Regenerated from templates after Confusion Protocol, GBrain resolver placeholders, slop:diff in review, HARD GATE reminders, investigation learnings, design doc visibility, and retro non-git context changes. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: update project documentation for v0.18.0.0 - CHANGELOG: add v0.18.0.0 entry (Confusion Protocol, Hermes, GBrain, slop in review, Karpathy note, skill improvements) - CLAUDE.md: add hermes.ts and gbrain.ts to hosts listing - README.md: update agent count 8→10, add Hermes + GBrain to table - VERSION: bump to 0.18.0.0 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * chore: sync package.json version to 0.18.0.0 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: extract Step 0 from review SKILL.md in E2E test The review-base-branch E2E test was copying the full 1493-line review/SKILL.md into the test fixture. The agent spent 8+ turns reading it in chunks, leaving only 7 turns for actual work, causing error_max_turns on every attempt. Now extracts only Step 0 (base branch detection, ~50 lines) which is all the test actually needs. Follows the CLAUDE.md rule: "NEVER copy a full SKILL.md file into an E2E test fixture." Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: update GBrain and Hermes host configs for v0.10.0 integration GBrain: add 'triggers' to keepFields so generated skills pass checkResolvable() validation. Add version compat comment. Hermes: un-suppress GBRAIN_CONTEXT_LOAD and GBRAIN_SAVE_RESULTS. The resolvers handle GBrain-not-installed gracefully, so Hermes agents with GBrain as a mod get brain features automatically. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: GBrain resolver DX improvements and preamble health check Resolver changes: - gbrain query → gbrain search (fast keyword search, not expensive hybrid) - Add keyword extraction guidance for agents - Show explicit gbrain put_page syntax with --title, --tags, heredoc - Add entity enrichment with false-positive filter - Name throttle error patterns (exit code 1, stderr keywords) - Add data-research routing for investigate skill - Expand skillSaveMap from 4 to 8 entries - Add brain operation telemetry summary Preamble changes: - Add gbrain doctor --fast --json health check for gbrain/hermes hosts - Parse check failures/warnings count - Show failing check details when score < 50 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: preserve keepFields in allowlist frontmatter mode The allowlist mode hard-coded name + description reconstruction but never iterated keepFields for additional fields. Adding 'triggers' to keepFields was a no-op because the field was silently stripped. Now iterates keepFields and preserves any field beyond name/description from the source template frontmatter, including YAML arrays. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add triggers to all 38 skill templates Multi-word, skill-specific trigger keywords for GBrain's RESOLVER.md router. Each skill gets 3-6 triggers derived from its "Use when asked to..." description text. Avoids single generic words that would collide across skills (e.g., "debug this" not "debug"). These are distinct from voice-triggers (speech-to-text aliases) and serve GBrain's checkResolvable() validation. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * chore: regenerate all SKILL.md files and update golden fixtures Regenerated from updated templates (triggers, brain placeholders, resolver DX improvements, preamble health check). Golden fixtures updated to match. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: settings-hook remove exits 1 when nothing to remove gstack-settings-hook remove was exiting 0 when settings.json didn't exist, causing gstack-uninstall to report "SessionStart hook" as removed on clean systems where nothing was installed. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: update project documentation for GBrain v0.10.0 integration ARCHITECTURE.md: added GBRAIN_CONTEXT_LOAD and GBRAIN_SAVE_RESULTS to resolver table. CHANGELOG.md: expanded v0.18.0.0 entry with GBrain v0.10.0 integration details (triggers, expanded brain-awareness, DX improvements, Hermes brain support), updated date. CLAUDE.md: added gbrain to resolvers/ directory comment. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: routing E2E stops writing to user's ~/.claude/skills/ installSkills() was copying SKILL.md files to both project-level (.claude/skills/ in tmpDir) and user-level (~/.claude/skills/). Writing to the user's real install fails when symlinks point to different worktrees or dangling targets (ENOENT on copyFileSync). Now installs to project-level only. The test already sets cwd to the tmpDir, so project-level discovery works. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * chore: scale Gemini E2E back to smoke test Gemini CLI gets lost in worktrees on complex tasks (review times out at 600s, discover-skill hits exit 124). Nobody uses Gemini for gstack skill execution. Replace the two failing tests (gemini-discover-skill and gemini-review-findings) with a single smoke test that verifies Gemini can start and read the README. 90s timeout, no skill invocation. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
210 lines
7.5 KiB
Cheetah
210 lines
7.5 KiB
Cheetah
---
|
|
name: open-gstack-browser
|
|
version: 0.2.0
|
|
description: |
|
|
Launch GStack Browser — AI-controlled Chromium with the sidebar extension baked in.
|
|
Opens a visible browser window where you can watch every action in real time.
|
|
The sidebar shows a live activity feed and chat. Anti-bot stealth built in.
|
|
Use when asked to "open gstack browser", "launch browser", "connect chrome",
|
|
"open chrome", "real browser", "launch chrome", "side panel", or "control my browser".
|
|
voice-triggers:
|
|
- "show me the browser"
|
|
triggers:
|
|
- open gstack browser
|
|
- launch chromium
|
|
- show me the browser
|
|
allowed-tools:
|
|
- Bash
|
|
- Read
|
|
- AskUserQuestion
|
|
|
|
---
|
|
|
|
{{PREAMBLE}}
|
|
|
|
# /open-gstack-browser — Launch GStack Browser
|
|
|
|
Launch GStack Browser — AI-controlled Chromium with the sidebar extension,
|
|
anti-bot stealth, and custom branding. You see every action in real time.
|
|
|
|
{{BROWSE_SETUP}}
|
|
|
|
## Step 0: Pre-flight cleanup
|
|
|
|
Before connecting, kill any stale browse servers and clean up lock files that
|
|
may have persisted from a crash. This prevents "already connected" false
|
|
positives and Chromium profile lock conflicts.
|
|
|
|
```bash
|
|
# Kill any existing browse server
|
|
if [ -f "$(git rev-parse --show-toplevel 2>/dev/null)/.gstack/browse.json" ]; then
|
|
_OLD_PID=$(cat "$(git rev-parse --show-toplevel)/.gstack/browse.json" 2>/dev/null | grep -o '"pid":[0-9]*' | grep -o '[0-9]*')
|
|
[ -n "$_OLD_PID" ] && kill "$_OLD_PID" 2>/dev/null || true
|
|
sleep 1
|
|
[ -n "$_OLD_PID" ] && kill -9 "$_OLD_PID" 2>/dev/null || true
|
|
rm -f "$(git rev-parse --show-toplevel)/.gstack/browse.json"
|
|
fi
|
|
# Clean Chromium profile locks (can persist after crashes)
|
|
_PROFILE_DIR="$HOME/.gstack/chromium-profile"
|
|
for _LF in SingletonLock SingletonSocket SingletonCookie; do
|
|
rm -f "$_PROFILE_DIR/$_LF" 2>/dev/null || true
|
|
done
|
|
echo "Pre-flight cleanup done"
|
|
```
|
|
|
|
## Step 1: Connect
|
|
|
|
```bash
|
|
$B connect
|
|
```
|
|
|
|
This launches GStack Browser (rebranded Chromium) in headed mode with:
|
|
- A visible window you can watch (not your regular Chrome — it stays untouched)
|
|
- The gstack sidebar extension auto-loaded via `launchPersistentContext`
|
|
- Anti-bot stealth patches (sites like Google and NYTimes work without captchas)
|
|
- Custom user agent and GStack Browser branding in Dock/menu bar
|
|
- A sidebar agent process for chat commands
|
|
|
|
The `connect` command auto-discovers the extension from the gstack install
|
|
directory. It always uses port **34567** so the extension can auto-connect.
|
|
|
|
After connecting, print the full output to the user. Confirm you see
|
|
`Mode: headed` in the output.
|
|
|
|
If the output shows an error or the mode is not `headed`, run `$B status` and
|
|
share the output with the user before proceeding.
|
|
|
|
## Step 2: Verify
|
|
|
|
```bash
|
|
$B status
|
|
```
|
|
|
|
Confirm the output shows `Mode: headed`. Read the port from the state file:
|
|
|
|
```bash
|
|
cat "$(git rev-parse --show-toplevel 2>/dev/null)/.gstack/browse.json" 2>/dev/null | grep -o '"port":[0-9]*' | grep -o '[0-9]*'
|
|
```
|
|
|
|
The port should be **34567**. If it's different, note it — the user may need it
|
|
for the Side Panel.
|
|
|
|
Also find the extension path so you can help the user if they need to load it manually:
|
|
|
|
```bash
|
|
_EXT_PATH=""
|
|
_ROOT=$(git rev-parse --show-toplevel 2>/dev/null)
|
|
[ -n "$_ROOT" ] && [ -f "$_ROOT/.claude/skills/gstack/extension/manifest.json" ] && _EXT_PATH="$_ROOT/.claude/skills/gstack/extension"
|
|
[ -z "$_EXT_PATH" ] && [ -f "$HOME/.claude/skills/gstack/extension/manifest.json" ] && _EXT_PATH="$HOME/.claude/skills/gstack/extension"
|
|
echo "EXTENSION_PATH: ${_EXT_PATH:-NOT FOUND}"
|
|
```
|
|
|
|
## Step 3: Guide the user to the Side Panel
|
|
|
|
Use AskUserQuestion:
|
|
|
|
> Chrome is launched with gstack control. You should see Playwright's Chromium
|
|
> (not your regular Chrome) with a golden shimmer line at the top of the page.
|
|
>
|
|
> The Side Panel extension should be auto-loaded. To open it:
|
|
> 1. Look for the **puzzle piece icon** (Extensions) in the toolbar — it may
|
|
> already show the gstack icon if the extension loaded successfully
|
|
> 2. Click the **puzzle piece** → find **gstack browse** → click the **pin icon**
|
|
> 3. Click the pinned **gstack icon** in the toolbar
|
|
> 4. The Side Panel should open on the right showing a live activity feed
|
|
>
|
|
> **Port:** 34567 (auto-detected — the extension connects automatically in the
|
|
> Playwright-controlled Chrome).
|
|
|
|
Options:
|
|
- A) I can see the Side Panel — let's go!
|
|
- B) I can see Chrome but can't find the extension
|
|
- C) Something went wrong
|
|
|
|
If B: Tell the user:
|
|
|
|
> The extension is loaded into Playwright's Chromium at launch time, but
|
|
> sometimes it doesn't appear immediately. Try these steps:
|
|
>
|
|
> 1. Type `chrome://extensions` in the address bar
|
|
> 2. Look for **"gstack browse"** — it should be listed and enabled
|
|
> 3. If it's there but not pinned, go back to any page, click the puzzle piece
|
|
> icon, and pin it
|
|
> 4. If it's NOT listed at all, click **"Load unpacked"** and navigate to:
|
|
> - Press **Cmd+Shift+G** in the file picker dialog
|
|
> - Paste this path: `{EXTENSION_PATH}` (use the path from Step 2)
|
|
> - Click **Select**
|
|
>
|
|
> After loading, pin it and click the icon to open the Side Panel.
|
|
>
|
|
> If the Side Panel badge stays gray (disconnected), click the gstack icon
|
|
> and enter port **34567** manually.
|
|
|
|
If C:
|
|
|
|
1. Run `$B status` and show the output
|
|
2. If the server is not healthy, re-run Step 0 cleanup + Step 1 connect
|
|
3. If the server IS healthy but the browser isn't visible, try `$B focus`
|
|
4. If that fails, ask the user what they see (error message, blank screen, etc.)
|
|
|
|
## Step 4: Demo
|
|
|
|
After the user confirms the Side Panel is working, run a quick demo:
|
|
|
|
```bash
|
|
$B goto https://news.ycombinator.com
|
|
```
|
|
|
|
Wait 2 seconds, then:
|
|
|
|
```bash
|
|
$B snapshot -i
|
|
```
|
|
|
|
Tell the user: "Check the Side Panel — you should see the `goto` and `snapshot`
|
|
commands appear in the activity feed. Every command Claude runs shows up here
|
|
in real time."
|
|
|
|
## Step 5: Sidebar chat
|
|
|
|
After the activity feed demo, tell the user about the sidebar chat:
|
|
|
|
> The Side Panel also has a **chat tab**. Try typing a message like "take a
|
|
> snapshot and describe this page." A sidebar agent (a child Claude instance)
|
|
> executes your request in the browser — you'll see the commands appear in
|
|
> the activity feed as they happen.
|
|
>
|
|
> The sidebar agent can navigate pages, click buttons, fill forms, and read
|
|
> content. Each task gets up to 5 minutes. It runs in an isolated session, so
|
|
> it won't interfere with this Claude Code window.
|
|
|
|
## Step 6: What's next
|
|
|
|
Tell the user:
|
|
|
|
> You're all set! Here's what you can do with the connected Chrome:
|
|
>
|
|
> **Watch Claude work in real time:**
|
|
> - Run any gstack skill (`/qa`, `/design-review`, `/benchmark`) and watch
|
|
> every action happen in the visible Chrome window + Side Panel feed
|
|
> - No cookie import needed — the Playwright browser shares its own session
|
|
>
|
|
> **Control the browser directly:**
|
|
> - **Sidebar chat** — type natural language in the Side Panel and the sidebar
|
|
> agent executes it (e.g., "fill in the login form and submit")
|
|
> - **Browse commands** — `$B goto <url>`, `$B click <sel>`, `$B fill <sel> <val>`,
|
|
> `$B snapshot -i` — all visible in Chrome + Side Panel
|
|
>
|
|
> **Window management:**
|
|
> - `$B focus` — bring Chrome to the foreground anytime
|
|
> - `$B disconnect` — close headed Chrome and return to headless mode
|
|
>
|
|
> **What skills look like in headed mode:**
|
|
> - `/qa` runs its full test suite in the visible browser — you see every page
|
|
> load, every click, every assertion
|
|
> - `/design-review` takes screenshots in the real browser — same pixels you see
|
|
> - `/benchmark` measures performance in the headed browser
|
|
|
|
Then proceed with whatever the user asked to do. If they didn't specify a task,
|
|
ask what they'd like to test or browse.
|