Files
gstack/AGENTS.md
Garry Tan 409019a1e0 Merge remote-tracking branch 'origin/main' into garrytan/auq-auto-mode
# Conflicts:
#	CHANGELOG.md
#	VERSION
#	package.json
2026-05-01 08:10:12 -07:00

5.8 KiB

gstack — AI Engineering Workflow

gstack is a collection of SKILL.md files that give AI agents structured roles for software development. Each skill is a specialist: CEO reviewer, eng manager, designer, QA lead, release engineer, debugger, and more.

Available skills

Skills live in .agents/skills/ (or ~/.claude/skills/gstack/ on Claude Code). Invoke them by name (e.g., /office-hours).

Plan-mode reviews

Skill What it does
/office-hours Start here. Reframes your product idea before you write code.
/plan-ceo-review CEO-level review: find the 10-star product in the request.
/plan-eng-review Lock architecture, data flow, edge cases, and tests.
/plan-design-review Rate each design dimension 0-10, explain what a 10 looks like.
/plan-devex-review DX-mode review: TTHW, magical moments, friction points, persona traces.
/plan-tune Self-tune AskUserQuestion sensitivity per question.
/autoplan One command runs CEO → design → eng → DX review.
/design-consultation Build a complete design system from scratch.

Implementation + review

Skill What it does
/review Pre-landing PR review. Finds bugs that pass CI but break in prod.
/codex Second opinion via OpenAI Codex. Review, challenge, or consult modes.
/investigate Systematic root-cause debugging. No fixes without investigation.
/design-review Live-site visual audit + fix loop with atomic commits.
/design-shotgun Generate multiple AI design variants, comparison board, iterate.
/design-html Generate production-quality Pretext-native HTML/CSS.
/devex-review Live developer experience audit (TTHW measured against the real flow).
/qa Open a real browser, find bugs, fix them, re-verify.
/qa-only Same methodology as /qa but report only — no code changes.
/scrape Pull data from a web page. First call prototypes; codified call runs in ~200ms.
/skillify Codify the most recent successful /scrape flow into a permanent browser-skill.

Release + deploy

Skill What it does
/ship Run tests, review, push, open PR. Workspace-aware version queue.
/land-and-deploy Merge the PR, wait for CI and deploy, verify production health.
/canary Post-deploy monitoring loop using the browse daemon.
/landing-report Read-only dashboard for the workspace-aware ship queue.
/document-release Update all docs to match what you just shipped.
/setup-deploy One-time deploy config detection (Fly.io, Render, Vercel, etc.).
/gstack-upgrade Update gstack to the latest version.

Operational + memory

Skill What it does
/context-save Save working context (git state, decisions, remaining work).
/context-restore Resume from a saved context, even across Conductor workspaces.
/learn Manage what gstack learned across sessions.
/retro Weekly retro with per-person breakdowns and shipping streaks.
/health Code quality dashboard (type checker, linter, tests, dead code).
/benchmark Performance regression detection (page load, Core Web Vitals).
/benchmark-models Cross-model benchmark for skills (Claude, GPT, Gemini side-by-side).
/cso OWASP Top 10 + STRIDE security audit.
/setup-gbrain Set up gbrain for cross-machine session memory sync.

Browser + agent integration

Skill What it does
/browse Headless browser — real Chromium, real clicks, ~100ms/command.
/open-gstack-browser Launch the visible GStack Browser with sidebar + stealth.
/setup-browser-cookies Import cookies from your real browser for authenticated testing.
/pair-agent Pair a remote AI agent (OpenClaw, Codex, etc.) with your browser.

Safety + scoping

Skill What it does
/careful Warn before destructive commands (rm -rf, DROP TABLE, force-push).
/freeze Lock edits to one directory. Hard block, not just a warning.
/guard Activate both careful + freeze at once.
/unfreeze Remove directory edit restrictions.
/make-pdf Turn any markdown file into a publication-quality PDF.

Build commands

bun install              # install dependencies
bun test                 # run free tests (no API spend)
bun run test:windows     # curated Windows-safe subset (runs on windows-latest)
bun run build            # generate docs + compile binaries
bun run gen:skill-docs   # regenerate SKILL.md files from templates
bun run skill:check      # health dashboard for all skills

Platform support

  • macOS + Linux: full test suite supported.
  • Windows: curated Windows-safe subset runs on windows-latest via the windows-free-tests CI job. Setup script (./setup) requires Git Bash or MSYS today; native PowerShell support is a future expansion. The bin/gstack-paths helper resolves state roots through CLAUDE_PLUGIN_DATA / GSTACK_HOME so plugin installs work on every platform.

Key conventions

  • SKILL.md files are generated from .tmpl templates. Edit the template, not the output.
  • Run bun run gen:skill-docs --host codex to regenerate Codex-specific output.
  • The browse binary provides headless browser access. Use $B <command> in skills.
  • Safety skills (careful, freeze, guard) use inline advisory prose — always confirm before destructive operations.
  • State paths resolve via bin/gstack-paths (sourced via eval "$(...)"). Honors GSTACK_HOME, CLAUDE_PLUGIN_DATA, CLAUDE_PLANS_DIR.
  • The claude CLI binary resolves via browse/src/claude-bin.ts (Bun.which() + GSTACK_CLAUDE_BIN override). Set GSTACK_CLAUDE_BIN=wsl plus GSTACK_CLAUDE_BIN_ARGS='["claude"]' to run Claude through WSL on Windows.