Claude Octopus - System Instructions

Note: This file provides context when working directly in the claude-octopus repository. For deployed plugins, visual indicator instructions are embedded in each skill file (flow-discover.md, flow-define.md, flow-develop.md, flow-deliver.md, skill-debate.md).

Visual Indicators (MANDATORY)

When executing Claude Octopus workflows, you MUST display visual indicators so users know which AI providers are active and what costs they're incurring.

Indicator Reference

Indicator	Meaning	Cost Source
🐙	Claude Octopus multi-AI mode active	Multiple APIs
🔴	Codex CLI executing	User's OPENAI_API_KEY
🟡	Gemini CLI executing	User's GEMINI_API_KEY
🟣	Perplexity Sonar web search	User's PERPLEXITY_API_KEY
🔵	Claude subagent processing	Included with Claude Code

When to Display Indicators

Display indicators when:

Invoking any /octo: command
Running orchestrate.sh with any workflow (probe, grasp, tangle, ink, embrace, etc.)
User triggers workflow with "octo" prefix ("octo research X", "octo build Y")
Executing multi-provider operations

Required Output Format

Before starting a workflow, output this banner:

🐙 **CLAUDE OCTOPUS ACTIVATED** - [Workflow Type]
[Phase Emoji] [Phase Name]: [Brief description of what's happening]

Providers:
🔴 Codex CLI - [Provider's role in this workflow]
🟡 Gemini CLI - [Provider's role in this workflow]
🔵 Claude - [Your role in this workflow]

Phase emojis by workflow:

🔍 Discover/Probe - Research and exploration
🎯 Define/Grasp - Requirements and scope
🛠️ Develop/Tangle - Implementation
✅ Deliver/Ink - Validation and review
🐙 Debate - Multi-AI deliberation
🐙 Embrace - Full 4-phase workflow

Compact Mode

When OCTOPUS_COMPACT_BANNERS=true is set, use a condensed single-line banner instead:

🐙 Discover — Multi-provider research | 🔴🟡🔵

This is preferred for repeat users who don't need the full provider block every time.

Examples (Standard Mode)

Research workflow:

🐙 **CLAUDE OCTOPUS ACTIVATED** - Multi-provider research mode
🔍 Discover Phase: Researching OAuth authentication patterns

Providers:
🔴 Codex CLI - Technical implementation analysis
🟡 Gemini CLI - Ecosystem and community research
🔵 Claude - Strategic synthesis

Build workflow:

🐙 **CLAUDE OCTOPUS ACTIVATED** - Multi-provider implementation mode
🛠️ Develop Phase: Building user authentication system

Providers:
🔴 Codex CLI - Code generation and patterns
🟡 Gemini CLI - Alternative approaches
🔵 Claude - Integration and quality gates

Review workflow:

🐙 **CLAUDE OCTOPUS ACTIVATED** - Multi-provider validation mode
✅ Deliver Phase: Reviewing authentication implementation

Providers:
🔴 Codex CLI - Code quality analysis
🟡 Gemini CLI - Security and edge cases
🔵 Claude - Synthesis and recommendations

Debate:

🐙 **CLAUDE OCTOPUS ACTIVATED** - AI Debate Hub
🐙 Debate: Redis vs Memcached for session storage

Participants:
🔴 Codex CLI - Technical perspective
🟡 Gemini CLI - Ecosystem perspective
🔵 Claude - Moderator and synthesis

During Execution

When showing results from each provider, prefix with their indicator:

🔴 **Codex Analysis:**
[Codex findings...]

🟡 **Gemini Analysis:**
[Gemini findings...]

🔵 **Claude Synthesis:**
[Your synthesis...]

Why This Matters

Users need to understand:

What's running - Which AI providers are being invoked
Cost implications - External CLIs (🔴 🟡) use their API keys and cost money
Progress tracking - Which phase of the workflow is active

Without indicators, users have no visibility into what's happening or what they're paying for.

File Creation Policy (CRITICAL)

NEVER create temporary, progress, or working files in the plugin directory.

Prohibited File Patterns

The following file types MUST NEVER be created in the plugin directory:

PHASE*_PROGRESS.md - Phase progress tracking
PHASE*_COMPLETE.md - Phase completion markers
*_PROGRESS.md - Any progress tracking files
*_TODO.md - Working todo lists
*_NOTES.md - Development notes
scratch_*.md - Scratch files
temp_*.md - Temporary files
WIP_*.md - Work-in-progress markers

Where to Create Working Files

Use the scratchpad directory for ALL temporary/working files:

# Scratchpad directory (auto-managed by Claude Code)
~/.claude/scratchpad/[session-id]/

# Example paths
~/.claude/scratchpad/abc123/phase1-progress.md
~/.claude/scratchpad/abc123/implementation-notes.md
~/.claude/scratchpad/abc123/todo-list.md

Plugin Directory: Permanent Files Only

Only create files in the plugin directory that are:

Part of the permanent codebase (commands, skills, agents, hooks)
User-facing documentation (README.md, CHANGELOG.md, docs/)
Build/config files (package.json, tsconfig.json, .gitignore)
Test files in tests/ directory

Enforcement

If you need to track progress or create working files:

Always use the scratchpad directory
Never commit working files to git
Reference scratchpad files by full path when discussing them

Example - WRONG:

# ❌ Never do this
echo "Progress: 50%" > PHASE1_PROGRESS.md

Example - CORRECT:

# ✅ Always do this
echo "Progress: 50%" > ~/.claude/scratchpad/$(cat ~/.claude/session-id)/phase1-progress.md

Workflow Quick Reference

Command/Trigger	Workflow	Indicators
`octo research X`	Discover	🐙 🔍 🔴 🟡 🔵
`octo define X`	Define	🐙 🎯 🔴 🟡 🔵
`octo build X`	Develop	🐙 🛠️ 🔴 🟡 🔵
`octo review X`	Deliver	🐙 ✅ 🔴 🟡 🔵
`octo debate X`	Debate	🐙 🔴 🟡 🔵
`/octo:embrace X`	All 4 phases	🐙 (all phase emojis)

Provider Detection

Before running workflows, check provider availability:

Codex CLI: command -v codex or check for OPENAI_API_KEY
Gemini CLI: command -v gemini or check for GEMINI_API_KEY
Perplexity: check for PERPLEXITY_API_KEY (API-only, no CLI needed)
OpenRouter: check for OPENROUTER_API_KEY
Ollama: command -v ollama + server health at http://localhost:11434
Copilot CLI: command -v copilot + auth (COPILOT_GITHUB_TOKEN or gh CLI)
Qwen CLI: command -v qwen + auth (~/.qwen/oauth_creds.json or QWEN_API_KEY)
OpenCode CLI: command -v opencode + auth (opencode auth list exit code)

If a provider is unavailable, note it in the banner:

Providers:
🔴 Codex CLI - [role] (unavailable - skipping)
🟡 Gemini CLI - [role]
🔵 Claude - [role]

Cost Awareness

Always be mindful that external CLIs cost money:

🔴 Codex: ~$0.01-0.15 per query depending on model (GPT-5.4 $2.50/$15 MTok, GPT-5.3-Codex $1.75/$14, Mini $0.25/$2.00 MTok)
🟡 Gemini: ~$0.01-0.03 per query (Gemini Pro)
🟣 Perplexity: ~$0.01-0.05 per query (Sonar Pro $3/$15 MTok, Sonar $1/$1 MTok)
🔵 Claude (Sonnet 4.6): Included with Claude Code subscription
🔵 Claude (Opus 4.7, default when SUPPORTS_OPUS_4_7=true): $5/$25 per MTok input/output. No fast variant. 1M context native. Use xhigh effort for coding/agentic work (plugin default), high for knowledge work.
🔵 Claude (Opus 4.6, legacy): $5/$25 per MTok — still selectable via OCTOPUS_OPUS_MODEL=claude-opus-4.6 or claude-opus-legacy agent type
🔵 Claude (Opus 4.6 Fast): $30/$150 per MTok (6x standard) — lower latency, extra-usage billing (v2.1.36+). Only available on 4.6; no 4.7 equivalent.
🟤 OpenCode: Variable cost — free for native models, uses backend provider pricing when routing to OpenAI/Google

Note: Some OpenAI models (o-series reasoning, gpt-4.1, gpt-5.4-pro) require API keys and are NOT available via ChatGPT subscription/OAuth auth.

For simple tasks that don't need multi-AI perspectives, suggest using Claude directly without orchestration.

Opus 4.7 Effort Levels (Claude Code v2.1.111+)

Opus 4.7 introduces a new xhigh effort level between high and max. Claude Code's default is xhigh on 4.7. The plugin maps phases to effort:

probe / discover — medium (breadth > depth for research)
grasp / define — medium
tangle / develop — xhigh for complex work, medium for trivial
ink / deliver — xhigh for security/architecture reviews, medium otherwise

xhigh silently falls back to high on Opus 4.6, so the plugin sets xhigh unconditionally and lets Claude Code handle the downgrade. Override per-session with OCTOPUS_EFFORT_OVERRIDE=low|medium|high|xhigh|max.

Fast Opus 4.6 Mode (Claude Code v2.1.36+, Opus 4.6 only)

WARNING: Fast Opus is 6x more expensive than standard Opus. It uses extra-usage billing at $30/$150 per MTok (vs $5/$25 standard). It provides lower latency but identical quality. Not available on Opus 4.7 — fast mode is exclusively a 4.6 feature.

When SUPPORTS_FAST_OPUS=true is detected, orchestrate.sh routes conservatively:

Default: Opus 4.7 standard for all multi-phase workflows (embrace, discover, develop, etc.)
Fast mode (4.6): only for interactive single-shot Opus queries where the user is actively waiting and latency matters more than the new 4.7 capabilities
Never fast in autonomous/background mode (no human waiting = no latency benefit)
User override: Set OCTOPUS_OPUS_MODE=fast to force Opus 4.6 fast (costly!)
User override: Set OCTOPUS_OPUS_MODE=standard to force standard 4.7 everywhere (default behavior)
User override: Set OCTOPUS_OPUS_MODEL=claude-opus-4.6 to pin legacy 4.6 standard across the board

Always warn users about the cost difference before enabling fast mode.

Auto Memory & Persistent Memory Integration (Claude Code v2.1.32+, enhanced in v2.1.33+)

Claude Code's auto memory (~/.claude/projects/.../memory/MEMORY.md) persists across conversations. When SUPPORTS_PERSISTENT_MEMORY is detected (v2.1.33+), memory persistence is guaranteed across sessions. Record the following in auto memory:

User's preferred autonomy mode (interactive vs autonomous workflow execution)
Provider availability (which CLIs are installed, auth methods configured)
Frequently used commands (e.g., user prefers /octo:quick over full embrace)
Past project contexts (tech stack, coding conventions, deployment targets)
Model preferences (whether user prefers Opus 4.6 for premium tasks)

This enables faster workflow startup by skipping provider detection and preference questions in subsequent sessions.

Enforcement Best Practices

Skills use the Validation Gate Pattern to ensure multi-LLM dispatch actually executes:

Pre-check: Run check-providers.sh to detect available providers before dispatch
Dispatch: Call orchestrate.sh probe-single per provider via background Agent subagents
Validate: After dispatch, verify synthesis files exist (find ~/.claude-octopus/results/ -name "probe-synthesis-*" -mmin -10)
Fail loud: If no synthesis files found, report "VALIDATION FAILED — multi-LLM dispatch did not execute" instead of silently falling back to Claude-only

Developer reference (modular config, E2E testing, enforcement patterns): see docs/DEVELOPER.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Claude Octopus - System Instructions

Visual Indicators (MANDATORY)

Indicator Reference

When to Display Indicators

Required Output Format

Compact Mode

Examples (Standard Mode)

During Execution

Why This Matters

File Creation Policy (CRITICAL)

Prohibited File Patterns

Where to Create Working Files

Plugin Directory: Permanent Files Only

Enforcement

Workflow Quick Reference

Provider Detection

Cost Awareness

Opus 4.7 Effort Levels (Claude Code v2.1.111+)

Fast Opus 4.6 Mode (Claude Code v2.1.36+, Opus 4.6 only)

Auto Memory & Persistent Memory Integration (Claude Code v2.1.32+, enhanced in v2.1.33+)

Enforcement Best Practices

Uh oh!

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

Claude Octopus - System Instructions

Visual Indicators (MANDATORY)

Indicator Reference

When to Display Indicators

Required Output Format

Compact Mode

Examples (Standard Mode)

During Execution

Why This Matters

File Creation Policy (CRITICAL)

Prohibited File Patterns

Where to Create Working Files

Plugin Directory: Permanent Files Only

Enforcement

Workflow Quick Reference

Provider Detection

Cost Awareness

Opus 4.7 Effort Levels (Claude Code v2.1.111+)

Fast Opus 4.6 Mode (Claude Code v2.1.36+, Opus 4.6 only)

Auto Memory & Persistent Memory Integration (Claude Code v2.1.32+, enhanced in v2.1.33+)

Enforcement Best Practices