[ENHANCEMENT] - Combinatory Creativity Alignment #109

@samueltait

What This Is

This proposal adds five enhancements (E43-E47) that align Claude Octopus's orchestration architecture with James Webb Young's empirically validated five-step creativity framework (1939), supported by 80+ years of neuroscience and cognitive psychology research.

Young's thesis -- that idea production follows a systematic, repeatable process (Gather -> Digest -> Incubate -> Illuminate -> Test) -- maps structurally onto the Double Diamond workflow. Octopus, however, currently optimizes for throughput (how fast can we move through phases?), whereas the creativity research points to three higher-leverage interventions: input diversity (are we gathering enough varied material?), productive frustration (are we forcing enough combinations before converging?), and incubation timing (are we creating space for non-obvious synthesis?).

This package addresses five gaps that the creativity research exposes:

| ID | Enhancement | What It Does | Priority | Effort |
|----|-------------|--------------|----------|--------|
| E43 | Material Diversity Score | Quality gate on Probe/Discover input diversity before phase transition | P0 | 6-10h |
| E44 | Incubation Primitive | Configurable pause between Diamond phases with daemon-driven synthesis on resume | P1 | 8-14h |
| E45 | Productive Frustration Signal | Convergence detection in GRAPPLE debates -- auto-terminate when agents stop generating novel positions | P0 | 4-8h |
| E46 | Serendipity Agent | Cross-domain "general material" injection during Probe/Discover via a deliberately off-topic research agent | P1 | 10-16h |
| E47 | Combinatory Replay | Cross-session daemon that surfaces structural similarities between findings from prior sessions | P2 | 12-20h |

Total new effort: 40-68 hours across 3 waves
No existing code is modified. All additions are new capabilities layered on the v8.31.0 baseline.


The Core Argument

Young's Thesis (1939, validated 2001-2026)

"An idea is nothing more nor less than a new combination of old elements. The capacity to bring old elements into new combinations depends largely on the ability to see relationships."

Young's five-step method:

| Step | Action | Octopus Equivalent | Gap |
|------|--------|--------------------|-----|
| 1 | Gather raw material (specific + general) | Probe/Discover | No diversity measurement; general material missing |
| 2 | Mental digestion (try combinations until frustrated) | GRAPPLE debates | No convergence detection; debates run fixed rounds |
| 3 | Drop it completely (let subconscious incubate) | -- | No incubation primitive exists |
| 4 | Idea appears on its own | Synthesis step | Synthesis runs immediately; no gap for non-obvious combinations |
| 5 | Test against reality | Quality gates + SQUEEZE | Adequate (75% threshold + red team) |

Neuroscience validation:

  • The default mode network (DMN) performs sophisticated combinatory processing during apparent "rest" (Beaty et al., 2016; PMC3797545)
  • Incubation periods significantly improve creative problem-solving (Sio & Ormerod meta-analysis, 2009: d=0.393)
  • Diverse knowledge bases produce more novel combinations (Mednick, 1962; Simonton, 2010)
  • Forced convergence reduces creative output; frustration signals productive processing (Dijksterhuis & Nordgren, 2006)

Where Octopus Stands

Step 5  Test against reality   █████████░  Strong (quality gates, SQUEEZE)
Step 4  Illumination            ███████░░░  Good (synthesis), but no incubation gap
Step 3  Incubation              ██░░░░░░░░  Missing -- no deliberate pause mechanism
Step 2  Mental digestion        ██████░░░░  Partial (GRAPPLE exists, but fixed-round)
Step 1  Gather raw material     ██████░░░░  Partial (specific only; no diversity scoring)

The 5 Gaps

Gap 1 (Step 1): Input diversity is unmeasured. Probe/Discover gathers material for the specific task but does not measure whether the material is sufficiently diverse (cross-domain, contrarian, analogous). Young's sharpest warning: "You can't combine things you haven't collected." Without diversity scoring, the system proceeds to synthesis with potentially homogeneous input. E43 addresses this.
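
As a rough illustration of what a diversity gate could look like, here is a minimal sketch of the "source types x domain breadth x contrarian coverage" rubric. Everything in it is an assumption made for illustration: the `Source` fields, the target values, the 0.5 threshold, and the function names are hypothetical and are not taken from the Octopus codebase or from the E43 specification.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Source:
    kind: str         # hypothetical label, e.g. "paper", "docs", "forum"
    domain: str       # hypothetical label, e.g. "security", "logistics"
    contrarian: bool  # True if the source argues against the prevailing view


def diversity_score(sources, target_kinds=3, target_domains=3, target_contrarian=0.2):
    """Multiply three factors, each capped at 1.0 (targets are assumed values)."""
    if not sources:
        return 0.0
    kinds = len({s.kind for s in sources})
    domains = len({s.domain for s in sources})
    contrarian_share = sum(s.contrarian for s in sources) / len(sources)
    return (
        min(kinds / target_kinds, 1.0)
        * min(domains / target_domains, 1.0)
        * min(contrarian_share / target_contrarian, 1.0)
    )


def may_leave_probe(sources, threshold=0.5):
    """Gate the Probe -> Grasp transition on an assumed minimum score."""
    return diversity_score(sources) >= threshold


homogeneous = [Source("docs", "auth", False) for _ in range(4)]
varied = [
    Source("docs", "auth", False),
    Source("paper", "logistics", False),
    Source("forum", "biology", True),
    Source("docs", "auth", False),
]
```

Because the factors are multiplied, a material set with no contrarian sources scores zero no matter how many domains it covers. That is a deliberate choice in this sketch; an additive rubric would be more forgiving.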

Gap 2 (Step 2): GRAPPLE debates run fixed rounds. When agents converge early (all agree after round 2 of 4), remaining rounds generate diminishing returns. When agents diverge productively (novel positions still emerging at round 4), the debate terminates prematurely. Young says the signal to stop is productive frustration -- when you've exhausted conscious combinations. E45 addresses this.
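
One way to picture convergence detection is to track the share of positions in each round that have not appeared in any earlier round. The sketch below assumes positions have already been normalized to comparable strings, which in practice would need some form of semantic matching. The names `novelty_rates` and `stop_round`, the 0.25 novelty floor, and the 6-round cap are all hypothetical, and the function evaluates a recorded debate rather than a live one.

```python
def novelty_rates(rounds):
    """Fraction of each round's positions not seen in any earlier round."""
    seen = set()
    rates = []
    for positions in rounds:
        current = set(positions)
        new = current - seen
        rates.append(len(new) / len(current) if current else 0.0)
        seen |= current
    return rates


def stop_round(rounds, min_novelty=0.25, patience=1, max_rounds=6):
    """Return the 1-based round after which the debate should stop.

    The debate stops once novelty stays below min_novelty for `patience`
    consecutive rounds, or when max_rounds is reached.
    """
    stale = 0
    for i, rate in enumerate(novelty_rates(rounds[:max_rounds]), start=1):
        stale = stale + 1 if rate < min_novelty else 0
        if stale >= patience:
            return i
    return min(len(rounds), max_rounds)


# Agents agree from round 2 onward: stop early.
early_agreement = [{"a", "b"}, {"a", "b"}, {"a", "b"}, {"a"}]
# New positions keep emerging: run past the usual four rounds.
still_productive = [{"a"}, {"b"}, {"c"}, {"d"}, {"e"}]
```

With these inputs the first debate would stop after round 2, while the second runs all five recorded rounds, which mirrors the two failure modes described above.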

Gap 3 (Step 3): No incubation primitive exists. Phase transitions are immediate -- Probe output feeds directly into Grasp input. The creativity research says the gap between phases is where the DMN performs non-obvious combinatory work. For complex problems, proceeding immediately to the next phase forfeits the incubation benefit. E44 addresses this.
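
To make the idea of a pause between phases concrete, the following sketch models an incubation window as a small state object: it records when the pause began, collects notes while paused, and hands them back as a "what emerged?" payload on resume. The class name, its fields, and the payload shape are assumptions for illustration only and do not describe an existing Octopus API.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta


@dataclass
class Incubation:
    from_phase: str
    to_phase: str
    started_at: datetime
    min_pause: timedelta            # configurable pause length (assumed)
    notes: list = field(default_factory=list)

    def add_note(self, note):
        """Record something a background scan surfaced during the pause."""
        self.notes.append(note)

    def ready(self, now):
        return now - self.started_at >= self.min_pause

    def resume(self, now):
        """Return the next phase plus everything collected while paused."""
        if not self.ready(now):
            remaining = self.min_pause - (now - self.started_at)
            raise RuntimeError(f"incubating for another {remaining}")
        return {"next_phase": self.to_phase, "what_emerged": list(self.notes)}


start = datetime(2025, 1, 1, 9, 0)
inc = Incubation("probe", "grasp", start, timedelta(hours=4))
inc.add_note("queueing theory framing also fits the retry problem")
result = inc.resume(start + timedelta(hours=5))
```

Passing `now` explicitly instead of reading the clock inside the class keeps the sketch deterministic and easy to test.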

Gap 4 (Step 1): General material is absent. Every research agent in Probe/Discover is focused on the specific problem domain. Young distinguishes "specific material" (the problem) from "general material" (everything else). The general material provides the raw elements for cross-domain combination -- the "colored glass" in the kaleidoscope metaphor. E46 addresses this.

Gap 5 (cross-session): No combinatory memory. Each Octopus session gathers fresh material. Young says creative capacity grows with accumulated general knowledge. When a user researches authentication one week and logistics the next, structural similarities between findings are invisible unless someone actively looks for them. E47 addresses this.
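
A simple way to think about surfacing structural similarities is to tag each finding with abstract structural labels and compare tag sets across sessions. The sketch below uses Jaccard similarity as a stand-in technique; the proposal does not say how E47 would measure similarity, and the tag vocabulary, the 0.5 threshold, and the `replay` function are hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Size of the intersection divided by size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def replay(findings, threshold=0.5):
    """findings: list of (session, title, set_of_structural_tags).

    Returns cross-session pairs whose tag overlap meets the threshold,
    highest score first. Pairs from the same session are skipped.
    """
    matches = []
    for (s1, t1, tags1), (s2, t2, tags2) in combinations(findings, 2):
        if s1 == s2:
            continue
        score = jaccard(tags1, tags2)
        if score >= threshold:
            matches.append((round(score, 2), t1, t2))
    return sorted(matches, reverse=True)


findings = [
    ("auth", "Token handoff between services",
     {"handoff", "expiry", "trust-boundary"}),
    ("logistics", "Parcel custody transfer",
     {"handoff", "expiry", "chain-of-custody"}),
    ("logistics", "Route optimisation", {"graph-search"}),
]
matches = replay(findings)
```

Here the authentication and logistics findings share two of four distinct tags, so they surface as a candidate pair, while the route optimisation finding does not match anything.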


Architecture Overview

         ┌────────────────────────────────────────────────────────┐
         │          COMBINATORY CREATIVITY LAYER                  │
         │                                                        │
  Input  │  ┌───────────────────────────────────────────────┐     │
  Quality│  │  E43: Material Diversity Score                 │     │
         │  │  Rubric: source types x domain breadth x       │     │
         │  │  contrarian coverage → min threshold before     │     │
         │  │  phase transition Probe → Grasp                 │     │
         │  └────────────────────────┬──────────────────────┘     │
         │                           │                            │
  General│  ┌────────────────────────▼──────────────────────┐     │
  Material│ │  E46: Serendipity Agent                        │     │
         │  │  One Tier-3 agent researches structurally       │     │
         │  │  analogous problems in unrelated domains         │     │
         │  │  during Probe/Discover                           │     │
         │  └────────────────────────┬──────────────────────┘     │
         │                           │                            │
  Debate │  ┌────────────────────────▼──────────────────────┐     │
  Quality│  │  E45: Productive Frustration Signal             │     │
         │  │  Convergence detection in GRAPPLE: track         │     │
         │  │  novelty rate per round → auto-terminate when    │     │
         │  │  agents repeat positions or diverge exhaustively │     │
         │  └────────────────────────┬──────────────────────┘     │
         │                           │                            │
  Phase  │  ┌────────────────────────▼──────────────────────┐     │
  Gaps   │  │  E44: Incubation Primitive                      │     │
         │  │  Configurable pause between Diamond phases       │     │
         │  │  Daemon continues background scanning            │     │
         │  │  "What emerged?" synthesis on resume             │     │
         │  └────────────────────────┬──────────────────────┘     │
         │                           │                            │
  Memory │  ┌────────────────────────▼──────────────────────┐     │
         │  │  E47: Combinatory Replay                         │     │
         │  │  Cross-session daemon surfaces structural         │     │
         │  │  similarities between prior session findings      │     │
         │  └───────────────────────────────────────────────┘     │
         └────────────────────────────────────────────────────────┘

Wave Summary

| Wave | Enhancements | Effort | Calendar | Start Condition |
|------|--------------|--------|----------|-----------------|
| CC-1 (Foundation) | E43, E45 | 10-18h | 0.5-1 week | Zero external dependencies |
| CC-2 (Incubation) | E44, E46 | 18-30h | 1-2 weeks | E43 complete (diversity score informs incubation need) |
| CC-3 (Memory) | E47 | 12-20h | 1-1.5 weeks | E29 (agent memory isolation) -- soft dependency |
| Total | | 40-68h | 2.5-4.5 weeks | |
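
The wave totals follow directly from the per-enhancement estimates given earlier. The short check below reproduces that arithmetic; the figures come from this proposal, while the variable and function names are just for illustration.

```python
# (low, high) effort in hours per enhancement, as listed in this proposal.
ESTIMATES = {
    "E43": (6, 10),
    "E44": (8, 14),
    "E45": (4, 8),
    "E46": (10, 16),
    "E47": (12, 20),
}
WAVES = {"CC-1": ["E43", "E45"], "CC-2": ["E44", "E46"], "CC-3": ["E47"]}


def wave_effort(ids):
    """Sum the low and high estimates for a list of enhancement IDs."""
    low = sum(ESTIMATES[i][0] for i in ids)
    high = sum(ESTIMATES[i][1] for i in ids)
    return (low, high)


per_wave = {wave: wave_effort(ids) for wave, ids in WAVES.items()}
total = wave_effort([i for ids in WAVES.values() for i in ids])
```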

Why This Ordering

  • CC-1 first: E43 (diversity score) and E45 (frustration signal) are lightweight quality improvements to existing mechanisms. No new infrastructure. Immediate ROI.
  • CC-2 second: E44 (incubation) and E46 (serendipity agent) create new workflow primitives. E43's diversity score informs when incubation is most valuable (high-diversity input benefits more from incubation).
  • CC-3 last: E47 (combinatory replay) requires cross-session memory infrastructure (E29, soft) and benefits from accumulated session data. Highest long-term value but longest ramp.

What's In This Submission

| # | Document | ~Lines | Content |
|---|----------|--------|---------|
| 00 | 00-cover-letter.md (this file) | ~200 | Executive summary, architecture, effort, reading guide |
| 01 | 01-enhancement-architecture.md | ~300 | Young framework, CC-11, per-enhancement architecture, integration matrix |
| 02 | 02-enhancement-details.md | ~600 | Detailed specifications for E43-E47 with CLI examples |
| 03 | 03-effort-estimates.md | ~250 | Component-level effort breakdown, phasing, cost-benefit |

How to Read This Submission

  • 5-minute read: This cover letter (00) -- architecture + effort + rationale
  • 15-minute read: Cover letter (00) -> Enhancement architecture (01)
  • 30-minute read: Cover letter (00) -> Enhancement details (02) -> Effort estimates (03)
  • Full read: All 4 documents + source research


Labels: enhancement (New feature or request); future (Architecturally valid, not yet prioritized)