# Design Spec — Futures Gate-Tree Forecasting Project **Date**: 2026-05-13 **Author**: Tamir + Claude Operator **Status**: Approved → research phase ## Context Tamir wants a structured, research-backed forecasting artifact covering the impact of AI and robotics on cost of living and quality of life over a **near-to-medium horizon (1–10 years)**. Outputs serve five overlapping uses, in priority order: (1) personal decisions, (2) long-term family planning, (3) public-facing artifact, (4) living monitor calibrated over time, (5) basis for colleagual discussion and business navigation. The unit of forecasting is a **gate** — a specific, measurable trigger event with a P10/P50/P90 timeline and a downstream essay on what changes for Tamir's life when it passes. This document is the design contract. It does not contain research findings; those land in `src/gates/.md` per gate. ## Scope (locked) - **11 gates**, listed below. Each is a peer-level trigger event (not a sub-step of another), measurable, near-to-medium horizon. - **Approach A — scenario-driven**: each gate's research includes a "downstream essay" mapping its impact across Tamir's dimensions. - **Dimensions covered**: food, housing, utilities, education, childcare, global metals supply, food availability, travel. - **Depth per gate**: ~2–4 hours of synthesis (Deep tier, not Sketch). - **Cross-gate dependencies**: captured as structured frontmatter (`sub_gates`, `cross_gate` with relation types `enables` / `correlates` / `substitutes`) plus a post-pass `cross-gate-synthesis.md`. ## The 11 gates | # | Slug | Trigger threshold | |---|---|---| | 1 | `humanoid-retail-20k` | Mass-market general-purpose humanoid robot (Figure / Optimus / 1X SKU) reaches **< $20,000 retail** in an OECD market | | 2 | `ai-agent-30pct-knowledge-work` | An AI agent autonomously completes ≥ **30%** of tasks in a typical knowledge-work role (legal / dev / content / support) without human review on each step | | 3 | `residential-solar-storage-0.04` | Residential solar + storage system delivers **< $0.04/kWh** over 25-yr amortized full-system cost in 3+ US states | | 4 | `robotaxi-unit-economics-5-cities` | Robotaxi (Waymo / Tesla / Zoox) reaches **profitable unit economics** in 5+ US cities concurrently | | 5 | `ai-tutor-k8-parity-20mo` | Conversational AI tutor matches a human teacher on standardized K-8 metrics at **< $20/month** retail, used by ≥ 1M families | | 6 | `cell-meat-beef-parity` | Cell-cultured beef achieves **retail price parity** with conventional ground beef in a major grocery chain | | 7 | `construction-robot-40pct-labor` | Construction-automation robot stack cuts **new-home labor cost by ≥ 40%** on representative single-family builds | | 8 | `metals-bom-30pct` | EV/robot bill-of-materials cost drops **30%** via Li + REE supply diversification (China dependency on key minerals < 60%) | | 9 | `evtol-1k-trips-major-city` | eVTOL / air-taxi commercial service runs **> 1,000 trips/day** in a major OECD city | | 10 | `smr-first-oecd-deployment` | First commercial **SMR (sub-300 MW)** deployment in an OECD country providing baseload to the grid | | 11 | `autonomous-freight-delivery` | Autonomous trucking + last-mile delivery cuts **labor share of US freight + last-mile cost by ≥ 30%** on national average (compound trigger across long-haul and last-mile) | ## File structure ``` ~/projects/futures/ ├── BRAIN.md ├── README.md # later ├── docs/ │ └── superpowers/ │ └── specs/ │ └── 2026-05-13-futures-gate-tree-design.md # this file ├── src/ │ └── gates/ │ ├── humanoid-retail-20k.md │ ├── ai-agent-30pct-knowledge-work.md │ ├── ... (11 total) │ └── cross-gate-synthesis.md # post-pass └── (V2) astro project files ``` ## Per-gate `.md` template ### Frontmatter (YAML) ```yaml --- title: slug: status: draft | in-research | reviewed | published dimensions: [food, housing, utilities, education, childcare, metals, food-availability, travel] # subset horizon: near | medium trigger: "" timeline: p10: # 10% chance the gate passes by this year p50: # median estimate p90: # 90% chance by this year (or "never" if multi-modal) confidence: low | medium | high # how solid the timeline is sub_gates: # upstream causal dependencies (not in our 11) - slug: p50: why: "" cross_gate: # links to OTHER gates in our 11 - other: relation: enables | correlates | substitutes strength: weak | medium | strong note: "" external_calibration: metaculus: manifold: expert_consensus: "" last_updated: YYYY-MM-DD sources_count: --- ``` ### Body sections (markdown) 1. **TL;DR** — one paragraph synthesizing the timeline + thesis. 2. **Current state** — where the threshold sits as of `last_updated`, with 2–3 hard numbers (price, capability, market share, deployment count). 3. **Key uncertainties** — 3–5 things that would shift the P50 by ≥ 2 years if resolved. 4. **Evidence synthesis** — bulk of the 2–4h research. Sub-sections per stream: - Academic (arXiv + Semantic Scholar) - Industry / market (Exa + Firecrawl) - Public sentiment (Reddit) - Prediction markets (Metaculus / Manifold via WebFetch) - Policy / regulation (Exa + WebSearch + claude-in-chrome) 5. **Sub-gates** — list upstream causal dependencies (frontmatter mirror). 6. **Cross-gate dependencies** — explicit prose about how this gate interacts with the other 10 (enables/correlates/substitutes). 7. **Downstream impact essay** — one paragraph per affected outcome dimension. What changes for food cost, housing cost, etc., if/when this gate passes. 8. **Decision implications** — 1–2 paragraphs: what Tamir should consider doing differently at P10 vs P50 vs P90 outcomes (career, family, investments, real estate, kids' education). 9. **Sources** — numbered, annotated list of every source used. ## HTML rendering (V2) Astro static site. Each `.md` → one route: - **Top half**: rendered prose (sections 1–8 above), styled, with inline source citations linking to the Sources section. - **Bottom half**: a `

` block titled "Full markdown" containing the entire `.md` file verbatim (frontmatter + body), with a copy-to-clipboard button. The `.md` is also served as a static asset at `/gates/.md` so the HTML page can link to it directly ("View raw" link). - **Index page**: grid of all gates with filter chips for dimension + status. Shows P50 timeline distribution as a single overlay chart. - **Per-dimension index pages**: aggregate all gates affecting `food` / `housing` / etc., compute "earliest meaningful change" date = P50 of the earliest gate delivering ≥ 20% of impact. Visual widgets — timeline distribution charts, dependency-graph SVG, outcome-impact bars — are V2 add-ons. V1 is "markdown rendered to clean HTML + raw-markdown section + raw-file download link". ## Research workflow per gate 1. **Dispatch a research agent** (`general-purpose`) with all 5 evidence-stream MCPs available and a brief specifying the gate's trigger sentence, dimensions, and required output sections. Each agent runs ~30–90 min. 2. **Agent returns** structured findings + draft `.md` content. 3. **I synthesize and finalize** the `.md`: confirm the P10/P50/P90 against the evidence, write the downstream impact essay (Tamir-specific), commit, push. 4. **Repeat** for all 11 gates. 5. **Post-pass**: write `cross-gate-synthesis.md` after all individual gates are done. For all 11 gates running in parallel: agent dispatch can be batched 3–4 at a time to keep context manageable. ## Cross-gate synthesis pass After every gate is reviewed/published, write `src/gates/cross-gate-synthesis.md`: - **Year clusters**: which gates' P50s land within ±1 year of each other? Compound effects. - **Enables chains**: full graph of which gates accelerate which others (#8 metals → #1 humanoid, #4 robotaxi, #11 trucks; #3 solar → #1, #11; #2 AI agent → meta-accelerator). - **Substitution loops**: which gates compete (#3 vs #10). - **Outcome bottlenecks**: per dimension, which gate is the rate-limiting step. - **Calibration summary**: aggregate Metaculus/Manifold deltas vs. our P50s. ## Verification End-to-end test of the workflow on **one gate first** to validate template + agent brief + synthesis quality before parallelizing across 11: 1. Run research workflow on gate #2 (`ai-agent-30pct-knowledge-work`) — highest decision-relevance for Tamir. 2. Inspect the resulting `.md`: frontmatter complete, all 9 body sections filled, source count ≥ 8, P10/P50/P90 grounded in cited evidence. 3. If quality is good, fan out to remaining 10 gates in 3 batches of 3–4. 4. After all 11 are reviewed, write the cross-gate synthesis. 5. (V2) Stand up Astro scaffold, render the .md files, deploy to Vercel. ## Out of scope (V1) - Astro / Next.js / any web app scaffold (deferred to V2 after research is solid) - Vercel deployment + domain (deferred) - Visual widgets — timeline charts, dependency graphs, impact bars (deferred) - Monte Carlo propagation over inter-gate dependencies (V2 stretch) - Notion mirror of working notes (out unless explicitly added) - Gates outside the 11 listed (e.g. downside / climate / geopolitical gates) — can be added in a future scope expansion ## Open questions None remaining at design time. Move to research.