M

How I Build

Context Engineering + Agentic Execution

How a solo founder operates at the output of a full engineering team. Not by working harder — by structuring information so AI agents can execute complex work with precision and minimal intervention.

80+
missions shipped
S11 · 11-E
current sprint
v32
schema version
L4.4
automation level
6 Levels Below

The Automation Levels

Level 1 — Manual

AI writes code, human integrates everything.

GitHub Copilot autocomplete, manual file management, no structure. Each file change required human context-switching between editor, terminal, and documentation. Useful for line-level suggestions, but no architectural awareness.

Level 2 — Assisted

AI edits files, human runs commands.

Windsurf for intelligent editing. VS Code Copilot for inline suggestions. Google’s Project Antigravity for multi-agent experiments. Each tool taught something — Antigravity showed the promise of agent orchestration, Windsurf proved that AI could manage multi-file edits. None delivered a structured, mission-driven pipeline for production code.

Key learning: The bottleneck was never code generation — it was context management.

Level 3 — Supervised Agentic

Claude Code executes missions. Human plans, approves, pushes.

The breakthrough. Claude Code shifted the paradigm from “AI assists a coder” to “architect directs an engineering team.” The core workflow crystallised: /start → /architect → /engineer → /summarise. Mission files became contracts — self-contained specs that agents execute line by line. The Golden Path rules emerged: package imports only, no Navigator.push, no setState, surgical migrations, dart analyze zero errors.

TrekMeet went from concept to Google Play Beta in this phase. 60+ missions shipped across 10 sprints.

Level 4 — Current

Currently at 4.4 · Phase 11-E live

Post-commit QA live. Multi-persona coordination. Diagnostic subagents.

Post-commit QA automation is live — every commit triggers dart analyze + flutter build into a structured JSON report, read by the CTO persona before push approval. Proven across Phase 11-E (April–May 2026): 17+ missions shipped under V-gate smoke discipline, 9 consecutive sibling-prone Mechanism+V-gates entries persisted to decisions.log, engineer end-to-end execution holds 3-for-3 under Persona Rule 26.

Multi-persona agent coordination is now formalised via the 2-scratchpad model — Architect writes architect_scratchpad.md (plan), CTO writes cto_scratchpad.md (sign-off), strict ownership prevents merge conflicts. The skill pipeline expanded from 4 commands to 12: /start, /architect, /engineer, /summarise, /sprint, /phase, /audit, /handover, /cto, /qa, /qa-review, /fixer.

Subagent-calibration awareness added: diagnostic subagents run narrowing disable-and-reproduce tests before fixes land, catching cases where theory doesn’t match reality.

The Agent Team

Opus

Orchestrator & AI Co-founder. Senior Solutions Architect who owns all tech decisions, writes mission files, runs sprint planning, and coordinates the full agent team.

Claude Code

Engineer. Implements missions autonomously, runs build gates, commits and reports.

UX Partner

Produces detailed UX specs. Reviewed by Opus, stored in docs, integrated into backlog.

Co-Pilot QA

Ingests screenshots and debug logs. Prepares structured reports for Opus triage.

Architect

Plans every mission. Reads context, drafts the step-by-step contract, HARD STOPs for CTO sign-off. Never writes code itself — separation of concerns is the discipline.

CTO

Reviews architect plans, signs off, owns rotation close-outs via /handover. Catches Rule 22 citation drift before engineer touches a file.

MCP — Backend Access

GitHub
Supabase
Firebase
PowerSync
Level 5 — Sprint 12+

Full autonomous loop: plan, build, test, deploy.

The engineering pipeline runs end-to-end without human message bussing. Sprint planning, mission generation, implementation, testing, and deployment happen as a continuous loop. The founder reviews results and makes product decisions — the agents handle everything else. The trekmeet-agent-framework is extracted as a reusable toolkit — the methodology becomes transferable.

Level 6 — Solo Founder Operating System

Multi-department agent orchestration. One founder, zero staff, two hours per day.

The endgame is not just automated engineering — it is an automated organisation. Engineering, QA, UX, community moderation, support, and growth are each managed by specialised agents. The founder starts each day with a Command Centre briefing: overnight metrics, pending actions with confidence levels, agent recommendations. Review each department (10–15 minutes each), approve or adjust, and the agents execute.

Virtual Departments

Engineering

Backlog management, bug triage, sprint execution, build monitoring.

Product & UX

Feature proposals from user feedback, UX specs, A/B test analysis.

Community

Content moderation, group approvals, dispute escalation.

Growth & Support

User acquisition, App Store optimisation, auto-categorised enquiries.

Self-healing agents monitor production health. Hotfix proposals auto-queue. UX validation agents compare deployed screens against specs. Sprint retrospectives auto-generate from mission metrics. The framework itself becomes extractable and reusable — a repeatable system for any solo founder building at scale with AI agents.

Live Session

trekmeet — phase 11-E
$ /start
Reading CLAUDE.md… Schema v32 confirmed
Reading PROJECT_STATE_SNAPSHOT.md… Phase 11-E
Reading BACKLOG.md… 18 items queued
MCP: GitHub ✓ Supabase ✓ Firebase ✓ PowerSync ✓
✓ All caught up. Ready for mission.
$ /architect BL-HF-BETA-LIFETIME-WIREUP
Opus reads context → validates arch → scopes 3 files
# HARD STOP — awaiting CEO approval
$ /engineer –approved
Sonnet executes → hooks enforce → dart analyze 0 errors
flutter build apk –debug –flavor dev → BUILD SUCCESSFUL
✓ Commit ready. SHA: 040df17
# Push review: git diff HEAD~1 → awaiting approval

Anatomy of a Mission

Every feature is a contract.

A mission file is the only context an executing agent receives. If something is ambiguous in the mission, the agent will guess wrong. Precision is the methodology.

Objective

One paragraph — what and why. No implementation detail here.

Architecture Decision

Pattern choices locked before execution begins. Agents must not freelance.

Infrastructure: CEO Parallel Work

Supabase SQL, dashboard config, RLS policies — runs while the agent builds.

Agent Steps (Numbered)

File path, exact changes, imports, before/after code blocks. Max 30KB or split.

Do Not Touch

Explicit list of files and patterns the agent must not modify. Prevents scope creep.

Verification Checklist

dart analyze zero errors, flutter build passes, manual test steps on device.

Risk Register

What could go wrong + mitigation. The agent knows its failure modes in advance.

This methodology shipped TrekMeet from zero to Google Play in 8 weeks.
The product is the proof. The framework is what scales.

© 2026 Manish Kumar. Cultivating focus, shipping software.  ·  hi@manish.pm