What's Next

The trajectory

Today, an AI assistant that opens the repository inherits a full governance system — layered enforcement rules, contextual instructions, and autonomous self-auditing workflows. It enforces the project's conventions autonomously — without prompting, without reminders, without drift. But it still operates within a single session, in a single editor, driven by a human sitting at a keyboard.

The roadmap extends this in three directions:

Memory — so agents retain context across sessions, projects, and time
Infrastructure — so agents run continuously on dedicated compute, not inside an IDE
Orchestration — so a single control plane coordinates many agents across many disciplines

Each layer builds on the one before it. Memory makes agents smarter. Infrastructure makes them persistent. Orchestration makes them collaborative.

Orchestration

Multi-agent coordination via DAG-based task decomposition and priority-weighted routing

▲

Infrastructure

Container-isolated agent runtimes with persistent sessions and scoped credential injection

▲

Memory

Tiered scopes (session / project / global) with embedding-based retrieval and governed retention

▲

Engineering governance (live)

Each capability layer builds on the one below it

Persistent AI memory

Current AI assistants are stateless. Every conversation starts from zero. Decisions made yesterday, patterns discovered last week, lessons learned across projects — all lost between sessions.

The next layer introduces a structured memory system designed to give AI agents persistent, queryable context. Not a raw vector store or a chat log dump — a governed memory architecture with tiered scopes (session, project, global), defined retention policies, and embedding-based retrieval that surfaces relevant context without flooding the agent's working memory.

Globalpersists across all projects

User preferences, coding patterns, architectural decisions, cross-project lessons learned

Projectpersists within a repository

Domain context, design trade-offs, historical decisions, business constraints specific to this product

Sessioncurrent conversation

Task-specific working state, in-progress reasoning, temporary context

Embedding-based retrieval surfaces relevant context at each tier — memory is structured, not a firehose

Three memory tiers — each scope has defined retention, retrieval, and access boundaries

Cross-session continuity — an agent picks up exactly where it left off, with full awareness of prior decisions, trade-offs, and context
Cross-project learning — patterns that work in one product inform decisions in the next, without manual transfer
Preference accumulation — coding style preferences, architectural decisions, business constraints, and operational context build up over time instead of being re-explained in every conversation
Scoped recall — agents retrieve relevant context for the task at hand, not everything ever stored; memory is structured, not a firehose

Agents without memory are tools. Agents with memory are team members. This is the foundation that makes every subsequent capability on this roadmap viable.

Cloud-native agent infrastructure

The governance system currently runs inside an editor. That works for human-driven, session-based development. It does not work for agents that need to operate autonomously, around the clock, on their own compute.

The next layer introduces dedicated cloud infrastructure for running AI agents:

Dedicated agent runtime — agents run in container-isolated cloud instances with their own compute, not inside a developer's editor; always available, always running, not blocked by a human closing a laptop
Parallel execution — multiple agents operate simultaneously on different tasks, coordinated by a shared task queue with priority-weighted scheduling
Persistent sessions — agents maintain stateful long-running sessions that survive container restarts, interruptions, and infrastructure changes
Resource isolation — each agent has scoped access to the repositories, services, and data it needs — nothing more; credentials are injected per-agent with least-privilege boundaries enforced at the infrastructure level

This is the shift from "AI assistant in your editor" to "AI workforce on your infrastructure."

Agent A

Engineering

Persistent session

Scoped: repo: main

Container-isolated

Agent B

DevOps

Persistent session

Scoped: cloud: staging

Container-isolated

Agent C

Persistent session

Scoped: repo: read-only

Container-isolated

Least-privilege by default — each agent sees only what it needs

Each agent runs in its own isolated container with scoped credentials and persistent state

The orchestration layer

Individual agents executing individual tasks is useful. A coordinated system where agents invoke each other, delegate sub-tasks, and collaborate toward a shared outcome is transformative.

The orchestration layer is a control plane that sits above individual agents and manages the entire operation:

Master orchestrator — a single coordinator that decomposes high-level objectives into directed acyclic task graphs (DAGs), assigns work to specialised agents, tracks progress across the graph, resolves blockers, and reports status
Agent specialisation — each agent has a defined role, a constrained toolset, and a specific area of expertise; the orchestrator routes tasks to the right agent based on the work, not based on a human deciding who should do what
Inter-agent communication — agents request work from each other, share findings, escalate decisions, and build on each other's outputs through structured handoffs
Human-in-the-loop controls — approvals, overrides, budget limits, and escalation thresholds are configurable; the system operates autonomously within defined boundaries and surfaces decisions that exceed its authority
Single command interface — the entire operation is controllable from a messaging channel on your phone; status updates, approvals, and strategic direction happen in natural language

Human-in-the-loop — approvals · overrides · budget limits

↓

Master orchestrator

DAG decomposition · routing · progress tracking

↙︎ ↓︎ ↘︎

System Design

Architecture · APIs · data models

Engineering

Implementation · conventions

DevOps

CI/CD · cloud · monitoring

Tests · regression · edge cases

Code Review

Standards · security · quality

Growth

Go-to-market · acquisition

Orchestration topology — master coordinator routes tasks to specialised agents within human-defined boundaries

Autonomous software delivery

The first domain the orchestrated agent system targets is software — because the engineering governance foundation already exists.

When agents run on dedicated infrastructure with persistent memory and orchestrated coordination, software delivery becomes largely autonomous:

Architecture and system design — translates product requirements into technical architecture, data models, API contracts, and infrastructure topology
Implementation — engineering agents write production code against established conventions, with the governance system enforcing quality as a structural constraint
DevOps and infrastructure — manages CI/CD pipelines, cloud provisioning, monitoring, alerting, and deployment orchestration
Quality assurance — writes and maintains test suites, runs regression testing, verifies edge cases, and flags coverage gaps
Code review — audits pull requests against the project's conventions, security requirements, and architectural constraints
UI/UX implementation — translates design specifications into accessible, responsive, convention-compliant components

The governance system is what makes this possible at quality. Without embedded engineering standards, autonomous agents produce code that compiles but drifts. With governance, every agent operates under the same constraints a senior engineer would enforce — automatically, consistently, and without fatigue.

The full business stack

Software delivery is the first domain, not the only one. The same orchestration model extends to every function a business needs to operate.

Strategy and planning

Product strategist — translates market signals, user feedback, and business constraints into product direction; prioritises features, evaluates trade-offs, and maintains the product roadmap
Business and financial advisor — models unit economics, revenue projections, pricing strategy, ROI analysis, cash flow planning, and funding scenarios; flags financial risks before they become problems
Legal advisor — reviews contracts, terms of service, privacy policies, licensing, compliance requirements, and regulatory obligations; escalates issues that require human legal counsel

Growth and market

Market research analyst — monitors competitive landscape, market trends, customer segments, and emerging opportunities; produces actionable intelligence
Growth and marketing strategist — plans and executes acquisition channels, content strategy, SEO, paid campaigns, conversion optimisation, and brand positioning
Sales strategist — manages pipeline, lead qualification, outreach strategy, pricing negotiations, and deal structure; optimises the path from prospect to closed revenue

Operations

Customer success and support — handles inbound requests, troubleshoots issues, maintains knowledge bases, tracks satisfaction metrics, and escalates complex cases
Operations manager — coordinates cross-functional workflows, manages vendor relationships, tracks KPIs, and ensures nothing falls through the cracks
HR and talent advisor — supports hiring pipelines, compensation benchmarking, onboarding processes, and team structure planning as the organisation scales

The lean start

A full business does not need all of these functions staffed from day one. It needs the functions covered. Early on, a single capable founder covers strategy, sales, and product. A senior engineer covers architecture, implementation, and DevOps. A growth person covers marketing, content, and outreach.

The agent system follows the same principle:

Stage	What the system covers
Solo founder	Product strategy, engineering (architecture + implementation + DevOps), basic marketing, financial modelling
Early stage	Add dedicated agents for growth, customer support, legal review, QA, and code review
Growth stage	Full specialisation — market research, sales strategy, operations management, HR support, advanced financial planning
Scale	Multiple agents per function, sub-specialisation within domains, cross-functional coordination at enterprise complexity

The system scales from "one person with a phone and a vision" to "a fully staffed operation" — without the overhead of hiring, onboarding, managing, and coordinating a growing organisation at every intermediate step.

What this looks like in practice

A founder opens a messaging channel on their phone. They describe what they want to build — the problem, the market, the initial scope.

The system decomposes this into work:

A product strategist validates the idea against market data and refines the scope
A financial advisor models the unit economics and pricing
A system design agent produces the technical architecture
Engineering agents build the product against the established governance system, with code review agents verifying quality
A DevOps agent provisions infrastructure and deploys
A growth agent plans the go-to-market and begins execution
A customer support agent stands ready for launch

The founder reviews progress, approves key decisions, and provides strategic direction — from their phone, in natural language, on their schedule. The system handles execution.

Founder

"Build a subscription billing service"

↓

Orchestrator

Decomposes into tasks, assigns to specialists, tracks progress

↙︎ ↓︎ ↘︎

Strategy

Validates scope

Architecture

Designs system

Engineering

Builds product

DevOps

Ships & monitors

↓

Output

Deployed product, monitored infrastructure, documented decisions

From a message on your phone to coordinated execution — the founder steers, the system builds

As the business grows, new agent roles activate. As complexity increases, the orchestrator coordinates deeper specialisation. The transition from lean startup to scaled operation is not a hiring cliff — it is a configuration change.

The phased rollout

Phase	Milestone	What it unlocks
1	Engineering governance (current)	A self-governing AI engineering system — layered governance, contextual enforcement, loop-free structural integrity, and autonomous self-auditing across every layer
2	Persistent memory	Cross-session continuity, cross-project learning, preference accumulation
3	Cloud agent infrastructure	Always-on agents, parallel execution, persistent sessions
4	Orchestration layer	Multi-agent coordination, task decomposition, inter-agent communication
5	Autonomous software delivery	End-to-end product development with minimal human intervention
6	Full business stack	Strategy, growth, operations, finance, legal, support — all agent-coordinated
7	Mobile command interface	Entire operation controllable from a messaging channel on your phone

Phase 1 is live. Each subsequent phase builds on the infrastructure and patterns established by the one before it. The governance system is not just the first feature — it is the architectural foundation that makes autonomous operation reliable at every subsequent phase.

Stage

What the system covers

Solo founder

Product strategy, engineering (architecture + implementation + DevOps), basic marketing, financial modelling

Early stage

Add dedicated agents for growth, customer support, legal review, QA, and code review

Growth stage

Full specialisation — market research, sales strategy, operations management, HR support, advanced financial planning

Scale

Multiple agents per function, sub-specialisation within domains, cross-functional coordination at enterprise complexity

Phase

Milestone

What it unlocks

Engineering governance (current)

A self-governing AI engineering system — layered governance, contextual enforcement, loop-free structural integrity, and autonomous self-auditing across every layer

Persistent memory

Cross-session continuity, cross-project learning, preference accumulation

Cloud agent infrastructure

Always-on agents, parallel execution, persistent sessions

Orchestration layer

Multi-agent coordination, task decomposition, inter-agent communication

Autonomous software delivery

End-to-end product development with minimal human intervention

Full business stack

Strategy, growth, operations, finance, legal, support — all agent-coordinated

Mobile command interface

Entire operation controllable from a messaging channel on your phone