The trajectory
Today, an AI assistant that opens the repository inherits a full governance system — layered enforcement rules, contextual instructions, and autonomous self-auditing workflows. It enforces the project's conventions autonomously — without prompting, without reminders, without drift. But it still operates within a single session, in a single editor, driven by a human sitting at a keyboard.
The roadmap extends this in three directions:
- Memory — so agents retain context across sessions, projects, and time
- Infrastructure — so agents run continuously on dedicated compute, not inside an IDE
- Orchestration — so a single control plane coordinates many agents across many disciplines
Each layer builds on the one before it. Memory makes agents smarter. Infrastructure makes them persistent. Orchestration makes them collaborative.
Persistent AI memory
Current AI assistants are stateless. Every conversation starts from zero. Decisions made yesterday, patterns discovered last week, lessons learned across projects — all lost between sessions.
The next layer introduces a structured memory system designed to give AI agents persistent, queryable context. Not a raw vector store or a chat log dump — a governed memory architecture with defined scopes, retention policies, and retrieval strategies.
- Cross-session continuity — an agent picks up exactly where it left off, with full awareness of prior decisions, trade-offs, and context
- Cross-project learning — patterns that work in one product inform decisions in the next, without manual transfer
- Preference accumulation — coding style preferences, architectural decisions, business constraints, and operational context build up over time instead of being re-explained in every conversation
- Scoped recall — agents retrieve relevant context for the task at hand, not everything ever stored; memory is structured, not a firehose
Agents without memory are tools. Agents with memory are team members. This is the foundation that makes every subsequent capability on this roadmap viable.
Cloud-native agent infrastructure
The governance system currently runs inside an editor. That works for human-driven, session-based development. It does not work for agents that need to operate autonomously, around the clock, on their own compute.
The next layer introduces dedicated cloud infrastructure for running AI agents:
- Dedicated agent runtime — agents run on provisioned cloud instances, not inside a developer's editor; always available, always running, not blocked by a human closing a laptop
- Parallel execution — multiple agents operate simultaneously on different tasks, coordinated by a shared task queue
- Persistent sessions — agents maintain long-running sessions that survive restarts, interruptions, and infrastructure changes
- Resource isolation — each agent has scoped access to the repositories, services, and data it needs — nothing more; credential management and permission boundaries are enforced at the infrastructure level
This is the shift from "AI assistant in your editor" to "AI workforce on your infrastructure."
The orchestration layer
Individual agents executing individual tasks is useful. A coordinated system where agents invoke each other, delegate sub-tasks, and collaborate toward a shared outcome is transformative.
The orchestration layer is a control plane that sits above individual agents and manages the entire operation:
- Master orchestrator — a single coordinator that decomposes high-level objectives into task graphs, assigns work to specialised agents, tracks progress, resolves blockers, and reports status
- Agent specialisation — each agent has a defined role, a constrained toolset, and a specific area of expertise; the orchestrator routes tasks to the right agent based on the work, not based on a human deciding who should do what
- Inter-agent communication — agents request work from each other, share findings, escalate decisions, and build on each other's outputs through structured handoffs
- Human-in-the-loop controls — approvals, overrides, budget limits, and escalation thresholds are configurable; the system operates autonomously within defined boundaries and surfaces decisions that exceed its authority
- Single command interface — the entire operation is controllable from a messaging channel on your phone; status updates, approvals, and strategic direction happen in natural language
Autonomous software delivery
The first domain the orchestrated agent system targets is software — because the engineering governance foundation already exists.
When agents run on dedicated infrastructure with persistent memory and orchestrated coordination, software delivery becomes largely autonomous:
- Architecture and system design — translates product requirements into technical architecture, data models, API contracts, and infrastructure topology
- Implementation — engineering agents write production code against established conventions, with the governance system enforcing quality as a structural constraint
- DevOps and infrastructure — manages CI/CD pipelines, cloud provisioning, monitoring, alerting, and deployment orchestration
- Quality assurance — writes and maintains test suites, runs regression testing, verifies edge cases, and flags coverage gaps
- Code review — audits pull requests against the project's conventions, security requirements, and architectural constraints
- UI/UX implementation — translates design specifications into accessible, responsive, convention-compliant components
The governance system is what makes this possible at quality. Without embedded engineering standards, autonomous agents produce code that compiles but drifts. With governance, every agent operates under the same constraints a senior engineer would enforce — automatically, consistently, and without fatigue.
The full business stack
Software delivery is the first domain, not the only one. The same orchestration model extends to every function a business needs to operate.
Strategy and planning
- Product strategist — translates market signals, user feedback, and business constraints into product direction; prioritises features, evaluates trade-offs, and maintains the product roadmap
- Business and financial advisor — models unit economics, revenue projections, pricing strategy, ROI analysis, cash flow planning, and funding scenarios; flags financial risks before they become problems
- Legal advisor — reviews contracts, terms of service, privacy policies, licensing, compliance requirements, and regulatory obligations; escalates issues that require human legal counsel
Growth and market
- Market research analyst — monitors competitive landscape, market trends, customer segments, and emerging opportunities; produces actionable intelligence
- Growth and marketing strategist — plans and executes acquisition channels, content strategy, SEO, paid campaigns, conversion optimisation, and brand positioning
- Sales strategist — manages pipeline, lead qualification, outreach strategy, pricing negotiations, and deal structure; optimises the path from prospect to closed revenue
Operations
- Customer success and support — handles inbound requests, troubleshoots issues, maintains knowledge bases, tracks satisfaction metrics, and escalates complex cases
- Operations manager — coordinates cross-functional workflows, manages vendor relationships, tracks KPIs, and ensures nothing falls through the cracks
- HR and talent advisor — supports hiring pipelines, compensation benchmarking, onboarding processes, and team structure planning as the organisation scales
The lean start
A full business does not need all of these functions staffed from day one. It needs the functions covered. Early on, a single capable founder covers strategy, sales, and product. A senior engineer covers architecture, implementation, and DevOps. A growth person covers marketing, content, and outreach.
The agent system follows the same principle:
| Stage | What the system covers |
|---|---|
| Solo founder | Product strategy, engineering (architecture + implementation + DevOps), basic marketing, financial modelling |
| Early stage | Add dedicated agents for growth, customer support, legal review, QA, and code review |
| Growth stage | Full specialisation — market research, sales strategy, operations management, HR support, advanced financial planning |
| Scale | Multiple agents per function, sub-specialisation within domains, cross-functional coordination at enterprise complexity |
The system scales from "one person with a phone and a vision" to "a fully staffed operation" — without the overhead of hiring, onboarding, managing, and coordinating a growing organisation at every intermediate step.
What this looks like in practice
A founder opens a messaging channel on their phone. They describe what they want to build — the problem, the market, the initial scope.
The system decomposes this into work:
- A product strategist validates the idea against market data and refines the scope
- A financial advisor models the unit economics and pricing
- A system design agent produces the technical architecture
- Engineering agents build the product against the established governance system, with code review agents verifying quality
- A DevOps agent provisions infrastructure and deploys
- A growth agent plans the go-to-market and begins execution
- A customer support agent stands ready for launch
The founder reviews progress, approves key decisions, and provides strategic direction — from their phone, in natural language, on their schedule. The system handles execution.
As the business grows, new agent roles activate. As complexity increases, the orchestrator coordinates deeper specialisation. The transition from lean startup to scaled operation is not a hiring cliff — it is a configuration change.
The phased rollout
| Phase | Milestone | What it unlocks |
|---|---|---|
| 1 | Engineering governance (current) | A self-governing AI engineering system — layered governance, contextual enforcement, loop-free structural integrity, and autonomous self-auditing across every layer |
| 2 | Persistent memory | Cross-session continuity, cross-project learning, preference accumulation |
| 3 | Cloud agent infrastructure | Always-on agents, parallel execution, persistent sessions |
| 4 | Orchestration layer | Multi-agent coordination, task decomposition, inter-agent communication |
| 5 | Autonomous software delivery | End-to-end product development with minimal human intervention |
| 6 | Full business stack | Strategy, growth, operations, finance, legal, support — all agent-coordinated |
| 7 | Mobile command interface | Entire operation controllable from a messaging channel on your phone |
Phase 1 is live. Each subsequent phase builds on the infrastructure and patterns established by the one before it. The governance system is not just the first feature — it is the architectural foundation that makes autonomous operation reliable at every subsequent phase.