My Claude Code Workflow for Running Multiple Agents

Most people use Claude Code the way they used Stack Overflow — paste the problem, wait for the answer, move on. That's fine. It's also leaving most of the leverage on the table.

What I run is different. Seven products, one operator, a system where AI handles research, content, monitoring, deployment scaffolding, and large chunks of code. The claude code workflow I've landed on after a lot of trial and failure isn't glamorous. It's structured, it's opinionated, and it's the thing that actually ships.

Here's how it works.

The Problem With Single-Agent Thinking

When you treat Claude Code as a single assistant — one conversation, one task, one back-and-forth — you're using it like a smarter autocomplete. Useful, but bounded.

The bottleneck isn't Claude's capability. It's the architecture you've built around it. Single-agent thinking means:

Context windows that bloat and drift

One failure cascades into the whole session

You're the bottleneck between every task

Work can't happen in parallel

The shift that changed how my studio operates was treating agents as workers in a system, not assistants in a conversation. That reframe changes everything about how you structure prompts, sessions, and outputs.

How I Structure the Workflow

One Agent, One Job

Every agent in my system has a defined scope. Not "help me build this feature" — something tighter: generate the data access layer, write the test suite for this module, produce a first draft of the API documentation. The job is specific enough that the agent can succeed or fail clearly, and I can see which without reading five hundred lines of output.

When a task bleeds into adjacent concerns, I split it. Two smaller agents with clear handoffs beat one large agent trying to hold the whole context.

Context Is a Resource — Spend It Carefully

Context windows fill. When they do, quality degrades. This isn't speculation; I've watched it happen enough times to build around it.

My approach: each agent session starts with a tight system prompt that defines the role, the constraint, and the output format. The system prompt doesn't change mid-session. If the work requires a different frame, I open a new session.

For longer tasks — anything that might run across multiple sessions — I use handoff documents. A structured markdown file that captures: what was done, what decisions were made, what assumptions are active, and where the next agent should start. Claude Code can produce these on request. I've standardized the template so any agent in the system can read in a handoff and resume without re-deriving context.

The Orchestration Layer

I built a lightweight orchestration system I call VERA. It's not magic — it's a task queue, a session manager, and a logging layer with some custom tooling on top. The point isn't automation for its own sake. The point is removing me as the bottleneck between sequential tasks.

VERA handles: kicking off agent sessions with the right context, routing outputs to the right place, flagging failures for human review, and logging what ran and when. I'm not babysitting each session. I'm reviewing outputs, making decisions, and queuing the next task.

You don't need VERA to implement this pattern. You need the discipline to define clear inputs, expected outputs, and a place to log what happened. A structured folder and a few markdown templates will get you most of the way there before you need custom infrastructure.

The Real Claude Code Workflow, Step by Step

1. Define the Work Unit

Before opening Claude Code, I write a one-paragraph task brief. What is this agent being asked to produce? What constraints matter (framework, file structure, naming conventions)? What does success look like?

This takes three minutes. It prevents a lot of wandering.

2. Load the Right Context

For code tasks: the relevant files, the interface contracts, the existing patterns I want matched. I don't dump the whole repo. I load what the agent needs and nothing else.

For content or ops tasks: the brief, the relevant style guide sections, examples of outputs I've accepted before. Same principle — tight, not comprehensive.

3. Run the Session

I let it run. I'm not hovering. I have other work happening in parallel — other agents, other tasks, my own focus work. The session produces output or it flags a blocker.

4. Review at the Boundary

I review outputs at the handoff point, not throughout. This is the discipline that makes the whole system work. If I'm reviewing mid-session, I've become the agent. The point is to push review to the boundary, evaluate the output against the brief, and decide: accept, revise the brief and rerun, or flag as a decision that needs human judgment.

5. Log and Move

Every completed task gets a log entry: what was produced, what changed, any decisions that were made. Thirty seconds of logging prevents hours of archaeology later.

Where It Breaks

Learned the hard way: the workflow degrades in predictable places.

Ambiguous briefs. If the task definition is fuzzy, the output is fuzzy. The agent isn't going to ask clarifying questions the way a senior engineer would. It fills gaps with assumptions. Tighten the brief.

Missing constraints. I once let an agent scaffold a module without specifying the file naming convention. It used a different pattern than the rest of the codebase. Technically correct, practically a refactor. Now naming conventions are in the standard context load.

Chained sessions with no handoff docs. When a multi-session task doesn't have a proper handoff, the second session re-derives context, sometimes differently. You get drift. The handoff document isn't overhead — it's what makes chaining reliable.

Reviewing too frequently. If you're checking in every few minutes, you're not running a multi-agent workflow. You're just a slower developer with an expensive autocomplete. Batch your review cycles.

What This Actually Produces

Shipping today across seven products as a solo operator means this workflow is running most days. A few concrete examples of what it handles:

Full test suite generation for a new module, with edge cases, starting from the interface contract

Product changelog drafts generated from commit summaries and diffed against prior release notes

API documentation written from the route definitions and type signatures

Monitoring and alerting configuration scaffolded from an infra checklist

Marketing copy variants produced from a positioning brief

None of that is me sitting in a conversation asking Claude to help. It's structured work units, defined outputs, and a system for reviewing what comes back.

The Bigger Picture

The claude code workflow I've described isn't cutting-edge research. It's operational discipline applied to a new kind of tool. The principles aren't new: clear task definition, context management, review at boundaries, logging. What's new is that following these principles with AI agents produces leverage that was previously impossible for a solo operator.

You don't need a team. You need the architecture.

If you want to see the full stack I use to run this — tools, file structures, agent templates — The Builder's Playbook covers the operating model in detail, including how the agent system fits into the broader studio structure.

If you want to work through how this applies to your specific product or stack, book a 1:1 and we'll map it out.

My Claude Code Workflow for Running Multiple Agents

Most people use Claude Code the way they used Stack Overflow — paste the problem, wait for the answer, move on. That's fine. It's also leaving most of the leverage on the table.

Here's how it works.

The Problem With Single-Agent Thinking

When you treat Claude Code as a single assistant — one conversation, one task, one back-and-forth — you're using it like a smarter autocomplete. Useful, but bounded.

The bottleneck isn't Claude's capability. It's the architecture you've built around it. Single-agent thinking means:

Context windows that bloat and drift

One failure cascades into the whole session

You're the bottleneck between every task

Work can't happen in parallel

How I Structure the Workflow

One Agent, One Job

When a task bleeds into adjacent concerns, I split it. Two smaller agents with clear handoffs beat one large agent trying to hold the whole context.

Context Is a Resource — Spend It Carefully

Context windows fill. When they do, quality degrades. This isn't speculation; I've watched it happen enough times to build around it.

The Orchestration Layer

The Real Claude Code Workflow, Step by Step

1. Define the Work Unit

This takes three minutes. It prevents a lot of wandering.

2. Load the Right Context

For code tasks: the relevant files, the interface contracts, the existing patterns I want matched. I don't dump the whole repo. I load what the agent needs and nothing else.

For content or ops tasks: the brief, the relevant style guide sections, examples of outputs I've accepted before. Same principle — tight, not comprehensive.

3. Run the Session

I let it run. I'm not hovering. I have other work happening in parallel — other agents, other tasks, my own focus work. The session produces output or it flags a blocker.

4. Review at the Boundary

5. Log and Move

Every completed task gets a log entry: what was produced, what changed, any decisions that were made. Thirty seconds of logging prevents hours of archaeology later.

Where It Breaks

Learned the hard way: the workflow degrades in predictable places.

What This Actually Produces

Shipping today across seven products as a solo operator means this workflow is running most days. A few concrete examples of what it handles:

Full test suite generation for a new module, with edge cases, starting from the interface contract

Product changelog drafts generated from commit summaries and diffed against prior release notes

API documentation written from the route definitions and type signatures

Monitoring and alerting configuration scaffolded from an infra checklist

Marketing copy variants produced from a positioning brief

None of that is me sitting in a conversation asking Claude to help. It's structured work units, defined outputs, and a system for reviewing what comes back.

The Bigger Picture

You don't need a team. You need the architecture.

If you want to work through how this applies to your specific product or stack, book a 1:1 and we'll map it out.

My Claude Code Workflow for Running Multiple Agents

My Claude Code Workflow for Running Multiple Agents

The Problem With Single-Agent Thinking

How I Structure the Workflow

One Agent, One Job

Context Is a Resource — Spend It Carefully

The Orchestration Layer

The Real Claude Code Workflow, Step by Step

1. Define the Work Unit

2. Load the Right Context

3. Run the Session

4. Review at the Boundary

5. Log and Move

Where It Breaks

What This Actually Produces

The Bigger Picture

Building an AI Story App: The Architecture of Inky

Building an AI Story App: Lessons from Agentic Engineering

Building an AI Story App: Lessons from the Studio Floor

My Claude Code Workflow for Running Multiple Agents

My Claude Code Workflow for Running Multiple Agents

The Problem With Single-Agent Thinking

How I Structure the Workflow

One Agent, One Job

Context Is a Resource — Spend It Carefully

The Orchestration Layer

The Real Claude Code Workflow, Step by Step

1. Define the Work Unit

2. Load the Right Context

3. Run the Session

4. Review at the Boundary

5. Log and Move

Where It Breaks

What This Actually Produces

The Bigger Picture

Building an AI Story App: The Architecture of Inky

Building an AI Story App: Lessons from Agentic Engineering

Building an AI Story App: Lessons from the Studio Floor

My Claude Code Workflow for Running Multiple Agents

My Claude Code Workflow for Running Multiple Agents

The Problem With Single-Agent Thinking

How I Structure the Workflow

One Agent, One Job

Context Is a Resource — Spend It Carefully

The Orchestration Layer

The Real Claude Code Workflow, Step by Step

1. Define the Work Unit

2. Load the Right Context

3. Run the Session

4. Review at the Boundary

5. Log and Move

Where It Breaks

What This Actually Produces

The Bigger Picture

How I’m building the studio.

Related posts

Building an AI Story App: The Architecture of Inky

Building an AI Story App: Lessons from Agentic Engineering

Building an AI Story App: Lessons from the Studio Floor

My Claude Code Workflow for Running Multiple Agents

My Claude Code Workflow for Running Multiple Agents

The Problem With Single-Agent Thinking

How I Structure the Workflow

One Agent, One Job

Context Is a Resource — Spend It Carefully

The Orchestration Layer

The Real Claude Code Workflow, Step by Step

1. Define the Work Unit

2. Load the Right Context

3. Run the Session

4. Review at the Boundary

5. Log and Move

Where It Breaks

What This Actually Produces

The Bigger Picture

How I’m building the studio.

Related posts

Building an AI Story App: The Architecture of Inky

Building an AI Story App: Lessons from Agentic Engineering

Building an AI Story App: Lessons from the Studio Floor