home / skills / oimiragieo / agent-studio / dispatching-parallel-agents

dispatching-parallel-agents skill

safe

/.claude/skills/_archive/dead/dispatching-parallel-agents

This skill coordinates parallel investigation by dispatching independent agents for unrelated failures to accelerate resolution.

npx playbooks add skill oimiragieo/agent-studio --skill dispatching-parallel-agents

Review the files below or copy the command above to add this skill to your agents.

Files (1)

SKILL.md

6.6 KB

---
name: dispatching-parallel-agents
description: Concurrent investigation of independent failures. Use when multiple unrelated issues need parallel resolution.
version: 1.0
model: sonnet
invoked_by: both
user_invocable: true
tools: [Task, Read]
best_practices:
  - Only parallelize truly independent issues
  - Group by domain/subsystem
  - Verify no conflicts after integration
error_handling: graceful
streaming: supported
---

# Dispatching Parallel Agents

## Overview

When you have multiple unrelated failures (different test files, different subsystems, different bugs), investigating them sequentially wastes time. Each investigation is independent and can happen in parallel.

**Core principle:** Dispatch one agent per independent problem domain. Let them work concurrently.

## When to Use

```dot
digraph when_to_use {
    "Multiple failures?" [shape=diamond];
    "Are they independent?" [shape=diamond];
    "Single agent investigates all" [shape=box];
    "One agent per problem domain" [shape=box];
    "Can they work in parallel?" [shape=diamond];
    "Sequential agents" [shape=box];
    "Parallel dispatch" [shape=box];

    "Multiple failures?" -> "Are they independent?" [label="yes"];
    "Are they independent?" -> "Single agent investigates all" [label="no - related"];
    "Are they independent?" -> "Can they work in parallel?" [label="yes"];
    "Can they work in parallel?" -> "Parallel dispatch" [label="yes"];
    "Can they work in parallel?" -> "Sequential agents" [label="no - shared state"];
}
```

**Use when:**

- 3+ test files failing with different root causes
- Multiple subsystems broken independently
- Each problem can be understood without context from others
- No shared state between investigations

**Don't use when:**

- Failures are related (fix one might fix others)
- Need to understand full system state
- Agents would interfere with each other

## The Pattern

### 1. Identify Independent Domains

Group failures by what's broken:

- File A tests: Tool approval flow
- File B tests: Batch completion behavior
- File C tests: Abort functionality

Each domain is independent - fixing tool approval doesn't affect abort tests.

### 2. Create Focused Agent Tasks

Each agent gets:

- **Specific scope:** One test file or subsystem
- **Clear goal:** Make these tests pass
- **Constraints:** Don't change other code
- **Expected output:** Summary of what you found and fixed

### 3. Dispatch in Parallel

```typescript
// In Claude Code / AI environment
Task('Fix agent-tool-abort.test.ts failures');
Task('Fix batch-completion-behavior.test.ts failures');
Task('Fix tool-approval-race-conditions.test.ts failures');
// All three run concurrently
```

### 4. Review and Integrate

When agents return:

- Read each summary
- Verify fixes don't conflict
- Run full test suite
- Integrate all changes

## Agent Prompt Structure

Good agent prompts are:

1. **Focused** - One clear problem domain
2. **Self-contained** - All context needed to understand the problem
3. **Specific about output** - What should the agent return?

```markdown
Fix the 3 failing tests in src/agents/agent-tool-abort.test.ts:

1. "should abort tool with partial output capture" - expects 'interrupted at' in message
2. "should handle mixed completed and aborted tools" - fast tool aborted instead of completed
3. "should properly track pendingToolCount" - expects 3 results but gets 0

These are timing/race condition issues. Your task:

1. Read the test file and understand what each test verifies
2. Identify root cause - timing issues or actual bugs?
3. Fix by:
   - Replacing arbitrary timeouts with event-based waiting
   - Fixing bugs in abort implementation if found
   - Adjusting test expectations if testing changed behavior

Do NOT just increase timeouts - find the real issue.

Return: Summary of what you found and what you fixed.
```

## Common Mistakes

**X Too broad:** "Fix all the tests" - agent gets lost
**V Specific:** "Fix agent-tool-abort.test.ts" - focused scope

**X No context:** "Fix the race condition" - agent doesn't know where
**V Context:** Paste the error messages and test names

**X No constraints:** Agent might refactor everything
**V Constraints:** "Do NOT change production code" or "Fix tests only"

**X Vague output:** "Fix it" - you don't know what changed
**V Specific:** "Return summary of root cause and changes"

## When NOT to Use

**Related failures:** Fixing one might fix others - investigate together first
**Need full context:** Understanding requires seeing entire system
**Exploratory debugging:** You don't know what's broken yet
**Shared state:** Agents would interfere (editing same files, using same resources)

## Real Example from Session

**Scenario:** 6 test failures across 3 files after major refactoring

**Failures:**

- agent-tool-abort.test.ts: 3 failures (timing issues)
- batch-completion-behavior.test.ts: 2 failures (tools not executing)
- tool-approval-race-conditions.test.ts: 1 failure (execution count = 0)

**Decision:** Independent domains - abort logic separate from batch completion separate from race conditions

**Dispatch:**

```
Agent 1 -> Fix agent-tool-abort.test.ts
Agent 2 -> Fix batch-completion-behavior.test.ts
Agent 3 -> Fix tool-approval-race-conditions.test.ts
```

**Results:**

- Agent 1: Replaced timeouts with event-based waiting
- Agent 2: Fixed event structure bug (threadId in wrong place)
- Agent 3: Added wait for async tool execution to complete

**Integration:** All fixes independent, no conflicts, full suite green

**Time saved:** 3 problems solved in parallel vs sequentially

## Key Benefits

1. **Parallelization** - Multiple investigations happen simultaneously
2. **Focus** - Each agent has narrow scope, less context to track
3. **Independence** - Agents don't interfere with each other
4. **Speed** - 3 problems solved in time of 1

## Verification

After agents return:

1. **Review each summary** - Understand what changed
2. **Check for conflicts** - Did agents edit same code?
3. **Run full suite** - Verify all fixes work together
4. **Spot check** - Agents can make systematic errors

## Real-World Impact

From debugging session (2025-10-03):

- 6 failures across 3 files
- 3 agents dispatched in parallel
- All investigations completed concurrently
- All fixes integrated successfully
- Zero conflicts between agent changes

## Memory Protocol (MANDATORY)

**Before starting:**
Read `.claude/context/memory/learnings.md`

**After completing:**

- New pattern -> `.claude/context/memory/learnings.md`
- Issue found -> `.claude/context/memory/issues.md`
- Decision made -> `.claude/context/memory/decisions.md`

> ASSUME INTERRUPTION: If it's not in memory, it didn't happen.

Overview

This skill organizes concurrent investigations by dispatching independent agents to work on separate failures in parallel. It reduces time-to-resolution when multiple unrelated issues appear across tests or subsystems. The pattern emphasizes narrow scopes, clear goals, and safe integration of concurrent fixes.

How this skill works

Inspect failing tests or subsystems and group defects by independent domains. Create focused agent tasks (one per domain) with explicit scope, constraints, and expected outputs, then run those agents concurrently. When agents finish, review summaries, check for conflicts, run the full test suite, and integrate non-conflicting fixes.

When to use it

Three or more failing test files with different root causes
Multiple independently broken subsystems
Problems that can be investigated without cross-context information
No shared state or resources between investigations
You want to reduce overall debugging time by parallel work

Best practices

Define one clear goal per agent (specific file or subsystem)
Provide complete, self-contained context and error messages in the prompt
Set constraints (e.g., do NOT change production code) to limit scope
Require a concise summary of root cause and exact changes from each agent
Run full-suite verification and conflict checks before merging fixes

Example use cases

Three test files fail after a refactor; dispatch one agent per file to investigate simultaneously
Independent services show unrelated failures; assign one agent to each service to triage
A CI run reports failures across different modules with no shared state; parallel agents accelerate resolution
Time-sensitive incident with multiple distinct error traces; parallel agents reduce mean time to repair

FAQ

What if agents touch the same files?

Avoid parallel dispatch if investigations will edit the same files or share mutable state. If unavoidable, coordinate ownership or run sequentially to prevent conflicts.

How do I verify agent outputs before merging?

Require a short summary of root cause and changes, run the full test suite, and perform a manual conflict and spot-check review of edits before integration.