home / skills / doanchienthangdev / omgkit / systematic-debugging

systematic-debugging skill

safe

/plugin/skills/methodology/systematic-debugging

This skill helps you debug systematically by guiding reproduction, isolation, hypothesis testing, and verification to identify and fix root causes efficiently.

npx playbooks add skill doanchienthangdev/omgkit --skill systematic-debugging

Review the files below or copy the command above to add this skill to your agents.

Files (1)

SKILL.md

2.9 KB

---
name: debugging-systematically
description: AI agent follows a 5-phase debugging process with reproduction, isolation, hypothesis testing, and root cause resolution. Use when investigating bugs, troubleshooting issues, or hunting errors.
---

# Debugging Systematically

## Quick Start

1. **Reproduce** - Create reliable reproduction steps, document environment
2. **Isolate** - Binary search to narrow down, create minimal repro case
3. **Hypothesize** - Generate 3+ theories with evidence and test cost
4. **Test** - Design tests to prove/disprove each hypothesis
5. **Fix & Verify** - Write failing test first, implement fix, verify green

## Features

| Feature | Description | Guide |
|---------|-------------|-------|
| Reproduction | Consistent steps to trigger bug | Exact steps, environment, frequency |
| Isolation | Narrow down problem area | Binary search code, git bisect |
| Hypothesis Ranking | Prioritize theories to test | Evidence strength x (1/test cost) |
| Strategic Logging | Add targeted debug output | `[DEBUG][Service][method]` format |
| Git Bisect | Find regression commit | `git bisect start`, mark good/bad |
| Regression Tests | Prevent bug from returning | Write failing test before fixing |

## Common Patterns

```
# Reproduction Template
Environment:
  OS: [version]
  Node: [version]
  Browser: [version]

Steps:
1. [Step 1]
2. [Step 2]
3. Observe: [Error]

Frequency: 100% | Intermittent (~50%)
First observed: [date]
Last known good: [commit/version]

# Isolation via Binary Search
function problematic() {
  // BLOCK A
  await stepA1();
  await stepA2();

  // BLOCK B
  await stepB1();
  await stepB2();

  // Comment out BLOCK B
  // Still fails? Bug in BLOCK A
  // Works now? Bug in BLOCK B
}

# Git Bisect
git bisect start
git bisect bad HEAD
git bisect good v2.0.0
# Git checks out middle, test and mark
git bisect good  # or: git bisect bad
# Repeat until culprit found
git bisect reset
```

```
# Hypothesis Template
| # | Hypothesis | Evidence | Test Cost | Priority |
|---|------------|----------|-----------|----------|
| H1 | Missing index | Seq scan in EXPLAIN | Low | 1st |
| H2 | N+1 query | Loop in code | Low | 2nd |
| H3 | Memory leak | Gradual increase | High | 3rd |
```

## Best Practices

| Do | Avoid |
|----|-------|
| Always reproduce before debugging | Debugging without reproduction |
| Write down hypotheses before testing | Testing multiple hypotheses at once |
| Use binary search for large codebases | Random code changes |
| Write failing test before fixing | Assuming cause without evidence |
| Document the debugging session | Ignoring intermittent bugs |
| Add logging strategically | Keeping debug code in production |
| Check for related issues | Fixing symptoms instead of root cause |

## Related Skills

- `solving-problems` - 5-phase problem-solving framework
- `tracing-root-causes` - 5 Whys and Fishbone analysis
- `avoiding-testing-anti-patterns` - Prevent flaky tests
- `verifying-before-completion` - Ensure fix is complete

Overview

This skill guides an AI agent through a disciplined 5-phase debugging process: Reproduce, Isolate, Hypothesize, Test, and Fix & Verify. It emphasizes creating reliable reproduction steps, narrowing scope with binary search, ranking hypotheses by evidence and test cost, and preventing regressions with failing tests. The approach is language-agnostic but includes JavaScript-friendly patterns and commands. Use it to turn chaotic troubleshooting into repeatable, measurable work.

How this skill works

The agent first produces a minimal, documented reproduction case and environment matrix so the issue can be triggered reliably. It then isolates the faulty area using techniques like binary search and git bisect to reduce the blast radius. The agent generates multiple ranked hypotheses with supporting evidence and estimated test costs, then designs targeted tests to prove or disprove each theory. Finally, it implements a failing test, applies a fix, and verifies the fix with regression tests and environment checks.

When to use it

Investigating bugs that are reproducible or intermittent
Troubleshooting performance regressions or unexpected behavior
Hunting errors introduced by recent commits or deployments
Preparing fixes that must include regression tests
Diagnosing issues across environments (local, CI, production)

Best practices

Always reproduce the bug and document environment, frequency, and steps
Isolate with binary search or git bisect before changing unrelated code
Write down 3+ hypotheses, record evidence and test cost, then prioritize
Add targeted, temporary logging instead of broad noisy traces
Write a failing test first, implement the smallest fix, then verify and commit the test

Example use cases

A JavaScript API endpoint returns incorrect data after a deploy — reproduce, bisect commits, add targeted logs, write a failing test, fix, and verify CI passes
Performance slowdown after dependency upgrade — reproduce with a benchmark, isolate hot paths, hypothesize missing indexes or N+1 queries, and validate with profiling
A regression introduced between two releases — run git bisect to find the offending commit, write a regression test that captures the bug, and apply a minimal corrective change

FAQ

What if the bug is non-deterministic?

Document variability and environment, increase logging and inputs to trigger it more reliably, and use statistical hypothesis tests or longer-duration runs to surface patterns.

How many hypotheses should I create?

Aim for at least three ranked hypotheses with evidence and estimated test cost; this prevents tunnel vision and helps prioritize inexpensive checks first.