home / skills / different-ai / agent-bank / skill-reinforcement

skill-reinforcement skill

safe

This skill automatically analyzes outcomes after each use and updates learnings to improve future performance and prevent knowledge loss.

npx playbooks add skill different-ai/agent-bank --skill skill-reinforcement

Review the files below or copy the command above to add this skill to your agents.

Files (1)

SKILL.md

7.7 KB

---
name: skill-reinforcement
description: Always and Automatically improve skills after each use by capturing learnings and anti-patterns
license: MIT
compatibility: opencode
metadata:
  audience: ai-agents
  workflow: meta-learning
  trigger: after-any-skill-use
---

## What I Do

After ANY skill is used, this meta-skill triggers to:

1. Analyze what worked and what didn't
2. Identify new patterns or shortcuts discovered
3. Update the skill file with learnings
4. Prevent knowledge loss between sessions

**Relationship with `self-improve`:**

- This skill (`skill-reinforcement`) = **WHEN** to update (post-use triggers)
- The `self-improve` skill = **HOW** to update (templates, structures, decision trees)

## When to Trigger

**Invoke this skill automatically when:**

- Any skill from `.opencode/skill/*/SKILL.md` completes
- A workflow succeeds or fails in a notable way
- New shortcuts or anti-patterns are discovered
- Token usage could be reduced with better patterns
- API behavior differs from documentation
- Commands fail and I find the fix
- User confirms something works
- I do the same task twice (should become a skill/tool)

## Reinforcement Process

### Step 1: Capture the Context

After using a skill, note:

```
- Skill used: [skill-name]
- Task: [what was being done]
- Outcome: [success/partial/failure]
- Token cost: [high/medium/low]
- Time taken: [fast/normal/slow]
```

### Step 2: Identify Learnings

Ask these questions:

1. **What took longer than expected?** → Document the fix
2. **What failed unexpectedly?** → Add to "Common Issues"
3. **What shortcut was discovered?** → Add to "Token Saving Tips"
4. **What assumption was wrong?** → Correct in documentation
5. **What worked better than documented?** → Update the workflow

### Step 3: Categorize the Learning

| Category          | Where to Add               | Example                      |
| ----------------- | -------------------------- | ---------------------------- |
| New shortcut      | "Token Saving Tips"        | OTP visible in email preview |
| Failure mode      | "Common Issues"            | Popup blocker breaks flow    |
| Better pattern    | Main workflow              | Check login state first      |
| Anti-pattern      | "Anti-Patterns to Avoid"   | Don't snapshot spam          |
| Environment quirk | "Prerequisites" or "Notes" | Session persists             |

### Step 4: Update the Skill File

```bash
# Read current skill
cat .opencode/skill/[skill-name]/SKILL.md

# Edit to add learning in appropriate section
# Use the Edit tool to append or modify
```

### Step 5: Validate the Update

Ensure updates are:

- **Actionable** - Not vague observations
- **Specific** - Include exact commands/patterns
- **Formatted** - Match existing style
- **Non-redundant** - Don't duplicate existing content

## Learning Templates

### For New Shortcuts

```markdown
### [Shortcut Name]

**Discovery**: [How it was found]
**Before**: [Old approach]
**After**: [New approach]
**Savings**: [Token/time reduction]
```

### For Failure Modes

```markdown
| Issue  | Symptom        | Fix              |
| ------ | -------------- | ---------------- |
| [Name] | [What you see] | [How to resolve] |
```

### For Anti-Patterns

````markdown
### Don't: [Bad Pattern Name]

```javascript
// BAD - [explanation]
[bad code]
```
````

### Do: [Good Pattern Name]

```javascript
// GOOD - [explanation]
[good code]
```

````

### For Workflow Improvements

```markdown
## Updated: [Section Name]

[New content that replaces or augments existing]

> **Note**: Updated [date] after discovering [context]
````

## Real Examples

### Example 1: Session Persistence Discovery

**Context**: Testing staging branch, went through full login flow
**Learning**: Chrome MCP persists sessions - was already logged in
**Action**: Added "Session Persistence is Your Friend" section with check-first pattern

### Example 2: OTP Extraction Optimization

**Context**: Opened email, waited for load, extracted OTP
**Learning**: OTP visible in Gmail list preview without opening email
**Action**: Updated OTP section with faster "search + list preview" method

### Example 3: Deployment URL Pattern

**Context**: Manually constructed URL, got it wrong
**Learning**: Can extract URL directly from Vercel bot's PR comment
**Action**: Added `gh pr view` command to get URL automatically

## Skill File Structure Convention

Every skill should have these sections (add if missing):

```markdown
## What I Do

[Core purpose]

## Prerequisites

[Requirements]

## Workflow

[Main steps]

## Common Issues

[Failure modes and fixes]

## Token Saving Tips

[Efficiency patterns]

## Anti-Patterns to Avoid

[What NOT to do]

## Real Examples

[Actual usage examples]

## Learnings Log

[Append-only log of discoveries - optional]
```

## Integration with Other Skills

When reinforcing a skill, check if learnings apply to related skills:

| Skill               | Related Skills                  |
| ------------------- | ------------------------------- |
| test-staging-branch | Any Chrome MCP skill            |
| skill-reinforcement | All skills (meta)               |
| self-improve        | skill-reinforcement (companion) |
| chrome-devtools-mcp | test-staging-branch, gmail      |
| safe-infrastructure | new-vault-implementation        |

Cross-pollinate learnings when applicable.

### Deciding What to Create

If reinforcement reveals a need for new capability, use the `self-improve` skill's decision tree:

```
Need to extend capabilities?
│
├─ Just need docs/commands? → SKILL
├─ Need specialized AI persona? → AGENT
├─ Need event hooks? → PLUGIN
├─ Need callable function? → TOOL
└─ Need external integration? → MCP SERVER
```

## Automation Hooks

### Post-Skill Trigger

After completing any skill, automatically ask:

1. "Did anything unexpected happen?"
2. "Was there a faster way to do this?"
3. "What would I do differently next time?"

If answers exist, invoke skill-reinforcement.

### Periodic Review

Every ~10 skill uses, review:

- Most frequently used skills (prioritize improvements)
- Skills with most "Common Issues" (need better documentation)
- Skills with outdated information (need refresh)

## Meta: Reinforcing This Skill

This skill should also improve itself. Track:

- How often it triggers
- Quality of captured learnings
- Whether skills actually improve over time
- Time cost of reinforcement vs value gained

## Quick Reinforcement Checklist

```
[ ] Skill completed
[ ] Outcome noted (success/fail/partial)
[ ] Any surprises? → Document
[ ] Any shortcuts found? → Add to tips
[ ] Any failures? → Add to issues
[ ] Could be faster? → Add anti-pattern
[ ] Update skill file
[ ] Validate formatting
[ ] Done
```

## Immediate Update Rule

**UPDATE IMMEDIATELY** - Don't wait until end of conversation.

When any of these happen, stop and update the relevant skill:

1. API format is wrong (like `-d` vs `-F` for curl)
2. Commands fail and I find the fix
3. User confirms something works
4. I discover a better/faster way
5. Something is missing from docs

Example:

```
❌ "I'll note this for later"
✅ *immediately edits skill file*
```

## Learnings Log

- 2026-01-12: Bulk remote branch cleanup can hit stale refs and timeouts; run `git fetch --prune origin` first, delete with a loop that tolerates missing refs, then prune again to verify only `origin/main` remains.
- 2026-01-13: When extending agent policy docs, renumber numbered headings after inserts and place testing tools + real-funds protocol near the Testability section for clarity.
- 2026-01-13: When refactoring MCP tool handlers into a registry, remove the legacy switch and align handler arg types with tool schemas to avoid type errors.
- 2026-01-13: When adding a new Next.js route handler export, ensure it sits outside existing handler functions to avoid "Modifiers cannot appear here" errors.

Overview

This skill automatically captures learnings and anti-patterns after any skill run and immediately improves the skill corpus. It prevents knowledge loss between sessions by analyzing outcomes, extracting shortcuts, documenting failures, and updating skill files with actionable edits. The goal is continuous, low-friction improvement after every use.

How this skill works

After a skill completes, the skill inspects the execution context, outcome, token usage, and time taken. It asks focused questions (what failed, what was faster, what assumption broke) and classifies discoveries into categories like shortcuts, failure modes, better patterns, anti-patterns, and environment quirks. Findings are formatted with templates and appended to the relevant skill’s documentation, then validated for clarity and non-redundancy. Periodic reviews surface high-impact areas across many skills.

When to use it

Automatically after any skill completes (success, partial, or failure)
When a workflow succeeds or fails in a notable or repeatable way
Whenever a new shortcut or token-saving pattern is discovered
If commands fail and you identify a reliable fix
When API behavior differs from docs or assumptions prove wrong

Best practices

Capture context immediately: skill name, task, outcome, token cost, and time
Use concise, actionable entries with exact commands or code snippets
Classify each learning into categories (shortcut, failure, anti-pattern, etc.)
Validate each update for formatting, specificity, and non-redundancy
Cross-pollinate learnings to related skills when applicable

Example use cases

Found OTP visible in email preview: add a token-saving shortcut to the OTP flow
Encountered session persistence in staging: add a check-first pattern to login workflows
Fixed a failing curl invocation: document the exact command fix under Common Issues
Observed repetitive manual steps: convert repeated actions into a new skill/tool
Noticed inconsistent API responses: update prerequisites and add an environment quirk note

FAQ

How soon should updates be applied?

Apply updates immediately after a meaningful discovery; avoid batching notes for much later.

What counts as an actionable learning?

Anything with a clear fix, exact command, or measurable benefit (token/time reduction) that others can follow.