home / skills / zhanghandong / rust-skills / m06-error-handling

m06-error-handling skill

/skills/m06-error-handling

This skill helps you assess and apply error handling strategies in Rust, guiding when to use Result, Option, or panic with context.

npx playbooks add skill zhanghandong/rust-skills --skill m06-error-handling

Review the files below or copy the command above to add this skill to your agents.

Files (3)
SKILL.md
4.5 KB
---
name: m06-error-handling
description: "CRITICAL: Use for error handling. Triggers: Result, Option, Error, ?, unwrap, expect, panic, anyhow, thiserror, when to panic vs return Result, custom error, error propagation, 错误处理, Result 用法, 什么时候用 panic"
user-invocable: false
---

# Error Handling

> **Layer 1: Language Mechanics**

## Core Question

**Is this failure expected or a bug?**

Before choosing error handling strategy:
- Can this fail in normal operation?
- Who should handle this failure?
- What context does the caller need?

---

## Error → Design Question

| Pattern | Don't Just Say | Ask Instead |
|---------|----------------|-------------|
| unwrap panics | "Use ?" | Is None/Err actually possible here? |
| Type mismatch on ? | "Use anyhow" | Are error types designed correctly? |
| Lost error context | "Add .context()" | What does the caller need to know? |
| Too many error variants | "Use Box<dyn Error>" | Is error granularity right? |

---

## Thinking Prompt

Before handling an error:

1. **What kind of failure is this?**
   - Expected → Result<T, E>
   - Absence normal → Option<T>
   - Bug/invariant → panic!
   - Unrecoverable → panic!

2. **Who handles this?**
   - Caller → propagate with ?
   - Current function → match/if-let
   - User → friendly error message
   - Programmer → panic with message

3. **What context is needed?**
   - Type of error → thiserror variants
   - Call chain → anyhow::Context
   - Debug info → anyhow or tracing

---

## Trace Up ↑

When error strategy is unclear:

```
"Should I return Result or Option?"
    ↑ Ask: Is absence/failure normal or exceptional?
    ↑ Check: m09-domain (what does domain say?)
    ↑ Check: domain-* (error handling requirements)
```

| Situation | Trace To | Question |
|-----------|----------|----------|
| Too many unwraps | m09-domain | Is the data model right? |
| Error context design | m13-domain-error | What recovery is needed? |
| Library vs app errors | m11-ecosystem | Who are the consumers? |

---

## Trace Down ↓

From design to implementation:

```
"Expected failure, library code"
    ↓ Use: thiserror for typed errors

"Expected failure, application code"
    ↓ Use: anyhow for ergonomic errors

"Absence is normal (find, get, lookup)"
    ↓ Use: Option<T>

"Bug or invariant violation"
    ↓ Use: panic!, assert!, unreachable!

"Need to propagate with context"
    ↓ Use: .context("what was happening")
```

---

## Quick Reference

| Pattern | When | Example |
|---------|------|---------|
| `Result<T, E>` | Recoverable error | `fn read() -> Result<String, io::Error>` |
| `Option<T>` | Absence is normal | `fn find() -> Option<&Item>` |
| `?` | Propagate error | `let data = file.read()?;` |
| `unwrap()` | Dev/test only | `config.get("key").unwrap()` |
| `expect()` | Invariant holds | `env.get("HOME").expect("HOME set")` |
| `panic!` | Unrecoverable | `panic!("critical failure")` |

## Library vs Application

| Context | Error Crate | Why |
|---------|-------------|-----|
| Library | `thiserror` | Typed errors for consumers |
| Application | `anyhow` | Ergonomic error handling |
| Mixed | Both | thiserror at boundaries, anyhow internally |

## Decision Flowchart

```
Is failure expected?
├─ Yes → Is absence the only "failure"?
│        ├─ Yes → Option<T>
│        └─ No → Result<T, E>
│                 ├─ Library → thiserror
│                 └─ Application → anyhow
└─ No → Is it a bug?
        ├─ Yes → panic!, assert!
        └─ No → Consider if really unrecoverable

Use ? → Need context?
├─ Yes → .context("message")
└─ No → Plain ?
```

---

## Common Errors

| Error | Cause | Fix |
|-------|-------|-----|
| `unwrap()` panic | Unhandled None/Err | Use `?` or match |
| Type mismatch | Different error types | Use `anyhow` or `From` |
| Lost context | `?` without context | Add `.context()` |
| `cannot use ?` | Missing Result return | Return `Result<(), E>` |

---

## Anti-Patterns

| Anti-Pattern | Why Bad | Better |
|--------------|---------|--------|
| `.unwrap()` everywhere | Panics in production | `.expect("reason")` or `?` |
| Ignore errors silently | Bugs hidden | Handle or propagate |
| `panic!` for expected errors | Bad UX, no recovery | Result |
| Box<dyn Error> everywhere | Lost type info | thiserror |

---

## Related Skills

| When | See |
|------|-----|
| Domain error strategy | m13-domain-error |
| Crate boundaries | m11-ecosystem |
| Type-safe errors | m05-type-driven |
| Mental models | m14-mental-model |

Overview

This skill codifies Rust error-handling guidance for deciding between Result, Option, panic, and error libraries. It focuses on practical choices: when to propagate, when to add context, and when panicking is appropriate. Use it to align code with library vs application boundaries and to avoid common anti-patterns.

How this skill works

The skill inspects failure semantics and the callsite responsibility to recommend a pattern: Option for normal absence, Result for recoverable errors, and panic/assert for invariant violations. It recommends crates (thiserror for libraries, anyhow for applications), use of ? for propagation, and .context() when additional context is needed. It also maps common mistakes to concrete fixes and traces decisions up and down the design stack.

When to use it

  • Choosing between Result<T, E> and Option<T> when designing APIs
  • Deciding whether to propagate errors with ? or handle them locally
  • Designing library boundaries and selecting thiserror vs anyhow
  • Adding context to propagated errors using .context()
  • Auditing code for unwrap/expect/panic anti-patterns

Best practices

  • Ask: is this failure expected, an absence, or a bug before picking a strategy
  • Use Option<T> when absence is a normal outcome; use Result<T, E> for recoverable failures
  • Library code: define typed errors with thiserror; application code: use anyhow for ergonomics
  • Propagate errors with ? and add .context("what happened") where caller needs more info
  • Avoid unwrap() in production; reserve panic!/assert! for true invariants or unrecoverable states

Example use cases

  • Implementing a crate API: define explicit thiserror variants for consumers
  • CLI application: use anyhow and .context() to present useful error messages to users
  • Refactoring code that panics on missing config: switch to Result and surface error to caller
  • Fixing unwrap-related panics by replacing unwrap() with ? or a match and a clear error
  • Choosing Box<dyn Error> vs typed errors: prefer typed errors unless granularity is unnecessary

FAQ

When should I use panic! instead of Result?

Use panic! only for violated invariants or unrecoverable bugs. If the failure can reasonably be handled or reported to a caller, return Result.

Should libraries use anyhow for errors?

Prefer thiserror for library boundaries so consumers get typed errors. You can use anyhow internally in applications for ergonomics.