home / skills / ed3dai / ed3d-plugins / property-based-testing

property-based-testing skill

safe

/plugins/ed3d-house-style/skills/property-based-testing

This skill helps you apply property-based testing to validate serialization, normalization, and pure functions by asserting key invariants across inputs.

npx playbooks add skill ed3dai/ed3d-plugins --skill property-based-testing

Review the files below or copy the command above to add this skill to your agents.

Files (1)

SKILL.md

5.2 KB

---
name: property-based-testing
description: Use when writing tests for serialization, validation, normalization, or pure functions - provides property catalog, pattern detection, and library reference for property-based testing
user-invocable: false
---

# Property-Based Testing

## Overview

Property-based testing (PBT) generates random inputs and verifies that properties hold for all of them. Instead of testing specific examples, you test invariants.

**When PBT beats example-based tests:**
- Serialization pairs (encode/decode)
- Pure functions with clear contracts
- Validators and normalizers
- Data structure operations

## Property Catalog

| Property | Formula | When to Use |
|----------|---------|-------------|
| **Roundtrip** | `decode(encode(x)) == x` | Serialization, conversion pairs |
| **Idempotence** | `f(f(x)) == f(x)` | Normalization, formatting, sorting |
| **Invariant** | Property holds before/after | Any transformation |
| **Commutativity** | `f(a, b) == f(b, a)` | Binary/set operations |
| **Associativity** | `f(f(a,b), c) == f(a, f(b,c))` | Combining operations |
| **Identity** | `f(x, identity) == x` | Operations with neutral element |
| **Inverse** | `f(g(x)) == x` | encrypt/decrypt, compress/decompress |
| **Oracle** | `new_impl(x) == reference(x)` | Optimization, refactoring |
| **Easy to Verify** | `is_sorted(sort(x))` | Complex algorithms |
| **No Exception** | No crash on valid input | Baseline (weakest) |

**Strength hierarchy** (weakest to strongest):
```
No Exception -> Type Preservation -> Invariant -> Idempotence -> Roundtrip
```

Always aim for the strongest property that applies.

## Pattern Detection

**Use PBT when you see:**

| Pattern | Property | Priority |
|---------|----------|----------|
| `encode`/`decode`, `serialize`/`deserialize` | Roundtrip | HIGH |
| `toJSON`/`fromJSON`, `pack`/`unpack` | Roundtrip | HIGH |
| Pure functions with clear contracts | Multiple | HIGH |
| `normalize`, `sanitize`, `canonicalize` | Idempotence | MEDIUM |
| `is_valid`, `validate` with normalizers | Valid after normalize | MEDIUM |
| Sorting, ordering, comparators | Idempotence + ordering | MEDIUM |
| Custom collections (add/remove/get) | Invariants | MEDIUM |
| Builder/factory patterns | Output invariants | LOW |

## When NOT to Use

- Simple CRUD without transformation logic
- UI/presentation logic
- Integration tests requiring complex external setup
- Code with side effects that cannot be isolated
- Prototyping where requirements are fluid
- Tests where specific examples suffice and edge cases are understood

## Library Quick Reference

| Language | Library | Import |
|----------|---------|--------|
| Python | Hypothesis | `from hypothesis import given, strategies as st` |
| TypeScript/JS | fast-check | `import fc from 'fast-check'` |
| Rust | proptest | `use proptest::prelude::*` |
| Go | rapid | `import "pgregory.net/rapid"` |
| Java | jqwik | `@Property` annotations |
| Haskell | QuickCheck | `import Test.QuickCheck` |

**For library-specific syntax and patterns:** Use `@ed3d-research-agents:internet-researcher` to get current documentation.

## Input Strategy Best Practices

1. **Constrain early:** Build constraints INTO the strategy, not via `assume()`
   ```python
   # GOOD
   st.integers(min_value=1, max_value=100)

   # BAD - high rejection rate
   st.integers().filter(lambda x: 1 <= x <= 100)
   ```

2. **Size limits:** Prevent slow tests
   ```python
   st.lists(st.integers(), max_size=100)
   st.text(max_size=1000)
   ```

3. **Realistic data:** Match real-world constraints
   ```python
   st.integers(min_value=0, max_value=150)  # Real ages, not arbitrary ints
   ```

4. **Reuse strategies:** Define once, use across tests
   ```python
   valid_users = st.builds(User, ...)

   @given(valid_users)
   def test_one(user): ...

   @given(valid_users)
   def test_two(user): ...
   ```

## Settings Guide

```python
# Development (fast feedback)
@settings(max_examples=10)

# CI (thorough)
@settings(max_examples=200)

# Nightly/Release (exhaustive)
@settings(max_examples=1000, deadline=None)
```

## Quality Checklist

Before committing PBT tests:

- [ ] Not tautological (assertion doesn't compare same expression)
- [ ] Strong assertion (not just "no crash")
- [ ] Not vacuous (inputs not over-filtered by `assume()`)
- [ ] Edge cases covered with explicit examples (`@example`)
- [ ] No reimplementation of function logic in assertion
- [ ] Strategy constraints are realistic
- [ ] Settings appropriate for context

## Red Flags

- **Tautological:** `assert sorted(xs) == sorted(xs)` tests nothing
- **Only "no crash":** Always look for stronger properties
- **Vacuous:** Multiple `assume()` calls filter out most inputs
- **Reimplementation:** `assert add(a, b) == a + b` if that's how add is implemented
- **Missing edge cases:** No `@example([])`, `@example([1])` decorators
- **Overly constrained:** Many `assume()` calls means redesign the strategy

## Common Mistakes

| Mistake | Fix |
|---------|-----|
| Testing mock behavior | Test real behavior |
| Reimplementing function in test | Use algebraic properties |
| Filtering with assume() | Build constraints into strategy |
| No edge case examples | Add @example decorators |
| One property only | Add multiple properties (length, ordering, etc.) |

Overview

This skill helps you apply property-based testing (PBT) when writing tests for serialization, validation, normalization, and pure functions. It provides a compact property catalog, pattern detection guidance, library references, and practical input and settings best practices. Use it to find the strongest meaningful properties and avoid common PBT pitfalls.

How this skill works

The skill inspects function signatures, naming patterns (encode/decode, normalize, sort, validate), and test intent to recommend appropriate properties (roundtrip, idempotence, invariant, commutativity, etc.). It suggests input strategies, Hypothesis/fast-check snippets, and test settings for development, CI, and nightly runs. It flags red flags like tautological assertions, vacuous tests, overuse of assume(), and accidental reimplementation.

When to use it

Testing serialization/deserialization or encode/decode pairs (roundtrip)
Verifying normalizers, sanitizers, or canonicalizers (idempotence)
Checking pure functions with algebraic contracts (commutativity, associativity)
Validating validators/normalizers where output must satisfy an invariant
Testing data structure operations (add/remove/get invariants)

Best practices

Constrain inputs in the strategy, avoid heavy use of assume()
Limit sizes (max_size, max_length) to prevent slow tests
Reuse shared strategies for consistent, maintainable tests
Prefer the strongest applicable property (roundtrip > idempotence)
Add explicit edge-case @example tests alongside generated cases

Example use cases

Roundtrip test for JSON: decode(encode(x)) == x for domain types
Idempotence for a normalize() function: normalize(normalize(x)) == normalize(x)
Invariant for collections: remove(add(coll, x), x) leaves coll unchanged
Oracle test: new_impl(x) == reference_impl(x) across generated inputs
No-crash baseline for inputs that must never raise exceptions

FAQ

When is property-based testing a poor fit?

Avoid PBT for simple CRUD with no transformation logic, UI rendering, integration tests with complex external setup, or highly stateful code you cannot isolate.

How do I avoid flaky or vacuous PBT tests?

Build constraints into strategies instead of using assume(), keep realistic data ranges and sizes, and include explicit examples for edge cases so tests are meaningful.