home / skills / coowoolf / insighthunt-skills / long-horizon-holdout
This skill helps product teams implement long-horizon holdout experiments to verify sustained value beyond short-term wins.
npx playbooks add skill coowoolf/insighthunt-skills --skill long-horizon-holdoutReview the files below or copy the command above to add this skill to your agents.
---
name: long-horizon-holdout
description: Use when running experiments on platforms where user value compounds over time, when growth teams claim credit for revenue that would have happened anyway, or when short-term wins don't translate to long-term value
---
# The Long-Horizon Holdout Protocol
## Overview
A measurement framework that challenges short-term growth wins by maintaining **long-term control groups (holdouts)** to verify if immediate lifts translate to sustainable value over 1-3 years.
**Core principle:** 30-40% of short-term "wins" show neutral or zero impact long-term.
## The Process
```
┌─────────────────────────────────────────────────────────────────┐
│ T-0 (Launch) │
│ Split Traffic: 90% Treatment / 10% Long-term Holdout │
├─────────────────────────────────────────────────────────────────┤
│ T+3 Weeks │
│ Initial Decision: Ship if short-term positive/neutral │
│ Keep holdout running │
├─────────────────────────────────────────────────────────────────┤
│ T+6 Months │
│ Automated Ping: Check retention and emerging GMV patterns │
├─────────────────────────────────────────────────────────────────┤
│ T+12-18 Months │
│ Final Reckoning: Compare Cohort GMV │
│ If neutral/negative: deprecate feature or pivot strategy │
└─────────────────────────────────────────────────────────────────┘
```
## Key Principles
| Principle | Description |
|-----------|-------------|
| **Long holdout** | Keep control group for 12-18 months |
| **Automated pings** | System reminds team to check at 6, 12 months |
| **Cohort GMV focus** | Long-term value, not short-term conversion |
| **Accept reversals** | Be willing to deprecate "successful" features |
## Common Mistakes
- Declaring victory after 2 weeks of significance
- Assuming neutral short-term = failure (might compound later)
- Not setting up infrastructure for long-term tracking
---
*Source: Archie Abrams (Shopify VP Product & Growth) via Lenny's Podcast*
This skill implements the Long-Horizon Holdout protocol to validate whether short-term product or growth lifts produce sustainable user value over 12–18 months. It codifies a 90/10 treatment/long-term holdout split, automated checkpoints, and a final cohort-based revenue reckoning to avoid false positives. Use it to reduce attribution error and prevent shipping features that only deliver transient gains.
The skill provisions a persistent control group that remains excluded from the treatment for up to 12–18 months while the treatment sees wide exposure. It schedules automated pings at set intervals (3 weeks, 6 months, 12–18 months) to capture early signals and long-term cohort GMV and retention metrics. The final decision combines cohort-level revenue and retention comparisons, with clear criteria to deprecate or iterate on treatments that fail to deliver durable value.
How large should the long-term holdout be?
A common choice is 10% of traffic; choose a size that balances statistical power with the cost of withholding treatment.
What if business needs force early rollout?
Document the trade-offs, run an accelerated monitoring cadence, and keep a smaller persistent holdout to preserve a long-term baseline.