
This skill explains ML predictions with SHAP values, providing feature attribution, visualization, and debugging insights for interpretable models.

npx playbooks add skill eyadsibai/ltk --skill shap

Review the files below or copy the command above to add this skill to your agents.

Files (1)
SKILL.md
4.2 KB
---
name: shap
description: Use when "SHAP", "Shapley values", "feature importance", "model explainability", or asking about "explain predictions", "interpretable ML", "feature attribution", "waterfall plot", "beeswarm plot", "model debugging"
version: 1.0.0
---

<!-- Adapted from: claude-scientific-skills/scientific-skills/shap -->

# SHAP Model Explainability

Explain ML predictions using Shapley values - feature importance and attribution.

## When to Use

- Explain why a model made specific predictions
- Calculate feature importance with attribution
- Debug model behavior and validate predictions
- Create interpretability plots (waterfall, beeswarm, bar)
- Analyze model fairness and bias

## Quick Start

```python
import shap
import xgboost as xgb

# Train model
model = xgb.XGBClassifier().fit(X_train, y_train)

# Create explainer
explainer = shap.TreeExplainer(model)

# Compute SHAP values
shap_values = explainer(X_test)

# Visualize
shap.plots.beeswarm(shap_values)
```

## Choose Explainer

```python
# Tree-based models (XGBoost, LightGBM, RF) - FAST
explainer = shap.TreeExplainer(model)

# Deep learning (TensorFlow, PyTorch); background_data is a sample of training inputs
explainer = shap.DeepExplainer(model, background_data)

# Linear models
explainer = shap.LinearExplainer(model, X_train)

# Any model (slower but universal)
explainer = shap.KernelExplainer(model.predict, X_train[:100])

# Auto-select best explainer
explainer = shap.Explainer(model)
```

## Compute SHAP Values

```python
# Compute for test set
shap_values = explainer(X_test)

# Access components
shap_values.values      # SHAP values (feature attributions)
shap_values.base_values # Expected model output (baseline)
shap_values.data        # Original feature values
```

## Visualizations

### Global Feature Importance

```python
# Beeswarm - shows distribution and importance
shap.plots.beeswarm(shap_values)

# Bar - clean summary
shap.plots.bar(shap_values)
```

### Individual Predictions

```python
# Waterfall - breakdown of single prediction
shap.plots.waterfall(shap_values[0])

# Force - additive visualization
shap.plots.force(shap_values[0])
```

### Feature Relationships

```python
# Scatter - feature vs SHAP value
shap.plots.scatter(shap_values[:, "feature_name"])

# With interaction coloring
shap.plots.scatter(shap_values[:, "Age"], color=shap_values[:, "Income"])
```

### Heatmap (Multiple Samples)

```python
shap.plots.heatmap(shap_values[:100])
```

## Common Patterns

### Complete Analysis

```python
import shap

# 1. Create explainer and compute
explainer = shap.TreeExplainer(model)
shap_values = explainer(X_test)

# 2. Global importance
shap.plots.beeswarm(shap_values)

# 3. Top feature relationships
shap.plots.scatter(shap_values[:, "top_feature"])

# 4. Individual explanation
shap.plots.waterfall(shap_values[0])
```

### Compare Groups

```python
# Compare feature importance across groups
group_a = X_test['category'] == 'A'
group_b = X_test['category'] == 'B'

shap.plots.bar({
    "Group A": shap_values[group_a],
    "Group B": shap_values[group_b]
})
```

### Debug Errors

```python
import numpy as np

# Find misclassified samples
errors = model.predict(X_test) != y_test
error_idx = np.where(errors)[0]

# Explain why they failed
for idx in error_idx[:5]:
    shap.plots.waterfall(shap_values[idx])
```

## Interpret Values

- **Positive SHAP** → Feature pushes prediction higher
- **Negative SHAP** → Feature pushes prediction lower
- **Magnitude** → Strength of impact
- **Sum of SHAP values** = Prediction - Baseline

```
Baseline: 0.30
Age: +0.15
Income: +0.10
Education: -0.05
Prediction: 0.30 + 0.15 + 0.10 - 0.05 = 0.50
```
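
A quick way to confirm this additivity in code is to compare the baseline plus the summed SHAP values against the model's raw prediction. A minimal sketch, assuming `shap_values` comes from a single-output (e.g., binary) model explained on the raw score:

```python
import numpy as np

# Additivity check for one sample: baseline plus the summed attributions
sample = shap_values[0]
reconstruction = sample.base_values + np.sum(sample.values)
print(f"baseline + SHAP sum = {reconstruction:.4f}")
# For raw (log-odds/margin) output this matches the model's raw prediction
# for that sample, up to small numerical error.
```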

## Best Practices

1. **Use TreeExplainer** for tree models (fast, exact)
2. **Use 100-1000 background samples** for KernelExplainer
3. **Start global** (beeswarm) then go **local** (waterfall)
4. **Check model output type** (probability vs log-odds; see the sketch below)
5. **Validate with domain knowledge**
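
For practice 4, a minimal sketch of requesting SHAP values on the probability scale for a tree model (assumes an XGBoost-style classifier and that `X_train` is available to serve as a background sample; interventional perturbation is required for probability output):

```python
import shap

# Sketch: explain on the probability scale instead of raw log-odds.
# A background dataset with interventional perturbation is required here.
explainer = shap.TreeExplainer(
    model,
    data=X_train[:200],
    feature_perturbation="interventional",
    model_output="probability",
)
shap_values = explainer(X_test)  # attributions now sum to probability minus baseline
```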

## vs Alternatives

| Tool | Best For |
|------|----------|
| **SHAP** | Theoretically grounded, all model types |
| LIME | Quick local explanations |
| Feature Importance | Simple tree-based importance |

## Resources

- Docs: <https://shap.readthedocs.io/>
- Paper: Lundberg & Lee (2017) "A Unified Approach to Interpreting Model Predictions"

Overview

This skill explains machine learning predictions using SHAP (Shapley values) to produce feature attributions, global importance, and per‑prediction breakdowns. It helps you interpret black‑box and white‑box models, generate common plots (beeswarm, waterfall, bar, force), and debug or validate model behavior.

How this skill works

The skill creates an explainer suited to the model type (TreeExplainer, DeepExplainer, LinearExplainer, KernelExplainer, or the automatic shap.Explainer) and computes SHAP values for your samples. SHAP values decompose each prediction into a baseline plus additive contributions from individual features; the plotting functions then summarize distributions, individual impacts, and feature interactions.
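
A minimal sketch of that flow, assuming a trained tree-based model named model and a held-out test set X_test:

```python
import shap

explainer = shap.Explainer(model)      # auto-selects a suitable algorithm for the model
shap_values = explainer(X_test)        # Explanation object: .values, .base_values, .data
shap.plots.waterfall(shap_values[0])   # baseline plus additive feature contributions
```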

When to use it

  • Explaining why a model produced a specific prediction (local explanation)
  • Quantifying global feature importance and ranking drivers of predictions
  • Debugging model errors and investigating unexpected behavior
  • Validating model fairness or comparing feature effects across groups
  • Creating interpretability visualizations for reports or stakeholder review

Best practices

  • Choose explainer by model type: TreeExplainer for tree models, Deep/Linear for neural/linear models, KernelExplainer when necessary
  • Use a representative background/baseline (100–1000 samples) for sampling-based explainers
  • Start with global views (beeswarm or bar) then drill into local explanations (waterfall, force) for individual cases
  • Confirm the model output type (probabilities vs log‑odds) and interpret SHAP values accordingly
  • Cross‑check explanations with domain knowledge and known feature behavior

Example use cases

  • Create a beeswarm and bar plot to summarize top predictors driving model outputs across a test set
  • Use waterfall plots to explain why a loan application was approved or denied for a single applicant
  • Compare SHAP summaries for two demographic groups to surface potential bias or distributional differences
  • Inspect misclassified samples with waterfall plots to identify systematic model errors
  • Plot feature interactions using scatter plots colored by another feature to reveal nonlinear dependencies

FAQ

Do I need access to training data to compute SHAP values?

Some explainers require representative background data (e.g., KernelExplainer, DeepExplainer). TreeExplainer can usually compute values directly from the model's internal tree structure, without a separate background dataset.
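
A minimal sketch of supplying background data to KernelExplainer, assuming a fitted classifier named model and a training feature matrix X_train (shap.kmeans condenses the background so the explainer stays tractable):

```python
import shap

# Summarize the training data into a compact background set
background = shap.kmeans(X_train, 50)        # or: shap.sample(X_train, 100)

explainer = shap.KernelExplainer(model.predict_proba, background)
shap_values = explainer.shap_values(X_test[:20])  # Kernel SHAP is slow; explain a small batch
```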

Which explainer is fastest and most accurate for tree models?

TreeExplainer is both fast and exact for common tree ensembles (XGBoost, LightGBM, RandomForest) and should be the default for those models.