home / skills / benchflow-ai / skillsbench / flood-detection

This skill detects flood events by comparing water levels to thresholds, aggregates daily maxima, and reports flood days and severity for actionable insights.

npx playbooks add skill benchflow-ai/skillsbench --skill flood-detection

Review the files below or copy the command above to add this skill to your agents.

Files (1)
SKILL.md
3.6 KB
---
name: flood-detection
description: Detect flood events by comparing water levels to thresholds. Use when determining if flooding occurred, counting flood days, aggregating instantaneous data to daily values, or classifying flood severity.
license: MIT
---

# Flood Detection Guide

## Overview

Flood detection involves comparing observed water levels against established flood stage thresholds. This guide covers how to process water level data and identify flood events.

## Flood Stage Definition

According to the National Weather Service, flood stage is the water level at which overflow of the natural banks begins to cause damage. A flood event occurs when:
```
water_level >= flood_stage_threshold
```

## Aggregating Instantaneous Data to Daily

USGS instantaneous data is recorded at ~15-minute intervals. For flood detection, aggregate to daily maximum:
```python
# df is DataFrame from nwis.get_iv() with datetime index
# gage_col is the column name containing water levels

daily_max = df[gage_col].resample('D').max()
```

### Why Daily Maximum?

| Aggregation | Use Case |
|-------------|----------|
| `max()` | Flood detection - captures peak water level |
| `mean()` | Long-term trends - may miss short flood peaks |
| `min()` | Low flow analysis |

## Detecting Flood Days

Compare daily maximum water level against flood threshold:
```python
flood_threshold = <threshold_from_nws>  # feet

# Count days with flooding
flood_days = (daily_max >= flood_threshold).sum()

# Get specific dates with flooding
flood_dates = daily_max[daily_max >= flood_threshold].index.tolist()
```

## Processing Multiple Stations
```python
flood_results = []

for site_id, site_data in all_data.items():
    daily_max = site_data['water_levels'].resample('D').max()
    threshold = thresholds[site_id]['flood']

    days_above = int((daily_max >= threshold).sum())

    if days_above > 0:
        flood_results.append({
            'station_id': site_id,
            'flood_days': days_above
        })

# Sort by flood days descending
flood_results.sort(key=lambda x: x['flood_days'], reverse=True)
```

## Flood Severity Classification

If multiple threshold levels are available:
```python
def classify_flood(water_level, thresholds):
    if water_level >= thresholds['major']:
        return 'major'
    elif water_level >= thresholds['moderate']:
        return 'moderate'
    elif water_level >= thresholds['flood']:
        return 'minor'
    elif water_level >= thresholds['action']:
        return 'action'
    else:
        return 'normal'
```

## Output Format Examples

### Simple CSV Output
```python
import csv

with open('flood_results.csv', 'w', newline='') as f:
    writer = csv.writer(f)
    writer.writerow(['station_id', 'flood_days'])
    for result in flood_results:
        writer.writerow([result['station_id'], result['flood_days']])
```

### JSON Output
```python
import json

output = {
    'flood_events': flood_results,
    'total_stations_with_flooding': len(flood_results)
}

with open('flood_report.json', 'w') as f:
    json.dump(output, f, indent=2)
```

## Common Issues

| Issue | Cause | Solution |
|-------|-------|----------|
| No floods detected | Threshold too high or dry period | Verify threshold values |
| All days show flooding | Threshold too low or data error | Check threshold units (feet vs meters) |
| NaN in daily_max | Missing data for entire day | Check data availability |

## Best Practices

- Use daily maximum for flood detection to capture peaks
- Ensure water level and threshold use same units (typically feet)
- Only report stations with at least 1 flood day
- Sort results by flood severity or duration for prioritization

Overview

This skill detects flood events by comparing observed water levels to defined flood-stage thresholds. It aggregates high-frequency instantaneous measurements to daily maxima, counts flood days per station, and can classify flood severity when multiple thresholds are available. Results are returned in simple tabular or JSON-friendly formats for reporting and prioritization.

How this skill works

The skill resamples sub-daily water-level time series to daily maximum values to capture peak conditions. It then compares each daily maximum to the configured flood-stage threshold (and optional action/moderate/major thresholds) to identify flood days and assign severity classes. For multiple stations it tallies flood-days per station and can sort or filter outputs to include only stations with at least one flood day.

When to use it

  • Determining whether flooding occurred during a period of interest
  • Counting days with water levels at or above flood stage
  • Aggregating instantaneous USGS-style data (~15-min) into daily values
  • Classifying flood severity when multiple threshold levels exist
  • Prioritizing stations by flood duration or severity for response planning

Best practices

  • Aggregate to daily maximum to capture peak flood conditions rather than daily mean or min
  • Ensure water-level units match threshold units (commonly feet) before comparison
  • Report only stations with at least one flood day to reduce noise in summaries
  • Sort results by flood days or severity to prioritize impacts and response
  • Validate threshold values against authoritative sources (e.g., National Weather Service) and check for data gaps

Example use cases

  • Count the number of flood days at a single gage for a given month using daily maxima
  • Process multiple station time series to produce a ranked list of stations by flood-days
  • Classify each flood day as action/minor/moderate/major for impact assessment
  • Export flood summary as CSV or JSON for downstream dashboards or incident reports
  • Detect floods in historical high-frequency datasets by converting to daily peaks and applying thresholds

FAQ

How are flood days defined?

A flood day is any day where the daily maximum water level is greater than or equal to the configured flood-stage threshold.

Why use daily maximum instead of daily mean?

Daily maximum captures short-duration peak levels that cause flooding; daily mean can miss brief but damaging peaks.

What if my results show no floods or all days flooded?

Check threshold values and units, and inspect data availability for gaps. A too-high threshold or missing data can hide floods; a too-low threshold or unit mismatch can produce false positives.