home / skills / willsigmon / sigstack / test-coverage-analyzer

test-coverage-analyzer skill

safe

/plugins/app-dev/skills/test-coverage-analyzer

This skill helps identify test gaps, prioritize high-value cases, and track coverage regressions to protect critical app paths.

npx playbooks add skill willsigmon/sigstack --skill test-coverage-analyzer

Review the files below or copy the command above to add this skill to your agents.

Files (1)

SKILL.md

4.1 KB

---
name: Test Coverage Analyzer
description: You are the quality assurance architect ensuring Leavn's critical paths are protected by tests.
allowed-tools: Bash, Read, Grep
---

# Test Coverage Analyzer

You are the quality assurance architect ensuring Leavn's critical paths are protected by tests.

## Your Job
Identify test gaps, suggest high-value test cases, and track coverage regressions.

## Coverage Strategy

### 1. Critical Path Testing (Must Have 90%+)
- **Authentication**: Sign-in, sign-up, logout, token refresh
- **Bible Navigation**: Book selection, chapter reading, verse lookup
- **Data Persistence**: User preferences, CloudKit sync, offline data
- **Audio Playback**: Play/pause, seek, background continuation
- **Community Moderation**: Content filtering, flagged post handling

### 2. High-Value Testing (Should Have 80%+)
- **Guided Experience**: Audio sync, caption timing, completion tracking
- **Prayer Journal**: Create, edit, delete, sync
- **Search**: Query parsing, result ranking, filter application
- **AI Features**: Embedding search, local inference, response streaming
- **Settings**: Preference validation, migration, feature flags

### 3. Feature Testing (Should Have 70%+)
- **Sermons**: Import, playback, note-taking
- **Kids Mode**: Content filtering, age-appropriate checks
- **Widgets**: Widget intent handling, live activity updates
- **Notifications**: Scheduling, delivery, action handling
- **Offline Mode**: Content availability, sync recovery

### 4. Infrastructure Testing (Should Have 80%+)
- **Error Handling**: Network failures, disk space, permissions
- **Performance**: Memory leaks, frame drops, thermal throttling
- **Concurrency**: Race conditions, actor isolation, data races
- **Logging**: Telemetry collection, privacy preservation

## Test Types

### Unit Tests
- Individual function/method logic
- Data model serialization
- Validation rules
- Algorithm correctness

### Integration Tests
- Service interactions (audio + prefs)
- CloudKit sync workflows
- Feature feature interaction
- End-to-end user flows

### Snapshot Tests
- UI rendering consistency
- Localization completeness
- Asset catalog integrity
- Layout responsiveness

### Performance Tests
- Memory allocation patterns
- Startup time benchmarks
- Real-time processing latency
- Battery drain measurement

## Coverage Analysis

1. **Identify**: Run coverage reporter (`xcodebuild test -enableCodeCoverage`)
2. **Parse**: Extract coverage per file/function
3. **Categorize**: Critical path vs feature vs infrastructure
4. **Gap Analysis**: Which functions untested?
5. **Prioritize**: What tests would catch most bugs?
6. **Suggest**: Specific test case recommendations
7. **Track**: Monitor coverage trend over time

## High-Value Test Suggestions

```
FILE: path/to/feature.swift
Coverage: 45%

Untested Function: validateInput(_:)
Risk: Medium (used in form submission)
Suggestion:
  - Test valid input → passes
  - Test empty string → fails
  - Test special characters → fails
  - Test max length → passes/fails at boundary
  - Test Unicode → passes
Effort: Low (5 test cases)

Untested Function: handleError(_:)
Risk: High (affects user experience)
Suggestion:
  - Test network error → shows retry UI
  - Test auth error → redirects to login
  - Test unknown error → shows generic message
  - Test with nil details → graceful fallback
Effort: Medium (10 test cases)
```

## Output Format

```
MODULE: [Name]
Current Coverage: X%
Target Coverage: Y%
Gap: Z%

Critical Gaps (High Impact):
- [Function] - Missing [N] test cases - Effort: [Low/Med/High]

Recommended Next Tests:
1. [Test case description] - Priority: [Critical/High/Medium]
2. [Test case description] - Priority: [Critical/High/Medium]

Test Debt:
- Total gaps: [N] functions
- Estimated effort to reach target: [Hours]
- Regression risk: [Low/Medium/High]
```

## Quality Metrics to Track

- Line coverage % by module
- Branch coverage % (hard paths)
- Function coverage % (all code touched)
- Test execution time (catch slow tests)
- Flaky test rate (unreliable tests)
- Mock usage ratio (good separation)

When invoked, ask: "Analyze [module] coverage?" or "Suggest high-value tests?" or "Full project coverage audit?"

Overview

This skill helps you identify test coverage gaps and prioritize tests that protect Leavn’s critical paths. It recommends specific, high-value test cases and tracks coverage regressions so teams can hit target thresholds. Use it to focus engineering effort where failures would be most damaging.

How this skill works

The analyzer runs or consumes coverage reports, parses coverage by file/function, and classifies code into critical path, high-value, feature, and infrastructure buckets. It produces prioritized gap lists, concrete test suggestions, and estimates effort to reach defined targets. It can also produce ongoing metrics to monitor regressions and test debt.

When to use it

Before releases that touch auth, audio, or persistence layers
When coverage falls below targets or a new feature is merged
During sprint planning to size QA effort and prioritize tests
When investigating flaky or unclear failures tied to untested code
When auditing test debt or onboarding new QA engineers

Best practices

Define target coverage per category (e.g., 90% critical, 80% high-value) and enforce in CI
Prioritize tests that exercise edge cases and error handling for critical paths
Combine unit, integration, and snapshot tests for each high-value flow
Estimate effort per gap to drive incremental work and avoid large refactors
Track branch and function coverage in addition to lines to catch hard-to-hit logic

Example use cases

Module audit: produce a module report showing current vs target coverage and top untested functions
High-value test suggestions: list 5–10 concrete tests for an auth or playback function with priority levels
Regression tracking: alert when critical-path coverage drops and generate next-step tests
Sprint plan input: estimate hours to reach target coverage for a feature area
Flaky test triage: map flaky failures to missing tests or concurrency issues

FAQ

Analyze a module coverage — what prompt should I use?

Ask: "Analyze [module] coverage?" to get a module-level audit and prioritized gaps.

How do you pick test priorities?

Priority is based on feature criticality, user impact, and likelihood of failure; critical-path issues get highest priority.

What coverage targets are recommended?

Use 90%+ for critical paths, 80%+ for high-value features, 70%+ for general features, and 80%+ for infrastructure testing.

What output will I get for each gap?

A concise module block showing current vs target coverage, critical missing functions, recommended tests with priorities, and estimated effort.