home / skills / anthropics / skills / webapp-testing

webapp-testing skill

/skills/webapp-testing

This skill helps you automate local web app testing with Playwright, manage servers, capture screenshots, and inspect dynamic pages for reliable UIs.

npx playbooks add skill anthropics/skills --skill webapp-testing

Review the files below or copy the command above to add this skill to your agents.

Files (6)
SKILL.md
3.8 KB
---
name: webapp-testing
description: Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs.
license: Complete terms in LICENSE.txt
---

# Web Application Testing

To test local web applications, write native Python Playwright scripts.

**Helper Scripts Available**:
- `scripts/with_server.py` - Manages server lifecycle (supports multiple servers)

**Always run scripts with `--help` first** to see usage. DO NOT read the source until you try running the script first and find that a customized solution is abslutely necessary. These scripts can be very large and thus pollute your context window. They exist to be called directly as black-box scripts rather than ingested into your context window.

## Decision Tree: Choosing Your Approach

```
User task → Is it static HTML?
    ├─ Yes → Read HTML file directly to identify selectors
    │         ├─ Success → Write Playwright script using selectors
    │         └─ Fails/Incomplete → Treat as dynamic (below)
    │
    └─ No (dynamic webapp) → Is the server already running?
        ├─ No → Run: python scripts/with_server.py --help
        │        Then use the helper + write simplified Playwright script
        │
        └─ Yes → Reconnaissance-then-action:
            1. Navigate and wait for networkidle
            2. Take screenshot or inspect DOM
            3. Identify selectors from rendered state
            4. Execute actions with discovered selectors
```

## Example: Using with_server.py

To start a server, run `--help` first, then use the helper:

**Single server:**
```bash
python scripts/with_server.py --server "npm run dev" --port 5173 -- python your_automation.py
```

**Multiple servers (e.g., backend + frontend):**
```bash
python scripts/with_server.py \
  --server "cd backend && python server.py" --port 3000 \
  --server "cd frontend && npm run dev" --port 5173 \
  -- python your_automation.py
```

To create an automation script, include only Playwright logic (servers are managed automatically):
```python
from playwright.sync_api import sync_playwright

with sync_playwright() as p:
    browser = p.chromium.launch(headless=True) # Always launch chromium in headless mode
    page = browser.new_page()
    page.goto('http://localhost:5173') # Server already running and ready
    page.wait_for_load_state('networkidle') # CRITICAL: Wait for JS to execute
    # ... your automation logic
    browser.close()
```

## Reconnaissance-Then-Action Pattern

1. **Inspect rendered DOM**:
   ```python
   page.screenshot(path='/tmp/inspect.png', full_page=True)
   content = page.content()
   page.locator('button').all()
   ```

2. **Identify selectors** from inspection results

3. **Execute actions** using discovered selectors

## Common Pitfall

❌ **Don't** inspect the DOM before waiting for `networkidle` on dynamic apps
✅ **Do** wait for `page.wait_for_load_state('networkidle')` before inspection

## Best Practices

- **Use bundled scripts as black boxes** - To accomplish a task, consider whether one of the scripts available in `scripts/` can help. These scripts handle common, complex workflows reliably without cluttering the context window. Use `--help` to see usage, then invoke directly. 
- Use `sync_playwright()` for synchronous scripts
- Always close the browser when done
- Use descriptive selectors: `text=`, `role=`, CSS selectors, or IDs
- Add appropriate waits: `page.wait_for_selector()` or `page.wait_for_timeout()`

## Reference Files

- **examples/** - Examples showing common patterns:
  - `element_discovery.py` - Discovering buttons, links, and inputs on a page
  - `static_html_automation.py` - Using file:// URLs for local HTML
  - `console_logging.py` - Capturing console logs during automation

Overview

This skill provides a toolkit for interacting with and testing local web applications using Playwright and helper scripts. It streamlines starting local servers, performing reconnaissance on rendered pages, capturing screenshots and logs, and running reliable automated checks. The package favors simple, synchronous Playwright scripts and includes ready-made helpers to manage server lifecycles.

How this skill works

The skill supplies a with_server.py helper to launch one or more local servers and run your Playwright automation as a child process. For dynamic apps the workflow is reconnaissance-then-action: navigate, wait for networkidle, inspect the rendered DOM (screenshots, page.content, locators), discover robust selectors, then execute interactions. Example scripts and patterns demonstrate launching Chromium headless, capturing console logs, and discovering elements.

When to use it

  • Testing local static HTML files or automation against file:// URLs
  • Automating and validating dynamic single-page apps served locally
  • Debugging UI behavior by capturing screenshots and browser console logs
  • Running end-to-end tests that require starting backend and frontend together
  • Exploring unknown pages to discover reliable selectors before scripting actions

Best practices

  • Run each helper script with --help first and use them as black-box tools when possible
  • Always wait for page.wait_for_load_state('networkidle') before inspecting DOM on dynamic apps
  • Use sync_playwright() for straightforward synchronous scripts
  • Prefer descriptive selectors (text=, role=, IDs, clear CSS selectors) over brittle XPaths
  • Always close the browser when your script completes to avoid resource leaks

Example use cases

  • Start frontend and backend together with with_server.py and run a smoke test that verifies login flow and takes screenshots
  • Run a reconnaissance script that captures full-page screenshots and page.content to identify selectors for later interaction scripts
  • Capture browser console logs during a test run to surface JavaScript errors and warnings
  • Automate a static HTML verification using file:// URLs to confirm accessibility attributes and link targets
  • Combine element_discovery.py with your automation to quickly map buttons, inputs, and links for larger test suites

FAQ

How do I start multiple servers for a full-stack app?

Use scripts/with_server.py with multiple --server and --port flags, then append -- and the command to run your Playwright script.

What if my selectors are flaky?

Perform reconnaissance after networkidle, capture page.content and screenshots, then choose stable selectors like role=, text=, or element IDs and add explicit waits with page.wait_for_selector().