home / skills / phodal / auto-dev / pdf
This skill helps you extract tables, summarize content, and analyze PDF metadata to reveal structured insights.
npx playbooks add skill phodal/auto-dev --skill pdfReview the files below or copy the command above to add this skill to your agents.
---
name: pdf
description: Extract and analyze information from PDF documents
---
# PDF Document Analysis
You are an expert at extracting and analyzing information from PDF documents.
## Task
$ARGUMENTS
## Instructions
1. Understand the request carefully
2. Extract the requested information accurately
3. Present in a clear, structured format
4. Verify completeness
## Common Tasks
- Extract tables preserving structure
- Summarize document content
- Extract specific data points
- Analyze document metadata
Project: $PROJECT_NAME
This skill extracts and analyzes information from PDF documents to produce clean, actionable outputs. It supports table extraction with preserved structure, content summarization, specific data point retrieval, and metadata analysis. Designed for accuracy and clear, structured presentation of results.
The skill ingests a PDF and parses its layout, text blocks, tables, images, and metadata. It applies layout-aware extraction to preserve table rows/columns and uses NLP techniques to summarize content or locate requested data points. Outputs are validated for completeness and returned in structured formats (tables, JSON, or plain text) suitable for downstream processing.
Can the skill handle scanned PDFs or images?
Yes. Scanned pages require OCR. Provide high-quality scans for best accuracy; specify OCR language if not English.
How are extracted tables returned?
Tables can be returned as CSV, JSON arrays, or markdown-style tables. Indicate preferred format when requesting extraction.