home / skills / 0xdarkmatter / claude-mods / markitdown
This skill converts local documents to clean Markdown using markitdown for PDFs, Word, Excel, slides, OCR images, and audio.
npx playbooks add skill 0xdarkmatter/claude-mods --skill markitdownReview the files below or copy the command above to add this skill to your agents.
---
name: markitdown
description: "Convert local documents to Markdown using Microsoft's markitdown CLI. Best for: PDF, Word, Excel, PowerPoint, images (OCR), audio. Can fetch URLs but Jina is faster for web. Triggers on: convert to markdown, read PDF, parse document, extract text from, docx, xlsx, pptx, OCR image, local file."
compatibility: "Requires markitdown. Install: pip install markitdown"
allowed-tools: "Bash"
---
# markitdown - Document to Markdown
Convert local documents to clean Markdown. One tool for PDF, Word, Excel, PowerPoint, images, and more.
## When to Use markitdown
| Use Case | Recommendation |
|----------|----------------|
| **Local files (PDF, Word, Excel)** | ✅ **Use markitdown** - unique capability |
| **Web pages** | ❌ Use Jina (`r.jina.ai/`) - 5x faster |
| **Blocked/anti-bot sites** | ❌ Use Firecrawl |
| **OCR on images** | ✅ **Use markitdown** |
| **Audio transcription** | ✅ **Use markitdown** |
## Basic Usage
```bash
# Local files (primary use case)
markitdown document.pdf
markitdown report.docx
markitdown data.xlsx
markitdown slides.pptx
markitdown screenshot.png # OCR
# URLs (works, but Jina is faster)
markitdown https://example.com
# Save output
markitdown document.pdf > document.md
```
## Supported Formats
| Format | Extensions | Notes |
|--------|------------|-------|
| PDF | `.pdf` | Text extraction, tables |
| Word | `.docx` | Formatting preserved |
| Excel | `.xlsx` | Tables to markdown |
| PowerPoint | `.pptx` | Slides as sections |
| Images | `.jpg`, `.png` | OCR text extraction |
| HTML | `.html` | Clean conversion |
| Audio | `.mp3`, `.wav` | Speech-to-text |
| Text | `.txt`, `.csv`, `.json`, `.xml` | Pass-through/structure |
| URLs | `https://...` | Works but slower than Jina |
## Benchmarked Performance (URLs)
| Tool | Avg Speed | Success Rate |
|------|-----------|--------------|
| Jina | **0.5s** | 10/10 |
| markitdown | 2.5s | 9/10 |
| Firecrawl | 4.5s | 10/10 |
**Verdict**: For URLs, use Jina. For local files, markitdown is the only option.
## Examples
```bash
# PDF to markdown (primary use case)
markitdown report.pdf > report.md
# Excel spreadsheet
markitdown financials.xlsx
# Image with text (OCR)
markitdown screenshot.png
# PowerPoint deck
markitdown presentation.pptx > slides.md
# Audio transcription
markitdown meeting.mp3 > transcript.md
```
## Comparison with Alternatives
| Task | markitdown | Alternative |
|------|------------|-------------|
| PDF text | `markitdown file.pdf` | PyMuPDF, pdfplumber |
| Word docs | `markitdown file.docx` | python-docx |
| Excel | `markitdown file.xlsx` | pandas, openpyxl |
| OCR | `markitdown image.png` | Tesseract |
| Web pages | Use Jina instead | `r.jina.ai/URL` (5x faster) |
**markitdown's advantage**: One CLI for all local document formats. No code needed.
This skill converts local documents into clean, readable Markdown using the markitdown CLI. It handles PDFs, Word, Excel, PowerPoint, images (with OCR), and audio transcription without requiring code. Use it when you need fast, reliable local file conversion into Markdown for notes, publishing, or downstream processing.
markitdown inspects the input file type and applies format-specific extraction: text and tables from PDFs and Office files, slide sections from PPTX, OCR for images, and speech-to-text for audio. It outputs structured Markdown with headings, lists, and tables where appropriate. For web URLs it can fetch and convert pages, but local file handling is its core strength and is optimized for fidelity over raw crawling speed.
Can markitdown handle web pages as well as local files?
Yes, it can fetch and convert URLs, but converting web pages is slower than using a web-optimized tool like Jina; prefer markitdown for local files.
How accurate is the OCR and audio transcription?
OCR and transcription work well on clear inputs: high-resolution images and clean audio yield the best results. Complex layouts or noisy audio may require manual cleanup.