home / skills / openclaw / skills / image-ocr
This skill extracts text from images using Tesseract OCR, supporting multiple languages and common formats to enable quick content digitization.
npx playbooks add skill openclaw/skills --skill image-ocrReview the files below or copy the command above to add this skill to your agents.
---
name: image-ocr
description: "Extract text from images using Tesseract OCR"
metadata:
{
"openclaw":
{
"emoji": "👁️",
"requires": { "bins": ["tesseract"] },
"install":
[
{
"id": "dnf",
"kind": "dnf",
"package": "tesseract",
"bins": ["tesseract"],
"label": "Install via dnf",
},
],
},
}
---
# Image OCR
Extract text from images using Tesseract OCR. Supports multiple languages and image formats including PNG, JPEG, TIFF, and BMP.
## Commands
```bash
# Extract text from an image (default: English)
image-ocr "screenshot.png"
# Extract text with a specific language
image-ocr "document.jpg" --lang eng
```
## Install
```bash
sudo dnf install tesseract
```
This skill extracts text from images using Tesseract OCR. It supports multiple image formats (PNG, JPEG, TIFF, BMP) and can run OCR in different languages. The tool is lightweight and designed for batch or single-image text extraction in automation pipelines.
The skill calls Tesseract OCR on supplied image files and returns the recognized text. You can pass a language code to improve accuracy for non-English content. It accepts common image formats and can be invoked for single files or scripted to process folders in bulk.
What image formats are supported?
Common formats are supported: PNG, JPEG, TIFF, and BMP.
How do I improve OCR accuracy for non-English text?
Install the appropriate Tesseract language data and pass the corresponding language code (for example, --lang eng for English).