ocrmypdf-image
OCRmyPDF image processing skill — deskew, rotate, clean, despeckle, remove border from scanned documents. Use when the user needs to improve scanned PDF quality, fix skewed pages, remove noise, or clean up scanned documents before OCR.
Best use case
ocrmypdf-image is best used when you need a repeatable AI agent workflow for cleaning up scanned PDFs instead of a one-off prompt.
Users should expect a more consistent workflow output, faster repeated execution, and less time spent rewriting prompts from scratch.
Practical example
Example input
Use the "ocrmypdf-image" skill to help with this workflow task. Context: OCRmyPDF image processing skill — deskew, rotate, clean, despeckle, remove border from scanned documents. Use when the user needs to improve scanned PDF quality, fix skewed pages, remove noise, or clean up scanned documents before OCR.
Example output
A structured workflow result with clearer steps, more consistent formatting, and an output that is easier to reuse in the next run.
When to use this skill
- Use this skill when you want a reusable workflow rather than writing the same prompt again and again.
When not to use this skill
- Do not use this when you only need a one-off answer and do not need a reusable workflow.
- Do not use it if you cannot install or maintain the related files, repository context, or supporting tools.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in .claude/skills/ocrmypdf-image/SKILL.md inside your project
- Restart your AI agent; it will auto-discover the skill
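The manual installation steps can be sketched as a small shell function. This is only an illustration: `install_skill` is a hypothetical helper name, and it assumes SKILL.md has already been downloaded to a local path.

```bash
# Hypothetical helper (not part of the skill): copy a downloaded SKILL.md
# into the skill directory of a project.
install_skill() {
  src="$1"            # path to the downloaded SKILL.md
  project="${2:-.}"   # project root; defaults to the current directory
  dest="$project/.claude/skills/ocrmypdf-image"
  mkdir -p "$dest" && cp "$src" "$dest/SKILL.md"
}
```

For example, `install_skill ~/Downloads/SKILL.md .` places the file under the current project; the agent still needs a restart afterwards.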
How ocrmypdf-image Compares
| Feature / Agent | ocrmypdf-image | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
OCRmyPDF image processing skill — deskew, rotate, clean, despeckle, remove border from scanned documents. Use when the user needs to improve scanned PDF quality, fix skewed pages, remove noise, or clean up scanned documents before OCR.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# OCRmyPDF: Image Processing Guide

## Overview

[OCRmyPDF](https://github.com/ocrmypdf/OCRmyPDF) includes image processing options that improve scan quality before OCR. They fix skewed pages, correct page orientation, and remove noise and border artifacts.

For core OCR functionality, see the **ocrmypdf** skill. For optimization and PDF/A options, see **ocrmypdf-optimize**. For batch/Docker/scripting, see **ocrmypdf-batch**.

## Deskew

Deskew corrects pages that are slightly rotated (e.g., from feed scanner skew).

```bash
# Auto deskew (recommended)
ocrmypdf --deskew input.pdf output.pdf

# Deskew a file whose pages already contain text
# (--force-ocr rasterizes every page and redoes OCR)
ocrmypdf --deskew --force-ocr input.pdf output.pdf
```

## Rotation

Rotate pages to correct upside-down or sideways scans:

```bash
# Auto-rotate based on detected text orientation
ocrmypdf --rotate-pages input.pdf output.pdf

# Rotate more readily by lowering the confidence threshold
ocrmypdf --rotate-pages --rotate-pages-threshold 5 input.pdf output.pdf
```

## Remove Borders / Cleaning

Borders, artifacts, and noise are removed by unpaper, which OCRmyPDF runs when `--clean` is given. By default the cleaned image is used only for OCR; `--clean-final` also keeps it in the output PDF.

```bash
# Clean pages before OCR (the output keeps the original image)
ocrmypdf --clean input.pdf output.pdf

# Clean pages and keep the cleaned image in the output
ocrmypdf --clean --clean-final input.pdf output.pdf
```

## Despeckle

OCRmyPDF's documented options do not include a dedicated despeckle flag. Speckles and isolated noise pixels are handled by unpaper's noise filters as part of `--clean`:

```bash
# Remove speckles before OCR
ocrmypdf --clean input.pdf output.pdf

# Very noisy scans: keep the cleaned image in the output as well
ocrmypdf --clean --clean-final input.pdf output.pdf
```

## Unpaper

[unpaper](https://github.com/Flameeyes/unpaper) does the cleaning behind `--clean` and must be installed separately. Extra unpaper options are passed with `--unpaper-args`:

```bash
# Apply unpaper with default settings
ocrmypdf --clean input.pdf output.pdf

# Pass custom options to unpaper (requires --clean)
ocrmypdf --clean --unpaper-args '--layout double' input.pdf output.pdf
```

## Oversampling

Increase image resolution before OCR for better accuracy:

```bash
# Oversample to 300 DPI before OCR
ocrmypdf --oversample 300 input.pdf output.pdf

# Higher target for very low-resolution scans
ocrmypdf --oversample 400 input.pdf output.pdf
```

## Combined Recipes

### Fix a skewed scan

```bash
ocrmypdf --deskew --clean scanned.pdf fixed.pdf
```

### Clean up a very noisy scan

```bash
ocrmypdf --deskew --rotate-pages --clean --oversample 300 noisy.pdf clean.pdf
```

### Remove all artifacts

```bash
ocrmypdf --clean --clean-final dirty.pdf clean.pdf
```

## Quick Reference

| Task | Option |
|------|--------|
| Auto deskew | `--deskew` |
| Auto rotate | `--rotate-pages` |
| Remove borders and speckles (unpaper) | `--clean` |
| Keep the cleaned image in the output | `--clean-final` |
| Custom unpaper options | `--unpaper-args` |
| Oversample DPI | `--oversample N` |

## Troubleshooting

- **Poor OCR after cleaning**: Try `--oversample 300` to increase input resolution.
- **Artifacts remain in the output**: `--clean` only affects the image used for OCR; add `--clean-final` to keep the cleaned image.
- **Over-cleaned image**: Drop `--clean-final` or reduce the cleaning options to preserve original quality.
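The recipes above can be wrapped in a small shell function so the same flag sets are reused across runs. This is a minimal sketch, not part of the skill itself: the function name `clean_scan`, the preset names `light` and `noisy`, and the `DRY_RUN` variable are all hypothetical, and only `--deskew`, `--rotate-pages`, `--clean`, and `--oversample` are passed to `ocrmypdf`.

```bash
# Hypothetical wrapper: choose a flag set by preset name, then run ocrmypdf.
# Set DRY_RUN=1 to print the command instead of running it.
clean_scan() {
  preset="$1"; input="$2"; output="$3"
  case "$preset" in
    light) flags="--deskew --rotate-pages" ;;
    noisy) flags="--deskew --rotate-pages --clean --oversample 300" ;;
    *)     echo "unknown preset: $preset" >&2; return 2 ;;
  esac
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "ocrmypdf $flags $input $output"
  else
    # $flags is intentionally unquoted so it splits into separate arguments
    ocrmypdf $flags "$input" "$output"
  fi
}
```

Running `DRY_RUN=1 clean_scan noisy noisy.pdf clean.pdf` shows the command that would be executed, which is a cheap way to check a preset before processing a large file.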
Related Skills
tui-image
Generate and render a pixel-precise ASCII TUI Image component with complete output blocks (TUI_RENDER, COMPONENT_SPEC, PENCIL_SPEC, PENCIL_BATCH_DESIGN) for Pencil MCP drawing workflows. Use when the user asks to create an image in a terminal UI, text-based interface, or Pencil MCP project.
ocrmypdf
OCRmyPDF core skill — add searchable OCR text layer to scanned PDFs, convert images to searchable PDFs, support 100+ languages via Tesseract. Use when the user needs to OCR a PDF, make a scanned PDF searchable, or extract text from scanned documents.
ocrmypdf-optimize
OCRmyPDF optimization skill — compress PDFs, configure PDF/A output, JBIG2 encoding, and lossless optimization. Use when the user needs to reduce PDF file size, create archival PDF/A files, or optimize OCR output.
ocrmypdf-batch
OCRmyPDF batch processing skill — process multiple PDFs, Docker automation, shell scripting, and CI/CD integration. Use when the user needs to OCR many PDFs, set up automated OCR pipelines, or integrate OCR into workflows.
ascii-image-to-ascii
Convert an image into ASCII art (readable + detail variants, width/charset controls, optional ANSI), for terminal previews and plain-text image substitutes.
ocrmypdf-api
OCRmyPDF Python API and plugin skill — use OCRmyPDF programmatically from Python, integrate with applications, and extend with plugins (EasyOCR, PaddleOCR, AppleOCR). Use when the user needs to call OCRmyPDF from Python code, build OCR pipelines, or use alternative OCR engines.
vant-vue3
Provides structured guidance for Vant of Vue 3.0. Use when the user needs Vant with Vue 3, asks about mobile UI components such as Button, Cell, Form, Dialog, Toast, Popup, ConfigProvider, theme customization, project setup, or wants to implement mobile-first interfaces with vant or van- components.
layui-vue3
Provides comprehensive guidance for Layui Vue component library including components, layer dialogs, and utilities. Use when the user asks about Layui Vue, needs to use Layui components in Vue 3, or implement UI components.
element-plus-vue3
Provides comprehensive guidance for Element Plus Vue 3 component library including installation, components, themes, internationalization, and API reference. Use when the user asks about Element Plus for Vue 3, needs to build Vue 3 applications with Element Plus, or customize component styles.
bootstrap-vue3
Provides comprehensive guidance for Bootstrap Vue 3 component library including Bootstrap components, grid system, utilities, and Vue 3 integration. Use when the user asks about Bootstrap Vue 3, needs to use Bootstrap components in Vue 3, or implement responsive layouts.
vuex-vue2
Provides comprehensive guidance for Vuex 2.x state management in Vue 2 applications including state, mutations, actions, getters, modules, and plugins. Use when the user asks about Vuex for Vue 2, needs to manage state in Vue 2 applications, or implement Vuex patterns.
vue3
Guidance for Vue 3 using the official guide and API reference. Use when the user needs Vue 3 concepts, patterns, or API details to build components, apps, and tooling.