ocr-local

Extract text from images using Tesseract.js OCR (100% local, no API key required). Supports Chinese (simplified/traditional) and English.

3,556 stars

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/ocr-local-v2/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/15914355527/ocr-local-v2/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/ocr-local-v2/SKILL.md inside your project
  3. Restart your AI agent so that it auto-discovers the skill

How ocr-local Compares

| Feature / Agent | ocr-local | Standard Approach |
| --- | --- | --- |
| Platform Support | multi | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |

Frequently Asked Questions

What does this skill do?

Extract text from images using Tesseract.js OCR (100% local, no API key required). Supports Chinese (simplified/traditional) and English.

Which AI agents support this skill?

This skill is listed as compatible with multiple agents. The installation instructions above cover Claude Code, Cursor, and Codex.

Where can I find the source code?

The source code is hosted on GitHub in the openclaw/skills repository, under skills/15914355527/ocr-local-v2.

SKILL.md Source

# OCR - Image Text Recognition (Local)

Extract text from images using Tesseract.js. **Runs 100% locally, no API key required.** Supports Chinese (simplified and traditional) and English.

## Quick start

```bash
node {baseDir}/scripts/ocr.js /path/to/image.jpg
node {baseDir}/scripts/ocr.js /path/to/image.png --lang chi_sim
node {baseDir}/scripts/ocr.js /path/to/image.jpg --lang chi_tra+eng
```

## Options

- `--lang <langs>`: Language codes (default: `chi_sim+eng`)
  - `chi_sim` - Simplified Chinese
  - `chi_tra` - Traditional Chinese  
  - `eng` - English
  - Combine with `+`: `chi_sim+eng`

- `--json`: Output as JSON instead of plain text
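
The options above could be parsed along the following lines. This is only a sketch: the function name `parseOcrArgs` and the shape of the returned object are assumptions made for illustration, and the actual `scripts/ocr.js` may handle its arguments differently.

```javascript
// Hypothetical parser for the documented flags; not taken from scripts/ocr.js.
function parseOcrArgs(argv) {
  // Defaults mirror the documentation: chi_sim+eng, plain-text output.
  const opts = { image: null, lang: "chi_sim+eng", json: false };
  for (let i = 0; i < argv.length; i++) {
    const arg = argv[i];
    if (arg === "--lang") {
      const value = argv[i + 1];
      if (value === undefined || value.startsWith("--")) {
        throw new Error("--lang requires a value such as chi_sim+eng");
      }
      opts.lang = value;
      i++; // skip the value that was just consumed
    } else if (arg === "--json") {
      opts.json = true;
    } else if (opts.image === null) {
      opts.image = arg; // first positional argument is the image path
    }
  }
  return opts;
}

// Example: the arguments that follow "node ocr.js" on the command line.
console.log(parseOcrArgs(["mixed.png", "--lang", "chi_tra+eng", "--json"]));
```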

## Examples

```bash
# Recognize a Chinese screenshot (uses the default chi_sim+eng)
node {baseDir}/scripts/ocr.js screenshot.png

# Recognize an English document
node {baseDir}/scripts/ocr.js document.jpg --lang eng

# Mixed Chinese + English
node {baseDir}/scripts/ocr.js mixed.png --lang chi_sim+eng
```
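
The same commands can also be driven from another Node.js script. The sketch below assumes only the documented CLI (`--lang`, `--json`); the helper names `buildOcrArgs` and `runOcr` are hypothetical, and the script path must be supplied by the caller in place of `{baseDir}`.

```javascript
const { execFile } = require("child_process");

// Build the argument list for scripts/ocr.js from the documented options.
function buildOcrArgs(scriptPath, imagePath, { lang, json = false } = {}) {
  const args = [scriptPath, imagePath];
  if (lang) args.push("--lang", lang);
  if (json) args.push("--json");
  return args;
}

// Run the CLI with the current Node binary and resolve with its stdout.
function runOcr(scriptPath, imagePath, options) {
  return new Promise((resolve, reject) => {
    const args = buildOcrArgs(scriptPath, imagePath, options);
    execFile(process.execPath, args, (err, stdout) => {
      if (err) reject(err);
      else resolve(stdout);
    });
  });
}
```

A call such as `runOcr("/path/to/skill/scripts/ocr.js", "document.jpg", { lang: "eng" }).then(console.log)` would then print whatever the script writes to standard output. Using `execFile` with an argument array, rather than a shell string, avoids quoting problems with image paths that contain spaces.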

## Notes

- The first run downloads language data (about 20 MB per language)
- Language data is cached locally, so subsequent runs skip the download
- Works best with clear, high-contrast images
- For handwritten text, accuracy may vary