ocr-local
Extract text from images using Tesseract.js OCR (100% local, no API key required). Supports Chinese (simplified/traditional) and English.
3,556 stars
by openclaw
Installation
Claude Code / Cursor / Codex
$ curl -o ~/.claude/skills/ocr-local-v2/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/15914355527/ocr-local-v2/SKILL.md"
Manual Installation
- Download SKILL.md from GitHub
- Place it in .claude/skills/ocr-local-v2/SKILL.md inside your project
- Restart your AI agent; it will auto-discover the skill
How ocr-local Compares
| Feature / Agent | ocr-local | Standard Approach |
|---|---|---|
| Platform Support | Multiple agents | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Extract text from images using Tesseract.js OCR (100% local, no API key required). Supports Chinese (simplified/traditional) and English.
Which AI agents support this skill?
This skill is listed as compatible with multiple agents, including Claude Code, Cursor, and Codex.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# OCR - Image Text Recognition (Local)
Extract text from images using Tesseract.js. **Runs 100% locally, no API key required.** Supports Chinese (simplified and traditional) and English.
## Quick start
```bash
node {baseDir}/scripts/ocr.js /path/to/image.jpg
node {baseDir}/scripts/ocr.js /path/to/image.png --lang chi_sim
node {baseDir}/scripts/ocr.js /path/to/image.jpg --lang chi_tra+eng
```
## Options
- `--lang <langs>`: Language codes (default: chi_sim+eng)
  - `chi_sim` - Simplified Chinese
  - `chi_tra` - Traditional Chinese
  - `eng` - English
  - Combine with `+`: `chi_sim+eng`
- `--json`: Output as JSON instead of plain text
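The source of `scripts/ocr.js` is not shown here, so the following is only a minimal sketch of how the two options above could be parsed. The function name `parseOcrArgs`, the `KNOWN_LANGS` set, and the restriction to the three listed language codes are assumptions made for illustration, not the script's actual behavior.

```javascript
// Hypothetical sketch: the real scripts/ocr.js may parse its arguments differently.
// Assumption: only the three language codes documented above are accepted.
const KNOWN_LANGS = new Set(['chi_sim', 'chi_tra', 'eng']);

function parseOcrArgs(argv) {
  // Defaults mirror the documented default of chi_sim+eng and plain-text output.
  const opts = { image: null, langs: ['chi_sim', 'eng'], json: false };

  for (let i = 0; i < argv.length; i++) {
    const arg = argv[i];
    if (arg === '--lang') {
      const value = argv[++i];
      if (!value) throw new Error('--lang requires a value, e.g. chi_sim+eng');
      opts.langs = value.split('+'); // "chi_tra+eng" -> ["chi_tra", "eng"]
    } else if (arg === '--json') {
      opts.json = true;
    } else if (opts.image === null) {
      opts.image = arg; // first positional argument is the image path
    } else {
      throw new Error(`Unexpected argument: ${arg}`);
    }
  }

  const unknown = opts.langs.filter((code) => !KNOWN_LANGS.has(code));
  if (unknown.length > 0) {
    throw new Error(`Unsupported language code(s): ${unknown.join(', ')}`);
  }
  return opts;
}
```

In a script, this would typically be called as `parseOcrArgs(process.argv.slice(2))`.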
## Examples
```bash
# Recognize a Chinese screenshot (uses the default chi_sim+eng)
node {baseDir}/scripts/ocr.js screenshot.png
# Recognize English document
node {baseDir}/scripts/ocr.js document.jpg --lang eng
# Mixed Chinese + English
node {baseDir}/scripts/ocr.js mixed.png --lang chi_sim+eng
```
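The single-image commands above can be wrapped in a small Node.js helper to process a whole folder. This is a sketch under two assumptions: that the script prints the recognized text to standard output, and that `scriptPath` points at your local copy of `{baseDir}/scripts/ocr.js`. The helper names (`isImage`, `buildArgs`, `ocrDirectory`) are hypothetical.

```javascript
const { execFileSync } = require('child_process');
const fs = require('fs');
const path = require('path');

// Assumption: these extensions cover the images you want to process.
const IMAGE_EXTS = new Set(['.png', '.jpg', '.jpeg']);

function isImage(file) {
  return IMAGE_EXTS.has(path.extname(file).toLowerCase());
}

// Build the argument list for one invocation of the OCR script.
function buildArgs(scriptPath, image, lang) {
  const args = [scriptPath, image];
  if (lang) args.push('--lang', lang);
  return args;
}

// Run the OCR script once per image in `dir` and collect its stdout by file name.
function ocrDirectory(scriptPath, dir, lang) {
  const results = {};
  for (const name of fs.readdirSync(dir).filter(isImage).sort()) {
    const image = path.join(dir, name);
    results[name] = execFileSync(process.execPath, buildArgs(scriptPath, image, lang), {
      encoding: 'utf8',
    });
  }
  return results;
}
```

For example, `ocrDirectory('/path/to/scripts/ocr.js', './scans', 'eng')` would return an object mapping each image file name to whatever the script printed for it.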
## Notes
- First run downloads language data (~20MB per language)
- Subsequent runs reuse the locally cached language data
- Works best with clear, high-contrast images
- For handwritten text, accuracy may vary
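The download-then-cache behavior in the notes above matches how a Tesseract.js worker fetches its language data. The sketch below calls Tesseract.js directly, assuming version 5 or later is installed (`npm install tesseract.js`). Whether `scripts/ocr.js` is written this way is not shown in the source; `splitLangs` and `recognizeImage` are hypothetical names, and the `cachePath` default of `'.'` is an illustrative choice.

```javascript
// Split a "+"-joined language spec into the array form Tesseract.js accepts.
function splitLangs(spec) {
  return spec.split('+').map((code) => code.trim()).filter(Boolean);
}

// Assumption: tesseract.js v5+ is installed; it is loaded lazily so that
// splitLangs can be used on its own without the dependency.
async function recognizeImage(imagePath, langSpec = 'chi_sim+eng', cachePath = '.') {
  const { createWorker } = require('tesseract.js');

  // The first call downloads the language data; later calls find it under cachePath.
  // The second argument, 1, selects the LSTM-only engine mode.
  const worker = await createWorker(splitLangs(langSpec), 1, {
    cachePath,
    // Progress goes to stderr so stdout stays clean for the recognized text.
    logger: (m) => console.error(`${m.status}: ${Math.round(m.progress * 100)}%`),
  });

  try {
    const { data } = await worker.recognize(imagePath);
    return data.text;
  } finally {
    await worker.terminate(); // always release the worker
  }
}
```

A call such as `recognizeImage('screenshot.png').then(console.log)` would print the recognized text once the worker finishes.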