handwriting-recognition-guide
Apply handwriting OCR to digitize historical and archival documents
Best use case
handwriting-recognition-guide is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Apply handwriting OCR to digitize historical and archival documents
Teams using handwriting-recognition-guide should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/handwriting-recognition-guide/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How handwriting-recognition-guide Compares
| Feature / Agent | handwriting-recognition-guide | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Apply handwriting OCR to digitize historical and archival documents
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Handwriting Recognition Guide
A skill for applying handwriting text recognition (HTR) to digitize historical documents, archival manuscripts, and handwritten research notes. Covers HTR platforms, image preprocessing, model training, post-correction, and integration into digital humanities research workflows.
## Handwriting Recognition vs. Printed OCR
### Key Differences
```
Printed Text OCR:
- Characters are standardized and uniform
- Well-solved problem (>99% accuracy on clean scans)
- Tools: Tesseract, ABBYY FineReader, Adobe Acrobat
Handwriting Text Recognition (HTR):
- Characters vary by writer, mood, pen, era
- Much harder -- typically 85-95% character accuracy
- Requires training on specific handwriting styles
- Tools: Transkribus, Kraken, HTR-Flor, Google Cloud Vision
Challenges specific to historical documents:
- Faded ink, bleed-through, stains, tears
- Archaic letterforms and abbreviations
- Multiple hands in one document
- Non-standard orthography
- Mixed languages and scripts
```
## HTR Platforms
### Transkribus (State of the Art for Historical Documents)
**Pricing note:** Transkribus uses a credit-based pricing model. A limited free tier is available, but processing large volumes of pages requires purchasing credits.
```
Transkribus is the leading platform for historical HTR.
Workflow:
1. Upload document images
2. Automatic layout analysis (detect text regions and baselines)
3. Manual correction of layout (if needed)
4. Apply a pre-trained HTR model (or train your own)
5. Review and correct transcription
6. Export as TEXT, PAGE XML, TEI, DOCX, or PDF
Pre-trained models:
- Noscemus GM (general model for Latin scripts)
- English Writing M1 (18th-19th century English)
- German Kurrent models
- Dutch, French, Italian, Spanish models available
Training a custom model:
- Requires ~15,000-25,000 words of ground truth (manually transcribed)
- Can start with a pre-trained base model and fine-tune
- Training takes 1-8 hours depending on dataset size
```
### Other Tools
| Tool | Type | Strengths |
|------|------|----------|
| Transkribus | Cloud platform | Best for historical documents, active community |
| Kraken | Open source (Python) | Flexible, scriptable, custom training |
| eScriptorium | Open source (web) | Based on Kraken, collaborative interface |
| Google Cloud Vision | API | Good for modern handwriting, many languages |
| Azure AI Vision | API | Competitive with Google for modern text |
| HTR-Flor | Open source | Research-focused, PyTorch-based |
## Image Preprocessing
### Preparing Scans for HTR
```python
from PIL import Image, ImageFilter, ImageEnhance
def preprocess_document_image(image_path: str,
output_path: str) -> dict:
"""
Preprocess a document scan for optimal HTR performance.
Args:
image_path: Path to the input scan
output_path: Path to save the preprocessed image
"""
img = Image.open(image_path)
# Convert to grayscale
img = img.convert("L")
# Enhance contrast
enhancer = ImageEnhance.Contrast(img)
img = enhancer.enhance(1.5)
# Remove noise
img = img.filter(ImageFilter.MedianFilter(size=3))
# Binarize (convert to black and white)
threshold = 128
img = img.point(lambda x: 255 if x > threshold else 0, "1")
img.save(output_path)
return {
"original": image_path,
"processed": output_path,
"steps_applied": [
"Grayscale conversion",
"Contrast enhancement (1.5x)",
"Median filter (noise removal)",
"Binarization (threshold=128)"
],
"additional_steps_if_needed": [
"Deskewing (correct rotation)",
"Dewarping (correct page curvature)",
"Bleed-through removal",
"Background normalization"
]
}
```
### Scanning Best Practices
```
Resolution: 300-400 DPI for most documents
600 DPI for fine handwriting or damaged originals
Color: Grayscale usually sufficient; color for illuminated MSS
Format: TIFF (lossless) for archival; PNG for working copies
Lighting: Even, diffused light; avoid shadows and glare
Flatness: Use a book cradle or V-shaped scanner for bound volumes
Calibration: Include a color/grayscale chart for batch consistency
```
## Post-OCR Correction
### Semi-Automated Correction Workflow
```python
def post_correction_workflow(raw_transcription: str,
dictionary: set,
confidence_threshold: float = 0.8) -> dict:
"""
Post-correction strategy for HTR output.
Args:
raw_transcription: Raw OCR/HTR text output
dictionary: Set of valid words for the document's language/period
confidence_threshold: Below this, flag for manual review
"""
words = raw_transcription.split()
flagged = []
corrected = []
for word in words:
clean = word.strip(".,;:!?()[]")
if clean.lower() in dictionary:
corrected.append(word)
else:
flagged.append({
"word": word,
"position": len(corrected),
"suggestion": "Manual review needed"
})
corrected.append(word)
return {
"total_words": len(words),
"flagged_words": len(flagged),
"estimated_accuracy": 1 - len(flagged) / max(len(words), 1),
"flagged": flagged[:20],
"correction_strategies": [
"Dictionary-based spell checking (period-appropriate dictionary)",
"N-gram language model for context-aware correction",
"Crowdsourcing (Zooniverse, FromThePage)",
"Double-keying (two independent transcribers, compare)",
"AI-assisted correction with human verification"
]
}
```
## Integration with Research Workflows
### From Transcription to Analysis
```
1. Transcribe documents using HTR
2. Correct and validate transcriptions
3. Encode in TEI-XML for digital editions
4. Apply NLP for named entity recognition, topic modeling
5. Link entities to knowledge bases (Wikidata, VIAF)
6. Publish as a searchable digital archive
Tools for TEI encoding:
- oXygen XML Editor (standard for digital humanities)
- TEI Publisher (web-based publishing platform)
- FromThePage (collaborative transcription with TEI export)
```
## Evaluating HTR Accuracy
Report Character Error Rate (CER) and Word Error Rate (WER) on a held-out test set. CER below 5% is generally considered production-quality for historical documents. Always compare against a manually created ground truth. Report accuracy separately for different document types, hands, or time periods if your corpus is heterogeneous.Related Skills
thuthesis-guide
Write Tsinghua University theses using the ThuThesis LaTeX template
thesis-writing-guide
Templates, formatting rules, and strategies for thesis and dissertation writing
thesis-template-guide
Set up LaTeX templates for PhD and Master's thesis documents
sjtuthesis-guide
Write SJTU theses using the SJTUThesis LaTeX template with full compliance
novathesis-guide
LaTeX thesis template supporting multiple universities and formats
graphical-abstract-guide
Create SVG graphical abstracts for journal paper submissions
beamer-presentation-guide
Guide to creating academic presentations with LaTeX Beamer
plagiarism-detection-guide
Use plagiarism detection tools and ensure manuscript originality
paper-polish-guide
Review and polish LaTeX research papers for clarity and style
grammar-checker-guide
Use grammar and style checking tools to polish academic manuscripts
conciseness-editing-guide
Eliminate wordiness and redundancy in academic prose for clarity
academic-translation-guide
Academic translation, post-editing, and Chinglish correction guide