table-extraction-and-reconstruction

Table extraction and reconstruction

7,385 stars

Best use case

table-extraction-and-reconstruction is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using table-extraction-and-reconstruction should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/table-extraction-and-reconstruction/SKILL.md --create-dirs "https://raw.githubusercontent.com/kreuzberg-dev/kreuzberg/main/.ai-rulez/domains/ocr-integration/skills/table-extraction-and-reconstruction/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/table-extraction-and-reconstruction/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How table-extraction-and-reconstruction Compares

Feature / Agent           table-extraction-and-reconstruction   Standard Approach
Platform Support          Not specified                         Limited / Varies
Context Awareness         High                                  Baseline
Installation Complexity   Unknown                               N/A

Frequently Asked Questions

What does this skill do?

It identifies tables in OCR output (hOCR word positions, plus TSV output when available) and reconstructs them as Markdown tables.

Where can I find the source code?

You can find the source code on GitHub in the kreuzberg-dev/kreuzberg repository, under .ai-rulez/domains/ocr-integration/skills/table-extraction-and-reconstruction/.

SKILL.md Source

Identify and reconstruct tables from OCR data

1. Extract word positions from hOCR
2. Perform spatial clustering:
   - Group words by vertical alignment
   - Identify column boundaries
   - Identify row boundaries
3. Detect table cells:
   - Assign words to cells
   - Handle merged cells
   - Detect cell padding
4. Parse TSV output if available:
   - Map cells to TSV rows/cols
   - Merge with hOCR data
5. Convert to Markdown table:
   - Generate table header
   - Add alignment indicators
   - Handle spanning cells
   - Escape special characters
6. Validate table structure
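
The steps above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the skill's or Kreuzberg's actual implementation. It assumes hOCR words are `span` elements with class `ocrx_word` and a `bbox x0 y0 x1 y1` entry in their `title` attribute. All function names and the `row_tol` and `col_gap` pixel thresholds are hypothetical. Merged cells and the TSV merge step are left out to keep the example short.

```python
import re
from html.parser import HTMLParser


class HocrWords(HTMLParser):
    """Step 1: collect (x0, y0, x1, y1, text) for every ocrx_word span."""

    def __init__(self):
        super().__init__()
        self.words = []
        self._bbox = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "span" and "ocrx_word" in (a.get("class") or "").split():
            m = re.search(r"bbox (\d+) (\d+) (\d+) (\d+)", a.get("title") or "")
            if m:
                self._bbox = tuple(int(g) for g in m.groups())
                self._text = []

    def handle_data(self, data):
        if self._bbox is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "span" and self._bbox is not None:
            text = "".join(self._text).strip()
            if text:
                self.words.append((*self._bbox, text))
            self._bbox = None


def row_centres(words, tol):
    """Step 2: cluster vertical word centres; a jump larger than tol starts a new row."""
    rows = []
    for y in sorted((w[1] + w[3]) / 2 for w in words):
        if rows and y - rows[-1][-1] <= tol:
            rows[-1].append(y)
        else:
            rows.append([y])
    return [sum(r) / len(r) for r in rows]


def column_spans(words, gap):
    """Step 2: merge horizontal word extents; a gap wider than `gap` separates columns."""
    spans = []
    for x0, _, x1, _, _ in sorted(words):
        if spans and x0 - spans[-1][1] <= gap:
            spans[-1][1] = max(spans[-1][1], x1)
        else:
            spans.append([x0, x1])
    return spans


def build_grid(words, row_tol=10, col_gap=25):
    """Step 3: assign each word to the nearest row and the column span containing it."""
    rows, cols = row_centres(words, row_tol), column_spans(words, col_gap)
    cells = [[[] for _ in cols] for _ in rows]
    for x0, y0, x1, y1, text in words:
        yc, xc = (y0 + y1) / 2, (x0 + x1) / 2
        r = min(range(len(rows)), key=lambda i: abs(rows[i] - yc))
        c = next(i for i, (lo, hi) in enumerate(cols) if lo <= xc <= hi)
        cells[r][c].append((x0, text))
    return [[" ".join(t for _, t in sorted(cell)) for cell in row] for row in cells]


def escape(text):
    """Step 5: escape characters that would break a Markdown table cell."""
    return text.replace("\\", "\\\\").replace("|", "\\|")


def to_markdown(grid):
    """Step 5: first row is the header; all-numeric body columns are right-aligned."""
    header, body = grid[0], grid[1:]
    numeric = re.compile(r"-?[\d.,]+%?")
    marks = []
    for c in range(len(header)):
        col = [row[c] for row in body if row[c]]
        marks.append("---:" if col and all(numeric.fullmatch(v) for v in col) else "---")
    lines = ["| " + " | ".join(escape(v) for v in header) + " |",
             "| " + " | ".join(marks) + " |"]
    lines += ["| " + " | ".join(escape(v) for v in row) + " |" for row in body]
    return "\n".join(lines)


def is_valid(grid):
    """Step 6: rectangular, header plus at least one row, no fully empty column."""
    if len(grid) < 2 or not grid[0]:
        return False
    width = len(grid[0])
    if any(len(row) != width for row in grid):
        return False
    return all(any(row[c] for row in grid) for c in range(width))


# Made-up hOCR fragment for illustration only.
HOCR = """
<span class='ocrx_word' title='bbox 10 10 60 30; x_wconf 95'>Item</span>
<span class='ocrx_word' title='bbox 200 10 250 30; x_wconf 95'>Qty</span>
<span class='ocrx_word' title='bbox 10 50 50 70; x_wconf 93'>Red</span>
<span class='ocrx_word' title='bbox 58 50 110 70; x_wconf 93'>apple</span>
<span class='ocrx_word' title='bbox 200 51 220 71; x_wconf 90'>3</span>
"""
parser = HocrWords()
parser.feed(HOCR)
grid = build_grid(parser.words)
if is_valid(grid):
    print(to_markdown(grid))
```

Columns are found by merging horizontal word extents rather than by clustering left edges, so a multi-word cell such as "Red apple" stays in one column. Both thresholds depend on scan resolution and font size and would need tuning for real documents.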