data-migration-validator

Validate data integrity during and after migration with comprehensive verification checks

509 stars

Best use case

data-migration-validator is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Validate data integrity during and after migration with comprehensive verification checks

Teams using data-migration-validator should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/data-migration-validator/SKILL.md --create-dirs "https://raw.githubusercontent.com/a5c-ai/babysitter/main/library/specializations/code-migration-modernization/skills/data-migration-validator/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/data-migration-validator/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How data-migration-validator Compares

Feature / Agentdata-migration-validatorStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Validate data integrity during and after migration with comprehensive verification checks

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Data Migration Validator Skill

Validates data integrity throughout the migration process with comprehensive verification checks and reconciliation reporting.

## Purpose

Enable data validation for:
- Row count validation
- Checksum verification
- Sample data comparison
- Referential integrity checking
- Business rule validation

## Capabilities

### 1. Row Count Validation
- Compare source/target counts
- Track by table/partition
- Identify discrepancies
- Generate count reports

### 2. Checksum Verification
- Calculate table checksums
- Compare hash values
- Identify data drift
- Verify data consistency

### 3. Sample Data Comparison
- Random sample selection
- Field-by-field comparison
- Statistical sampling
- Confidence scoring

### 4. Referential Integrity Checking
- Verify foreign keys
- Check orphaned records
- Validate relationships
- Report violations

### 5. Business Rule Validation
- Apply custom rules
- Check data constraints
- Verify transformations
- Validate calculations

### 6. Reconciliation Reporting
- Generate audit reports
- Track discrepancies
- Document exceptions
- Provide sign-off reports

## Tool Integrations

| Tool | Purpose | Integration Method |
|------|---------|-------------------|
| Great Expectations | Data validation | Library |
| dbt tests | Transform validation | CLI |
| Custom SQL | Database checks | CLI |
| DataGrip | Manual verification | GUI |
| Apache Griffin | Data quality | API |

## Output Schema

```json
{
  "validationId": "string",
  "timestamp": "ISO8601",
  "results": {
    "rowCounts": {
      "tables": [
        {
          "name": "string",
          "source": "number",
          "target": "number",
          "match": "boolean"
        }
      ]
    },
    "checksums": {
      "tables": [],
      "overall": "string"
    },
    "samples": {
      "checked": "number",
      "matched": "number",
      "discrepancies": []
    },
    "referentialIntegrity": {
      "valid": "boolean",
      "violations": []
    },
    "businessRules": {
      "passed": "number",
      "failed": "number",
      "failures": []
    }
  },
  "summary": {
    "status": "passed|failed|warning",
    "score": "number"
  }
}
```

## Integration with Migration Processes

- **database-schema-migration**: Post-migration validation
- **cloud-migration**: Data validation

## Related Skills

- `schema-comparator`: Pre-migration comparison
- `etl-pipeline-builder`: Migration execution

## Related Agents

- `data-integrity-validator`: Orchestrates validation
- `database-migration-orchestrator`: Uses for verification

Related Skills

structured-data

509
from a5c-ai/babysitter

JSON-LD schema markup and validation.

design-system-validator

509
from a5c-ai/babysitter

Validate design system compliance in code and detect token usage violations

link-validator

509
from a5c-ai/babysitter

Comprehensive link checking and validation for documentation. Validate internal links, external URLs, anchors, detect redirects, monitor link rot, and generate sitemap validation reports.

code-sample-validator

509
from a5c-ai/babysitter

Extract, validate, and test code samples in documentation. Verify syntax, execute samples, check outputs, validate imports, and ensure code samples are up-to-date with current APIs.

openapi-validator

509
from a5c-ai/babysitter

Validate OpenAPI specifications for correctness, security, and best practices

k8s-validator

509
from a5c-ai/babysitter

Validate Kubernetes manifests for security, best practices, and resource limits

CVE/CWE Database Skill

509
from a5c-ai/babysitter

CVE and CWE database querying and management

test-data-generation

509
from a5c-ai/babysitter

Synthetic test data generation and management using Faker.js and similar tools. Generate realistic test data, create data factories, implement database seeding, and manage test data anonymization.

iOS Persistence (Core Data/Realm)

509
from a5c-ai/babysitter

Specialized skill for iOS local data persistence solutions

Room Database

509
from a5c-ai/babysitter

Expert skill for Android Room persistence library

specialization-validator

509
from a5c-ai/babysitter

Validate specialization completeness across all 7 phases, score each phase, identify gaps, and generate validation reports.

process-validator

509
from a5c-ai/babysitter

Validate process JS files for correct SDK patterns, task definitions, syntax, and quality gate implementation.