Great Expectations Generator

Generates Great Expectations suites from data profiles and business rules

509 stars

Best use case

Great Expectations Generator is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Generates Great Expectations suites from data profiles and business rules

Teams using Great Expectations Generator should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/great-expectations-generator/SKILL.md --create-dirs "https://raw.githubusercontent.com/a5c-ai/babysitter/main/library/specializations/data-engineering-analytics/skills/great-expectations-generator/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/great-expectations-generator/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How Great Expectations Generator Compares

Feature / AgentGreat Expectations GeneratorStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Generates Great Expectations suites from data profiles and business rules

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Great Expectations Generator

## Overview

Generates Great Expectations suites from data profiles and business rules. This skill automates the creation of comprehensive expectation suites that enforce data quality constraints.

## Capabilities

- Expectation suite generation from profiling
- Custom expectation creation
- Checkpoint configuration
- Data docs generation
- Validation result analysis
- Expectation parameterization
- Suite versioning recommendations
- Integration with dbt and Airflow

## Input Schema

```json
{
  "dataProfile": "object",
  "businessRules": ["object"],
  "existingSuite": "object",
  "strictness": "strict|moderate|lenient"
}
```

## Output Schema

```json
{
  "expectationSuite": "object",
  "checkpointConfig": "object",
  "documentation": "string",
  "coverageReport": {
    "columnsWithExpectations": "number",
    "totalExpectations": "number"
  }
}
```

## Target Processes

- Data Quality Framework
- ETL/ELT Pipeline
- dbt Project Setup

## Usage Guidelines

1. Provide data profile results from profiling analysis
2. Define business rules that should be enforced
3. Specify strictness level based on use case requirements
4. Include existing suite if extending an existing configuration

## Best Practices

- Start with moderate strictness and adjust based on validation results
- Include both column-level and table-level expectations
- Document business rationale for each custom expectation
- Version expectation suites alongside data transformations
- Configure appropriate data docs for stakeholder visibility

Related Skills

color-palette-generator

509
from a5c-ai/babysitter

Generate accessible color palettes with WCAG compliance

tracing-schema-generator

509
from a5c-ai/babysitter

Generate distributed tracing schemas for OpenTelemetry with Jaeger/Zipkin integration

metrics-schema-generator

509
from a5c-ai/babysitter

Generate metrics schemas for Prometheus, OpenTelemetry, and Grafana dashboards

log-schema-generator

509
from a5c-ai/babysitter

Generate structured logging schemas with correlation ID patterns and ELK/Splunk integration

load-test-generator

509
from a5c-ai/babysitter

Generate load test scripts for k6, Locust, and Gatling from OpenAPI specs

graphql-schema-generator

509
from a5c-ai/babysitter

Generate GraphQL schemas from data models with resolver stubs and federation support

docs-site-generator

509
from a5c-ai/babysitter

Generate documentation sites using Docusaurus, MkDocs, or VuePress

dependency-graph-generator

509
from a5c-ai/babysitter

Generate module dependency graphs with circular dependency detection and coupling metrics

dashboard-generator

509
from a5c-ai/babysitter

Generate monitoring dashboards for Grafana and DataDog with alert integration

c4-diagram-generator

509
from a5c-ai/babysitter

Specialized skill for generating C4 model architecture diagrams. Supports Structurizr DSL, PlantUML, and Mermaid formats with multi-level abstraction (Context, Container, Component, Code).

adr-generator

509
from a5c-ai/babysitter

Specialized skill for generating and managing Architecture Decision Records (ADRs). Supports Nygard, MADR, and custom templates with auto-numbering, linking, and status management.

typespec-sdk-generator

509
from a5c-ai/babysitter

Microsoft TypeSpec-based API and SDK generation