Data Governance Framework
Assess, score, and remediate your organization's data governance posture across 6 domains.
About this skill
The Data Governance Framework skill empowers AI agents to conduct comprehensive, guided data governance assessments. It covers six critical domains: Data Quality, Data Cataloging, Access Control, Compliance Mapping, Retention & Lifecycle, and AI/Agent Data Governance. The agent interactively guides the user to evaluate 8 specific controls within each domain on a 0-3 scale, ranging from 'Not implemented' to 'Automated and continuously monitored'. Based on these evaluations, the skill calculates domain-specific and overall governance scores, providing clear interpretations (e.g., Critical, Developing, Managed, Optimized). Users can prioritize specific domains for assessment or evaluate all six. Following the assessment, the AI agent generates a prioritized remediation roadmap, highlighting areas of weakness and suggesting actionable improvements. This structured approach helps organizations understand their current data governance maturity, identify risks, and develop strategic plans to enhance their data handling practices. This skill is invaluable for organizations aiming to strengthen their data governance, ensure compliance with various regulations (like GDPR, CCPA, HIPAA), and manage data effectively, especially in the evolving landscape of AI and agent-based workflows. It provides a systematic way to measure progress and target specific areas for improvement, mitigating regulatory risks and improving operational efficiency.
Best use case
The primary use case for this skill is to conduct a structured, guided assessment of an organization's data governance maturity. Organizations, data governance officers, compliance teams, and IT managers would benefit most by using this skill to identify gaps, measure compliance, understand their risk posture, and create actionable plans to improve their data management and regulatory adherence.
Assess, score, and remediate your organization's data governance posture across 6 domains.
Users should expect a detailed assessment report of their data governance posture, including domain-specific scores, an overall governance score with a maturity rating, and a prioritized remediation roadmap.
Practical example
Example input
Assess my organization's data governance posture.
Example output
Based on your input, your organization's overall data governance score is 38% (Developing), requiring a 90-day improvement plan. Key areas for immediate remediation include implementing data profiling automation and establishing training data provenance.
When to use this skill
- When initiating a new data governance program or reviewing an existing one.
- To assess compliance with specific data regulations like GDPR, HIPAA, or CCPA.
- To identify weaknesses and risks in data quality, access control, or lifecycle management.
- To generate a prioritized remediation roadmap for data governance improvements.
When not to use this skill
- When seeking automated, real-time data governance monitoring without human input.
- If an organization lacks basic understanding or documentation of its data processes.
- For tasks unrelated to data governance assessment or remediation planning.
- As a substitute for a full, external data governance audit by human experts.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/afrexai-data-governance/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How Data Governance Framework Compares
| Feature / Agent | Data Governance Framework | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | easy | N/A |
Frequently Asked Questions
What does this skill do?
Assess, score, and remediate your organization's data governance posture across 6 domains.
How difficult is it to install?
The installation complexity is rated as easy. You can find the installation instructions above.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
AI Agents for Coding
Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.
Best AI Skills for Claude
Explore the best AI skills for Claude and Claude Code across coding, research, workflow automation, documentation, and agent operations.
Cursor vs Codex for AI Workflows
Compare Cursor and Codex for AI coding workflows, repository assistance, debugging, refactoring, and reusable developer skills.
SKILL.md Source
# Data Governance Framework Assess, score, and remediate your organization's data governance posture across 6 domains. ## What This Covers 1. **Data Quality** — Completeness, accuracy, consistency, timeliness scoring 2. **Data Cataloging** — Asset inventory, lineage tracking, metadata management 3. **Access Control** — Role-based permissions, least privilege, data classification (public/internal/confidential/restricted) 4. **Compliance Mapping** — GDPR, CCPA, SOX, HIPAA, PCI-DSS, industry-specific regulations 5. **Retention & Lifecycle** — Retention policies, archival schedules, deletion procedures, legal hold 6. **AI/Agent Data Governance** — Training data provenance, model input/output logging, bias detection, PII handling in agent workflows ## How to Use When asked to assess data governance: 1. Ask which domains are priority (or assess all 6) 2. For each domain, evaluate 8 controls on a 0-3 scale: - 0 = Not implemented - 1 = Ad hoc / informal - 2 = Documented and partially enforced - 3 = Automated and continuously monitored 3. Calculate domain score (sum / 24 × 100) 4. Calculate overall governance score (average of domains) 5. Generate remediation roadmap prioritized by risk ## Scoring Interpretation | Score | Rating | Action | |-------|--------|--------| | 0-25% | Critical | Immediate remediation — regulatory risk | | 26-50% | Developing | 90-day improvement plan required | | 51-75% | Managed | Optimize and automate weak areas | | 76-100% | Optimized | Maintain and benchmark against peers | ## Domain 1: Data Quality Controls 1. Data profiling automation (duplicate detection, format validation) 2. Quality dashboards with SLA thresholds 3. Root cause analysis for quality failures 4. Stewardship program (assigned data owners per domain) 5. Quality gates in data pipelines (reject bad data at ingestion) 6. Business rule validation (domain-specific logic checks) 7. Cross-system reconciliation (source vs target matching) 8. Quality trend tracking (month-over-month improvement metrics) ## Domain 2: Data Cataloging Controls 1. Automated asset discovery (databases, APIs, files, SaaS) 2. Business glossary with agreed definitions 3. Data lineage tracking (source → transformation → consumption) 4. Search and discovery interface for business users 5. Metadata enrichment (tags, classifications, sensitivity labels) 6. Catalog coverage tracking (% of assets documented) 7. Usage analytics (who accesses what, how often) 8. Integration with BI/analytics tools (catalog-aware queries) ## Domain 3: Access Control 1. Role-based access control (RBAC) with regular review 2. Data classification enforcement (labels drive permissions) 3. Least privilege principle (minimal default access) 4. Access request and approval workflows 5. Privileged access management (admin accounts monitored) 6. Access certification (quarterly re-certification of permissions) 7. Anomaly detection (unusual access patterns flagged) 8. De-provisioning automation (access removed on role change/exit) ## Domain 4: Compliance Mapping 1. Regulation inventory (which laws apply, by geography and industry) 2. Control-to-regulation mapping (which controls satisfy which requirements) 3. Data processing records (Article 30 GDPR / equivalent) 4. Consent management (capture, storage, withdrawal tracking) 5. Data subject rights automation (access, deletion, portability) 6. Cross-border transfer compliance (SCCs, adequacy decisions) 7. Breach notification procedures (72-hour GDPR, state-specific) 8. Regular compliance audits (internal + third-party) ## Domain 5: Retention & Lifecycle 1. Retention schedule by data type (contractual, regulatory, operational) 2. Automated archival pipelines (hot → warm → cold → delete) 3. Legal hold management (litigation preservation) 4. Deletion verification (confirmed purge with audit trail) 5. Storage cost optimization (tiered storage aligned to access patterns) 6. Backup and recovery testing (regular restore drills) 7. Data minimization enforcement (collect only what is needed) 8. End-of-life procedures for decommissioned systems ## Domain 6: AI/Agent Data Governance 1. Training data provenance tracking (source, consent, bias review) 2. Model input/output logging (what went in, what came out) 3. PII detection and masking in agent workflows 4. Hallucination monitoring (output accuracy validation) 5. Agent decision audit trail (explainability for automated decisions) 6. Data feedback loops (human review of agent data modifications) 7. Vendor data sharing agreements (what third-party APIs see your data) 8. Synthetic data policies (when and how to use generated data) ## Cost of Poor Governance | Risk | Average Cost | Prevention Cost | |------|-------------|-----------------| | GDPR fine | $4.3M (average 2025) | $45K-$120K/year | | Data breach | $4.88M (IBM 2025) | $60K-$200K/year | | Failed audit | $150K-$500K remediation | $30K-$80K/year | | Bad data decisions | 15-25% revenue impact | $20K-$60K/year | | AI bias incident | $2M-$50M (litigation + brand) | $25K-$75K/year | ## Remediation Priority Matrix Always fix in this order: 1. **Compliance gaps** — regulatory fines are existential 2. **Access control** — breaches destroy trust overnight 3. **AI governance** — fastest-growing risk category 4. **Data quality** — garbage in = garbage out at scale 5. **Cataloging** — you cannot govern what you cannot find 6. **Retention** — storage costs compound, legal risk accumulates ## Industry Benchmarks (2026) | Industry | Avg Governance Score | Top Quartile | Regulatory Pressure | |----------|---------------------|-------------|-------------------| | Financial Services | 68% | 85%+ | Extreme (SOX, PCI, GDPR) | | Healthcare | 62% | 80%+ | High (HIPAA, FDA, state) | | SaaS/Tech | 55% | 78%+ | Growing (SOC 2, GDPR, CCPA) | | Manufacturing | 45% | 70%+ | Moderate (ITAR, ISO) | | Retail/Ecommerce | 48% | 72%+ | Growing (PCI, CCPA, GDPR) | ## Next Steps Need a complete data governance implementation tailored to your industry? - [Calculate your AI revenue leak](https://afrexai-cto.github.io/ai-revenue-calculator/) - [Industry context packs — $47 each](https://afrexai-cto.github.io/context-packs/) - [Agent setup wizard](https://afrexai-cto.github.io/agent-setup/)
Related Skills
ercdata
Store, verify, and manage AI data on the Ethereum blockchain (Base network) using the ERCData standard. Use when an agent needs to store data fingerprints on-chain, verify data integrity, create audit trails, manage access control for private data, or interact with the ERCData smart contract. Supports public and private storage, EIP-712 verification, snapshots, and batch operations.
HIPAA Compliance for AI Agents
Generate HIPAA compliance checklists, risk assessments, and audit frameworks for healthcare organizations deploying AI agents.
Cybersecurity Risk Assessment
You are a cybersecurity risk assessment specialist. When the user needs a security audit, threat assessment, or compliance review, follow this framework.
afrexai-cybersecurity-engine
Complete cybersecurity assessment, threat modeling, and hardening system. Use when conducting security audits, threat modeling, penetration testing, incident response, or building security programs from scratch. Works with any stack — zero external dependencies.
Compliance & Audit Readiness Engine
Your AI compliance officer. Guides startups and scale-ups through SOC 2, ISO 27001, GDPR, HIPAA, and PCI DSS — from zero to audit-ready. No consultants needed.
Compliance Audit Generator
Run internal compliance audits against major frameworks without hiring a consultant.
AI Safety Audit
Comprehensive AI safety and alignment audit framework for businesses deploying AI agents. Built around the UK AI Security Institute Alignment Project standards (2026), EU AI Act requirements, and NIST AI RMF.
clickhouse-github-forensics
Query GitHub event data via ClickHouse for supply chain investigations, actor profiling, and anomaly detection. Use when investigating GitHub-based attacks, tracking repository activity, analyzing actor behavior patterns, detecting tag/release tampering, or reconstructing incident timelines from public GitHub data. Triggers on GitHub supply chain attacks, repo compromise investigations, actor attribution, tag poisoning, or "query github events".
security-guardian
Automated security auditing for OpenClaw projects. Scans for hardcoded secrets (API keys, tokens) and container vulnerabilities (CVEs) using Trivy. Provides structured reports to help maintain a clean and secure codebase.
mema-vault
Secure credential manager using AES-256 (Fernet) encryption. Stores, retrieves, and rotates secrets using a mandatory Master Key. Use for managing API keys, database credentials, and other sensitive tokens.
guardian-wall
Mitigate prompt injection attacks, especially indirect ones from external web content or files. Use this skill when processing untrusted text from the internet, user-uploaded files, or any external source to sanitize content and detect malicious instructions (e.g., "ignore previous instructions", "system override").
SX-security-audit
全方位安全审计技能。检查文件权限、环境变量、依赖漏洞、配置文件、网络端口、Git 安全、Shell 安全、macOS 安全、密钥检测等。支持 CLI 参数、JSON 输出、配置文件。当用户要求"安全检查"、"漏洞扫描"、"权限检查"、"安全审计"时使用此技能。