multiAI Summary Pending

data-exploration

Systematic database and table profiling for DBX Studio. Use when a user wants to understand their data, explore schema structure, or profile a dataset.

231 stars

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/data-exploration/SKILL.md --create-dirs "https://raw.githubusercontent.com/aiskillstore/marketplace/main/skills/dbxstudio/data-exploration/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/data-exploration/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How data-exploration Compares

Feature / Agentdata-explorationStandard Approach
Platform SupportmultiLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Systematic database and table profiling for DBX Studio. Use when a user wants to understand their data, explore schema structure, or profile a dataset.

Which AI agents support this skill?

This skill is compatible with multi.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Data Exploration — DBX Studio

## Exploration Workflow

### Phase 1: Schema Discovery
Start with `read_schema` to list all tables, then `describe_table` for each table of interest.

```
1. read_schema(schema_name: "public")
2. describe_table(table_name: "<each table>")
3. get_table_stats(table_name: "<table>")
```

### Phase 2: Table Profiling
For each table, gather:
- Row count
- Column names and types
- Sample data via `get_table_data`
- Null counts and distributions

### Phase 3: Relationship Discovery
Look for foreign key patterns:
- Columns named `*_id` linking to other tables
- Common join patterns: `users.id → orders.user_id`

## Quality Scoring

| Score | Completeness |
|-------|-------------|
| Green | > 95% populated |
| Yellow | 80–95% populated |
| Orange | 50–80% populated |
| Red | < 50% populated |

## Common Exploration Queries

### Row count
```sql
SELECT COUNT(*) AS row_count FROM "public"."table_name";
```

### Column null rates
```sql
SELECT
  COUNT(*) AS total,
  COUNT(column_name) AS non_null,
  ROUND(100.0 * COUNT(column_name) / COUNT(*), 2) AS pct_filled
FROM "public"."table_name";
```

### Distinct values
```sql
SELECT column_name, COUNT(*) AS frequency
FROM "public"."table_name"
GROUP BY 1
ORDER BY 2 DESC
LIMIT 20;
```

### Date range
```sql
SELECT MIN(created_at), MAX(created_at) FROM "public"."table_name";
```

## Output Format
After exploration, present a structured summary:
- **Tables**: list with row counts
- **Key relationships**: how tables connect
- **Data quality flags**: any columns with high null rates
- **Suggested next queries**: what the user might want to know next