Data Analyzer

Interpret CSV, JSON, and structured data, extract insights, identify patterns, and recommend appropriate visualizations.

3,891 stars

Best use case

Data Analyzer is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Interpret CSV, JSON, and structured data, extract insights, identify patterns, and recommend appropriate visualizations.

Teams using Data Analyzer should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/qf-data-analyzer/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/371166758-qq/qf-data-analyzer/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/qf-data-analyzer/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How Data Analyzer Compares

Feature / AgentData AnalyzerStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Interpret CSV, JSON, and structured data, extract insights, identify patterns, and recommend appropriate visualizations.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

SKILL.md Source

# Data Analyzer

Interpret CSV, JSON, and structured data, extract insights, identify patterns, and recommend appropriate visualizations.

## Description

This skill provides a systematic approach to data analysis for tabular and structured data. It covers data profiling, statistical summary, trend identification, anomaly detection, and visualization recommendations. Designed for users who have data but need help understanding what it means and how to present it effectively.

## When to Use

- Analyzing CSV or JSON data files
- Understanding trends in sales, traffic, survey, or any time-series data
- Comparing groups, segments, or categories
- Identifying outliers or anomalies in datasets
- Preparing data insights for reports or presentations
- Recommending chart types for specific data stories

## Instructions

### Step 1: Data Profiling

Before analysis, profile the data:

```
Dataset Profile:
- Rows: [count]
- Columns: [count]
- Column types: [list each column with type: numeric, categorical, date, text]
- Missing values: [percentage per column]
- Date range: [if applicable]
- Key metrics summary: [mean, median, min, max for numeric columns]
```

Identify data quality issues:
- Missing values (>5% in any column needs attention)
- Duplicates
- Inconsistent formats (dates, categories, units)
- Outliers that might be errors vs. genuine extreme values

### Step 2: Analysis Framework

Apply the appropriate analysis based on data type and question:

#### For Time-Series Data (dates + values)

1. **Trend Analysis**: Is the metric growing, declining, or stable?
   - Calculate period-over-period change (MoM, YoY)
   - Identify inflection points (where trend direction changes)
   
2. **Seasonality**: Are there recurring patterns?
   - Weekly, monthly, or quarterly cycles
   - Compare same period across different years

3. **Anomaly Detection**: Any unexpected spikes or drops?
   - Flag values >2 standard deviations from the mean
   - Check if anomalies correlate with known events

#### For Categorical Comparisons

1. **Ranking**: Which categories lead/lag?
   - Sort by value
   - Calculate percentage of total for each category

2. **Distribution**: How are values spread?
   - Identify concentration (is 80% of value in 20% of categories?)

3. **Correlation**: Do categories relate?
   - Cross-tabulation between two categorical variables

#### For Numeric Relationships

1. **Correlation**: Do two metrics move together?
   - Note correlation direction and approximate strength
   - Caution: correlation ≠ causation — always state this

2. **Segmentation**: How do metrics differ across groups?
   - Compare averages/medians across segments

### Step 3: Insight Extraction

Structure findings as:

```
## Key Findings

1. **[Finding title]**
   - What: Specific observation with numbers
   - So what: Business impact or implication
   - Action: Recommended next step

2. **[Finding title]**
   - ...
```

**Prioritize insights by:**
- Impact: How much does this matter?
- Surprise: How unexpected is this?
- Actionability: Can the user do something about it?

### Step 4: Visualization Recommendations

For each finding, recommend the best chart type:

| Story You Want to Tell | Best Chart Type | When to Use |
|----------------------|----------------|-------------|
| Change over time | Line chart | Time-series with 5+ data points |
| Compare categories | Bar chart | Up to 10 categories |
| Show composition | Stacked bar / Pie | Parts of a whole (pie for ≤5 slices) |
| Show distribution | Histogram / Box plot | Numeric data distribution |
| Show relationship | Scatter plot | Two numeric variables |
| Compare metrics | Combined chart | Two different scales on same timeline |
| Show progress | Bullet / Gauge | Current vs. target |
| Geographic data | Map / Choropleth | Location-based data |
| Funnel / conversion | Funnel chart | Sequential stage drop-off |

**Specify for each recommendation:**
- Chart type
- X-axis, Y-axis, and data series
- Key formatting notes (color coding, annotations)
- Alternative if the recommended type isn't available

### Output Format

```markdown
## Data Analysis Report

### Dataset Overview
[Profile summary]

### Key Findings
1. [Finding with data, impact, action]
2. [Finding]
3. [Finding]

### Anomalies & Notes
[Any unusual patterns or data quality concerns]

### Recommended Visualizations
1. [Chart type + description]
2. [Chart type + description]

### Data Quality
[Missing values, inconsistencies, recommendations]
```

## Examples

**Input**: A CSV with columns [date, product_category, revenue, units_sold]

**Analysis Output**:
```
Key Findings:
1. Electronics accounts for 42% of revenue but only 18% of units → highest ASP
   Action: Investigate if premium electronics strategy is sustainable
   
2. Revenue dipped 23% in February across all categories → likely seasonal
   Note: Same pattern in previous year, but 2025 dip is steeper
   
3. "Home & Garden" shows 67% growth YoY → emerging category
   Action: Increase inventory allocation for Q3

Recommended Visualizations:
1. Stacked area chart: Revenue by category over time (shows composition + trend)
2. Bar chart: Revenue vs. units by category (highlights ASP differences)
3. Line chart with YoY comparison: Total revenue, 2024 vs 2025
```

## Tips

- Always start with "What question is this data supposed to answer?" — if unclear, state assumptions
- Provide numbers, not just qualitative descriptions ("Revenue grew" → "Revenue grew 15% from $1.2M to $1.38M")
- Note confidence level: with 10 data points, trends are suggestive; with 1000, they're reliable
- Recommend tools: Excel/Google Sheets for quick analysis, Python (pandas + matplotlib) for complex datasets, Tableau/Power BI for interactive dashboards
- If the dataset is too large to paste entirely, analyze a sample and note the limitation

Related Skills

Profit Margin Analyzer

3891
from openclaw/skills

Analyze and optimize profit margins across your business. Identifies margin compression, pricing opportunities, and cost levers.

Pricing Strategy Analyzer

3891
from openclaw/skills

Analyze and optimize pricing for any product or service. Covers value-based, cost-plus, competitive, and tiered pricing models.

Business Strategy & Growth

Portfolio Risk Analyzer

3891
from openclaw/skills

Complete investment portfolio risk management system. Analyze positions, calculate risk metrics, stress test scenarios, optimize allocations, and generate institutional-grade risk reports — all without external APIs.

Finance & Investing

Commercial Lease Analyzer

3891
from openclaw/skills

Analyze commercial leases (office, retail, industrial, warehouse) for hidden costs, unfavorable terms, and negotiation leverage. Use when reviewing a new lease, renegotiating a renewal, or comparing multiple lease options.

Business Analysis

Franchise Operations Analyzer

3891
from openclaw/skills

Evaluate franchise opportunities and manage multi-unit operations with data-driven frameworks.

Business & Finance

Financial Due Diligence Analyzer

3891
from openclaw/skills

Run comprehensive financial due diligence on acquisition targets, investment opportunities, or partnership prospects. Built for PE firms, corporate development teams, and founders evaluating deals.

Financial Analysis & Due Diligence

Employee Retention & Turnover Risk Analyzer

3891
from openclaw/skills

Diagnose why people leave. Fix it before they do.

HR & People Analytics

Database Engineering Mastery

3891
from openclaw/skills

> Complete database design, optimization, migration, and operations system. From schema design to production monitoring — covers PostgreSQL, MySQL, SQLite, and general SQL patterns.

Coding & Development

Data Room Builder

3891
from openclaw/skills

Build a structured virtual data room checklist and folder hierarchy for fundraising, M&A, or due diligence.

Workflow & Productivity

Data Privacy & Protection Program

3891
from openclaw/skills

You are a **Data Privacy Officer (DPO) agent** — a comprehensive privacy program architect. You help organizations build, operate, and mature privacy programs that comply with global regulations (GDPR, CCPA/CPRA, LGPD, PIPEDA, POPIA, APPI, PDPA) while enabling business growth.

Privacy & Compliance

Data Migration Planner

3891
from openclaw/skills

Plan, execute, and validate data migrations between systems. Covers schema mapping, ETL pipeline design, rollback strategies, and post-migration validation.

Workflow & Productivity

Data Governance Framework

3891
from openclaw/skills

Assess, score, and remediate your organization's data governance posture across 6 domains.

Security