Excel Processor

## Overview

25 stars

byComeOnOliver

View on GitHub Installation ↓

Best use case

Excel Processor is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

## Overview

Teams using Excel Processor should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/excel-processor/SKILL.md --create-dirs "https://raw.githubusercontent.com/ComeOnOliver/skillshub/main/skills/TerminalSkills/skills/excel-processor/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/excel-processor/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How Excel Processor Compares

Feature / Agent	Excel Processor	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

## Overview

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Excel Processor

## Overview

Read, transform, analyze, and generate Excel and CSV files using Python. This skill covers data loading, cleaning, filtering, aggregation, formula generation, and export to multiple formats.

## Instructions

When a user asks you to work with spreadsheets, Excel files, or CSV data, follow these steps:

### Step 1: Load the data

```python
import pandas as pd

# For Excel files
df = pd.read_excel("data.xlsx", sheet_name=0)  # or sheet_name="Sheet1"

# For CSV files
df = pd.read_csv("data.csv")

# Show shape and preview
print(f"Shape: {df.shape[0]} rows x {df.shape[1]} columns")
print(f"Columns: {list(df.columns)}")
print(df.head())
```

Always print the shape, column names, and first few rows so the user can verify the data loaded correctly.

### Step 2: Assess data quality

```python
# Check for issues
print(f"Missing values:\n{df.isnull().sum()}")
print(f"\nDuplicates: {df.duplicated().sum()}")
print(f"\nData types:\n{df.dtypes}")
```

Report any issues found before proceeding with transformations.

### Step 3: Apply requested transformations

Common operations:

**Filtering:**
```python
filtered = df[df["status"] == "active"]
filtered = df[df["amount"] > 1000]
filtered = df[df["date"].between("2024-01-01", "2024-12-31")]
```

**Aggregation:**
```python
summary = df.groupby("category").agg(
    count=("id", "count"),
    total=("amount", "sum"),
    average=("amount", "mean")
).reset_index()
```

**Pivot tables:**
```python
pivot = df.pivot_table(
    values="revenue",
    index="region",
    columns="quarter",
    aggfunc="sum",
    margins=True
)
```

**Cleaning:**
```python
df["name"] = df["name"].str.strip().str.title()
df["email"] = df["email"].str.lower()
df["date"] = pd.to_datetime(df["date"], errors="coerce")
df = df.drop_duplicates(subset=["id"])
df = df.dropna(subset=["required_field"])
```

### Step 4: Export results

```python
# To Excel with formatting
with pd.ExcelWriter("output.xlsx", engine="openpyxl") as writer:
    df.to_excel(writer, sheet_name="Data", index=False)
    summary.to_excel(writer, sheet_name="Summary", index=False)

# To CSV
df.to_csv("output.csv", index=False)
```

### Step 5: Report what was done

Always summarize the operations performed, rows affected, and output file location.

## Examples

### Example 1: Clean and deduplicate a customer list

**User request:** "Clean up customers.xlsx -- remove duplicates, fix phone formatting, and split into active/inactive sheets"

**Actions taken:**
1. Load `customers.xlsx` (2,340 rows, 8 columns)
2. Remove 156 duplicate rows based on email
3. Standardize phone numbers to (XXX) XXX-XXXX format
4. Split into active (1,847 rows) and inactive (337 rows)
5. Export to `customers_clean.xlsx` with two sheets

**Output:**
```
Loaded: 2,340 rows x 8 columns
Removed: 156 duplicate rows (by email)
Fixed: 892 phone numbers reformatted
Split: 1,847 active, 337 inactive

Saved to customers_clean.xlsx:
  - Sheet "Active": 1,847 rows
  - Sheet "Inactive": 337 rows
```

### Example 2: Generate a monthly sales summary

**User request:** "Create a pivot table from sales.csv showing revenue by region and month"

**Actions taken:**
1. Load `sales.csv` (15,200 rows)
2. Parse date column, extract month
3. Build pivot table: regions as rows, months as columns, sum of revenue
4. Add row and column totals
5. Export to `sales_summary.xlsx`

**Output:**
```
| Region    | Jan      | Feb      | Mar      | Total     |
|-----------|----------|----------|----------|-----------|
| North     | $45,200  | $52,100  | $48,900  | $146,200  |
| South     | $38,700  | $41,300  | $44,600  | $124,600  |
| East      | $51,900  | $49,800  | $55,200  | $156,900  |
| West      | $42,100  | $46,700  | $43,500  | $132,300  |
| Total     | $177,900 | $189,900 | $192,200 | $560,000  |

Saved to sales_summary.xlsx
```

## Guidelines

- Always show data shape and column names after loading so the user can verify correctness.
- When dealing with dates, explicitly parse them with `pd.to_datetime()` and specify the format when ambiguous (e.g., is 01/02/03 Jan 2 or Feb 1?).
- For large files (100k+ rows), warn about memory and suggest chunked processing.
- Preserve original files. Write output to new files unless the user explicitly asks to overwrite.
- When merging multiple files, check that column names match and report any discrepancies before proceeding.
- If the user asks for "formulas", generate the Excel formula strings (e.g., `=SUM(B2:B100)`) rather than computing values, so the spreadsheet stays dynamic.
- Default to UTF-8 encoding for CSV. If the data contains special characters, test with `encoding="utf-8-sig"` for Excel compatibility.

Related Skills

kafka-stream-processor

from ComeOnOliver/skillshub

Kafka Stream Processor - Auto-activating skill for Data Pipelines. Triggers on: kafka stream processor, kafka stream processor Part of the Data Pipelines skill category.

excel-variance-analyzer

from ComeOnOliver/skillshub

Analyze budget vs actual variances in Excel with drill-down and root cause analysis. Use when performing variance analysis or explaining budget differences. Trigger with phrases like 'excel variance', 'analyze budget variance', 'actual vs budget'.

excel-pivot-wizard

from ComeOnOliver/skillshub

Create advanced Excel pivot tables with calculated fields and slicers. Use when building data summaries or creating interactive dashboards. Trigger with phrases like 'excel pivot', 'create pivot table', 'data summary'.

excel-macro-creator

from ComeOnOliver/skillshub

Excel Macro Creator - Auto-activating skill for Business Automation. Triggers on: excel macro creator, excel macro creator Part of the Business Automation skill category.

excel-lbo-modeler

from ComeOnOliver/skillshub

Build leveraged buyout (LBO) models in Excel with debt schedules and IRR analysis. Use when structuring LBO transactions or analyzing PE returns. Trigger with phrases like 'excel lbo', 'build lbo model', 'calculate pe returns'.

excel-formula-generator

from ComeOnOliver/skillshub

Excel Formula Generator - Auto-activating skill for Business Automation. Triggers on: excel formula generator, excel formula generator Part of the Business Automation skill category.

excel-dcf-modeler

from ComeOnOliver/skillshub

Build discounted cash flow (DCF) valuation models in Excel. Use when creating DCF models, calculating enterprise value, or valuing companies. Trigger with phrases like 'excel dcf', 'build dcf model', 'calculate enterprise value'.

batch-file-processor

from ComeOnOliver/skillshub

Batch File Processor - Auto-activating skill for Business Automation. Triggers on: batch file processor, batch file processor Part of the Business Automation skill category.

data-processor

from ComeOnOliver/skillshub

Process and validate data inputs

article-list-processor

from ComeOnOliver/skillshub

读取包含文章列表的 Markdown 文件，自动抓取原文内容并生成爆款文案。

skywork-excel

from ComeOnOliver/skillshub

Excel generator with AI-powered data analysis, charts, formulas, and web search. Create spreadsheets, analyze CSV/Excel/PDF files, generate HTML reports, and get real-time data.

code-review-excellence

from ComeOnOliver/skillshub

Master effective code review practices to provide constructive feedback, catch bugs early, and foster knowledge sharing while maintaining team morale. Use when reviewing pull requests, establishing review standards, or mentoring developers.