csv-data-summarizer
Analyzes CSV files, generates summary stats, and plots quick visualizations using Python and pandas.
Best use case
csv-data-summarizer is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Analyzes CSV files, generates summary stats, and plots quick visualizations using Python and pandas.
Teams using csv-data-summarizer should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/csv-data-summarizer/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How csv-data-summarizer Compares
| Feature / Agent | csv-data-summarizer | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Analyzes CSV files, generates summary stats, and plots quick visualizations using Python and pandas.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# CSV Data Summarizer This Skill analyzes CSV files and provides comprehensive summaries with statistical insights and visualizations. ## When to Use This Skill Claude should use this Skill whenever the user: - Uploads or references a CSV file - Asks to summarize, analyze, or visualize tabular data - Requests insights from CSV data - Wants to understand data structure and quality ## How It Works ## ⚠️ CRITICAL BEHAVIOR REQUIREMENT ⚠️ **DO NOT ASK THE USER WHAT THEY WANT TO DO WITH THE DATA.** **DO NOT OFFER OPTIONS OR CHOICES.** **DO NOT SAY "What would you like me to help you with?"** **DO NOT LIST POSSIBLE ANALYSES.** **IMMEDIATELY AND AUTOMATICALLY:** 1. Run the comprehensive analysis 2. Generate ALL relevant visualizations 3. Present complete results 4. NO questions, NO options, NO waiting for user input **THE USER WANTS A FULL ANALYSIS RIGHT AWAY - JUST DO IT.** ### Automatic Analysis Steps: **The skill intelligently adapts to different data types and industries by inspecting the data first, then determining what analyses are most relevant.** 1. **Load and inspect** the CSV file into pandas DataFrame 2. **Identify data structure** - column types, date columns, numeric columns, categories 3. **Determine relevant analyses** based on what's actually in the data: - **Sales/E-commerce data** (order dates, revenue, products): Time-series trends, revenue analysis, product performance - **Customer data** (demographics, segments, regions): Distribution analysis, segmentation, geographic patterns - **Financial data** (transactions, amounts, dates): Trend analysis, statistical summaries, correlations - **Operational data** (timestamps, metrics, status): Time-series, performance metrics, distributions - **Survey data** (categorical responses, ratings): Frequency analysis, cross-tabulations, distributions - **Generic tabular data**: Adapts based on column types found 4. **Only create visualizations that make sense** for the specific dataset: - Time-series plots ONLY if date/timestamp columns exist - Correlation heatmaps ONLY if multiple numeric columns exist - Category distributions ONLY if categorical columns exist - Histograms for numeric distributions when relevant 5. **Generate comprehensive output** automatically including: - Data overview (rows, columns, types) - Key statistics and metrics relevant to the data type - Missing data analysis - Multiple relevant visualizations (only those that apply) - Actionable insights based on patterns found in THIS specific dataset 6. **Present everything** in one complete analysis - no follow-up questions **Example adaptations:** - Healthcare data with patient IDs → Focus on demographics, treatment patterns, temporal trends - Inventory data with stock levels → Focus on quantity distributions, reorder patterns, SKU analysis - Web analytics with timestamps → Focus on traffic patterns, conversion metrics, time-of-day analysis - Survey responses → Focus on response distributions, demographic breakdowns, sentiment patterns ### Behavior Guidelines ✅ **CORRECT APPROACH - SAY THIS:** - "I'll analyze this data comprehensively right now." - "Here's the complete analysis with visualizations:" - "I've identified this as [type] data and generated relevant insights:" - Then IMMEDIATELY show the full analysis ✅ **DO:** - Immediately run the analysis script - Generate ALL relevant charts automatically - Provide complete insights without being asked - Be thorough and complete in first response - Act decisively without asking permission ❌ **NEVER SAY THESE PHRASES:** - "What would you like to do with this data?" - "What would you like me to help you with?" - "Here are some common options:" - "Let me know what you'd like help with" - "I can create a comprehensive analysis if you'd like!" - Any sentence ending with "?" asking for user direction - Any list of options or choices - Any conditional "I can do X if you want" ❌ **FORBIDDEN BEHAVIORS:** - Asking what the user wants - Listing options for the user to choose from - Waiting for user direction before analyzing - Providing partial analysis that requires follow-up - Describing what you COULD do instead of DOING it ### Usage The Skill provides a Python function `summarize_csv(file_path)` that: - Accepts a path to a CSV file - Returns a comprehensive text summary with statistics - Generates multiple visualizations automatically based on data structure ### Example Prompts > "Here's `sales_data.csv`. Can you summarize this file?" > "Analyze this customer data CSV and show me trends." > "What insights can you find in `orders.csv`?" ### Example Output **Dataset Overview** - 5,000 rows × 8 columns - 3 numeric columns, 1 date column **Summary Statistics** - Average order value: $58.2 - Standard deviation: $12.4 - Missing values: 2% (100 cells) **Insights** - Sales show upward trend over time - Peak activity in Q4 *(Attached: trend plot)* ## Files - `analyze.py` - Core analysis logic - `requirements.txt` - Python dependencies - `resources/sample.csv` - Example dataset for testing - `resources/README.md` - Additional documentation ## Notes - Automatically detects date columns (columns containing 'date' in name) - Handles missing data gracefully - Generates visualizations only when date columns are present - All numeric columns are included in statistical summary
Related Skills
usfiscaldata
Query the U.S. Treasury Fiscal Data API for federal financial data including national debt, government spending, revenue, interest rates, exchange rates, and savings bonds. Access 54 datasets and 182 data tables with no API key required. Use when working with U.S. federal fiscal data, national debt tracking (Debt to the Penny), Daily Treasury Statements, Monthly Treasury Statements, Treasury securities auctions, interest rates on Treasury securities, foreign exchange rates, savings bonds, or any U.S. government financial statistics.
theme-factory
Toolkit for styling artifacts with a theme. These artifacts can be slides, docs, reportings, HTML landing pages, etc. There are 10 pre-set themes with colors/fonts that you can apply to any artifact that has been creating, or can generate a new theme on-the-fly.
slack-gif-creator
Toolkit for creating animated GIFs optimized for Slack, with validators for size constraints and composable animation primitives. This skill applies when users request animated GIFs or emoji animations for Slack from descriptions like "make me a GIF for Slack of X doing Y".
remotion-best-practices
Best practices for Remotion - Video creation in React
image-enhancer
Improves the quality of images, especially screenshots, by enhancing resolution, sharpness, and clarity. Perfect for preparing images for presentations, documentation, or social media posts.
canvas-design
Create beautiful visual art in .png and .pdf documents using design philosophy. You should use this skill when the user asks to create a poster, piece of art, design, or other static piece. Create original visual designs, never copying existing artists' work to avoid copyright violations.
algorithmic-art
Creating algorithmic art using p5.js with seeded randomness and interactive parameter exploration. Use this when users request creating art using code, generative art, algorithmic art, flow fields, or particle systems. Create original algorithmic art rather than copying existing artists' work to avoid copyright violations.
raffle-winner-picker
Picks random winners from lists, spreadsheets, or Google Sheets for giveaways, raffles, and contests. Ensures fair, unbiased selection with transparency.
nlm-skill
Expert guide for the NotebookLM CLI (`nlm`) and MCP server - interfaces for Google NotebookLM. Use this skill when users want to interact with NotebookLM programmatically, including: creating/managing notebooks, adding sources (URLs, YouTube, text, Google Drive), generating content (podcasts, reports, quizzes, flashcards, mind maps, slides, infographics, videos, data tables), conducting research, chatting with sources, or automating NotebookLM workflows. Triggers on mentions of "nlm", "notebooklm", "notebook lm", "podcast generation", "audio overview", or any NotebookLM-related automation task.
md-to-pdf
Use this skill when users want to convert Markdown files to PDF. Handles workflows like "Convert this markdown to PDF", "转换为PDF", "批量转换MD文件". Supports single file and batch directory conversion with excellent CJK (Chinese) font support, image embedding, and clean typography.
markdown-to-epub-converter
Convert markdown documents and chat summaries into formatted EPUB ebook files that can be read on any device or uploaded to Kindle.
xlsx
Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas