social-science-research

Orchestrates a social science research workflow from literature review through data collection, text analysis, statistical modeling, and report generation. Use when conducting empirical social science research, policy analysis, or mixed-methods studies. NOT for pure natural science analysis or clinical trial data.

564 stars

Best use case

social-science-research is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Orchestrates a social science research workflow from literature review through data collection, text analysis, statistical modeling, and report generation. Use when conducting empirical social science research, policy analysis, or mixed-methods studies. NOT for pure natural science analysis or clinical trial data.

Teams using social-science-research should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/social-science-research/SKILL.md --create-dirs "https://raw.githubusercontent.com/beita6969/ScienceClaw/main/skills/social-science-research/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/social-science-research/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How social-science-research Compares

Feature / Agentsocial-science-researchStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Orchestrates a social science research workflow from literature review through data collection, text analysis, statistical modeling, and report generation. Use when conducting empirical social science research, policy analysis, or mixed-methods studies. NOT for pure natural science analysis or clinical trial data.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

SKILL.md Source

# Social Science Research (Meta Skill)

This meta-skill coordinates a complete social science research pipeline by
integrating literature discovery, data collection, natural language processing,
statistical analysis, and academic report writing. It combines four specialized
skills to support rigorous empirical research across economics, political
science, sociology, and related disciplines.

## Workflow

### Step 1: Literature Review and Theoretical Framing

Search SSRN and CrossRef for relevant academic papers on the research topic:
- Keyword and author-based searches across working paper repositories
- Citation network exploration to identify seminal and recent contributions
- Extraction of theoretical frameworks, hypotheses, and methodological approaches
- Identification of gaps in existing literature that motivate the study
- Construction of an annotated bibliography with key findings per source

### Step 2: Data Collection and Preparation

Gather quantitative data from established sources:
- **World Bank**: Development indicators, governance metrics, trade data
- **Census/Survey**: Demographic, labor market, household-level data
- **Custom datasets**: User-provided CSV, Excel, or API-sourced data
- Data cleaning: handle missing values, outliers, encoding inconsistencies
- Variable construction: create indices, interaction terms, lagged variables
- Descriptive statistics: summary tables, distributions, correlation matrices

### Step 3: Qualitative Text Analysis

Apply spaCy NLP tools to analyze qualitative or textual data sources:
- Named entity recognition to extract people, organizations, locations
- Document classification by topic, sentiment, or policy domain
- Keyword extraction and frequency analysis across corpora
- Relationship extraction between entities in policy documents
- Coding assistance: map text segments to predefined thematic categories

### Step 4: Statistical Analysis and Causal Inference

Use statsmodels for rigorous quantitative analysis:
- **Regression**: OLS, logistic, Poisson, negative binomial models
- **Panel data**: Fixed effects, random effects, difference-in-differences
- **Causal inference**: Instrumental variables, regression discontinuity, propensity score matching
- **Time series**: ARIMA, VAR, cointegration analysis
- **Robustness**: Alternative specifications, placebo tests, sensitivity analysis
- Report standard errors appropriate to the data structure (clustered, HAC)

### Step 5: Visualization and Figure Preparation

Generate publication-quality figures with matplotlib:
- Coefficient plots with confidence intervals
- Time series trend charts with event markers
- Geographic maps for spatial data (choropleth, point maps)
- Distribution comparisons (kernel density, box plots, violin plots)
- Regression diagnostic plots (residuals, Q-Q, leverage)

### Step 6: Academic Report Generation

Compile findings into a structured academic report:
- Abstract summarizing research question, method, and key findings
- Introduction with literature context and contribution statement
- Data section with source descriptions and summary statistics
- Methodology section with model specifications and identification strategy
- Results with tables, figures, and interpretation
- Discussion of limitations, policy implications, and future directions
- Properly formatted references and appendices

## Integration Points

- **ssrn-econpapers** -- Working paper search, citation discovery, literature mapping
- **world-bank-data** -- Development indicators, cross-country panel data, time series
- **statsmodels-stats** -- Regression analysis, panel methods, causal inference, diagnostics
- **spacy-nlp** -- Entity recognition, text classification, keyword extraction, coding support

## Output Formats

- **Literature table**: Paper title, authors, year, method, key finding, relevance score
- **Data summary**: Variable descriptions, summary statistics, sample sizes
- **Regression table**: Coefficients, standard errors, significance, R-squared, diagnostics
- **Figures**: Publication-ready plots with labeled axes, legends, and annotations
- **Report draft**: Structured academic document with sections and citations

## Best Practices

1. Define the research question and identification strategy before touching data
2. Pre-register hypotheses when conducting confirmatory analysis
3. Use consistent variable definitions across all analyses
4. Report all specifications tested, not only those yielding significant results
5. Cluster standard errors at the appropriate level of treatment assignment
6. Include balance tests and pre-trend checks for quasi-experimental designs
7. Triangulate quantitative findings with qualitative evidence where possible
8. Use multiple imputation or bounds analysis for missing data sensitivity
9. Follow disciplinary reporting standards (APA, AER, APSR as appropriate)
10. Clearly distinguish correlation from causation in all interpretive sections

Related Skills

social-science-analysis

564
from beita6969/ScienceClaw

Social science research methods including survey design, qualitative analysis, content analysis, network analysis, psychometrics, and mixed methods. Covers sociology, psychology, political science, education, and communication studies. Use when user designs surveys, analyzes qualitative data, does content analysis, builds scales, or uses mixed methods. Triggers on "survey design", "qualitative analysis", "content analysis", "Likert scale", "thematic analysis", "grounded theory", "factor analysis", "SEM", "structural equation", "psychometrics", "interview coding".

scienceclaw-verification

564
from beita6969/ScienceClaw

Verify scientific claims, check calculations, validate experimental designs, and fact-check citations. Use when: (1) checking a claim against evidence, (2) validating statistical analyses, (3) verifying experimental reproducibility claims, (4) fact-checking references, (5) adversarial review of research. NOT for: generating new content (use scienceclaw-generation), simple QA (use scienceclaw-qa).

scienceclaw-summarization

564
from beita6969/ScienceClaw

Summarize scientific papers, datasets, experimental results, and literature reviews. Use when: (1) condensing research papers, (2) creating literature reviews, (3) summarizing experimental findings, (4) meta-analysis synthesis, (5) creating executive summaries of research. NOT for: information extraction (use scienceclaw-ie), full paper retrieval (use scienceclaw-retrieval), or writing new content (use scienceclaw-generation).

scienceclaw-retrieval

564
from beita6969/ScienceClaw

Retrieve scientific information from databases, literature, and knowledge bases. Use when: (1) finding relevant papers, (2) querying scientific databases, (3) cross-referencing findings, (4) building bibliographies, (5) systematic literature search. NOT for: answering questions (use scienceclaw-qa), summarizing (use scienceclaw-summarization), or data analysis (use code-execution skill).

scienceclaw-reasoning

564
from beita6969/ScienceClaw

Perform multi-step scientific reasoning, proof construction, causal inference, and logical argumentation. Use when: (1) deriving conclusions from premises, (2) causal analysis, (3) mathematical proofs, (4) hypothesis evaluation, (5) counterfactual reasoning. NOT for: simple factual questions (use scienceclaw-qa), data analysis (use code-execution), or literature search (use scienceclaw-retrieval).

scienceclaw-qa

564
from beita6969/ScienceClaw

Answer scientific questions across all disciplines with evidence-based responses and citations. Use when: (1) user asks factual science questions, (2) needs explanation of concepts/theories/methods, (3) multi-step scientific reasoning needed. Covers natural sciences (physics, chemistry, biology, medicine, materials, astronomy, earth science, math, CS) and social sciences (economics, sociology, psychology, political science, linguistics, history, law, philosophy, education). NOT for: opinion-based questions, non-scientific queries, or when code execution is needed (use code-execution skill).

scienceclaw-prediction

564
from beita6969/ScienceClaw

Predict scientific properties, trends, and outcomes. Use when: user asks for property prediction, trend forecasting, or model-based estimation. NOT for: historical data lookup or real-time monitoring.

scienceclaw-ie

564
from beita6969/ScienceClaw

Extract structured information from scientific texts: entities, relations, data tables, methods, results. Use when: (1) parsing papers for key data, (2) extracting experimental parameters, (3) building knowledge graphs from literature, (4) NER on scientific documents, (5) extracting methods/results sections. NOT for: summarization (use scienceclaw-summarization), full text retrieval (use scienceclaw-retrieval).

scienceclaw-generation

564
from beita6969/ScienceClaw

Generate scientific hypotheses, experimental designs, and paper drafts. Use when: user asks to propose hypotheses, design experiments, or write scientific content. NOT for: data analysis or literature search.

scienceclaw-discovery

564
from beita6969/ScienceClaw

Identify research gaps, synthesize cross-disciplinary insights, and generate novel hypotheses. Use when: user asks about unexplored areas, cross-field connections, or new research directions. NOT for: routine literature review or data analysis.

scienceclaw-classification

564
from beita6969/ScienceClaw

Classify scientific content by discipline, methodology, topic, and quality. Use when: user asks to categorize papers, methods, or research outputs. NOT for: simple keyword tagging or non-scientific content.

science-communication

564
from beita6969/ScienceClaw

No description provided.