hypothesis-generation
Generate testable hypotheses. Formulate from observations, design experiments, explore competing explanations, develop predictions, propose mechanisms, for scientific inquiry across domains.
Best use case
hypothesis-generation is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Generate testable hypotheses. Formulate from observations, design experiments, explore competing explanations, develop predictions, propose mechanisms, for scientific inquiry across domains.
Teams using hypothesis-generation should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/hypothesis-generation/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How hypothesis-generation Compares
| Feature / Agent | hypothesis-generation | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Generate testable hypotheses. Formulate from observations, design experiments, explore competing explanations, develop predictions, propose mechanisms, for scientific inquiry across domains.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Scientific Hypothesis Generation ## Overview Hypothesis generation is a systematic process for developing testable explanations. Formulate evidence-based hypotheses from observations, design experiments, explore competing explanations, and develop predictions. Apply this skill for scientific inquiry across domains. ## When to Use This Skill This skill should be used when: - Developing hypotheses from observations or preliminary data - Designing experiments to test scientific questions - Exploring competing explanations for phenomena - Formulating testable predictions for research - Conducting literature-based hypothesis generation - Planning mechanistic studies across scientific domains ## Workflow Follow this systematic process to generate robust scientific hypotheses: ### 1. Understand the Phenomenon Start by clarifying the observation, question, or phenomenon that requires explanation: - Identify the core observation or pattern that needs explanation - Define the scope and boundaries of the phenomenon - Note any constraints or specific contexts - Clarify what is already known vs. what is uncertain - Identify the relevant scientific domain(s) ### 2. Conduct Comprehensive Literature Search Search existing scientific literature to ground hypotheses in current evidence. Use both PubMed (for biomedical topics) and general web search (for broader scientific domains): **For biomedical topics:** - Use WebFetch with PubMed URLs to access relevant literature - Search for recent reviews, meta-analyses, and primary research - Look for similar phenomena, related mechanisms, or analogous systems **For all scientific domains:** - Use WebSearch to find recent papers, preprints, and reviews - Search for established theories, mechanisms, or frameworks - Identify gaps in current understanding **Search strategy:** - Begin with broad searches to understand the landscape - Narrow to specific mechanisms, pathways, or theories - Look for contradictory findings or unresolved debates - Consult `references/literature_search_strategies.md` for detailed search techniques ### 3. Synthesize Existing Evidence Analyze and integrate findings from literature search: - Summarize current understanding of the phenomenon - Identify established mechanisms or theories that may apply - Note conflicting evidence or alternative viewpoints - Recognize gaps, limitations, or unanswered questions - Identify analogies from related systems or domains ### 4. Generate Competing Hypotheses Develop 3-5 distinct hypotheses that could explain the phenomenon. Each hypothesis should: - Provide a mechanistic explanation (not just description) - Be distinguishable from other hypotheses - Draw on evidence from the literature synthesis - Consider different levels of explanation (molecular, cellular, systemic, population, etc.) **Strategies for generating hypotheses:** - Apply known mechanisms from analogous systems - Consider multiple causative pathways - Explore different scales of explanation - Question assumptions in existing explanations - Combine mechanisms in novel ways ### 5. Evaluate Hypothesis Quality Assess each hypothesis against established quality criteria from `references/hypothesis_quality_criteria.md`: **Testability:** Can the hypothesis be empirically tested? **Falsifiability:** What observations would disprove it? **Parsimony:** Is it the simplest explanation that fits the evidence? **Explanatory Power:** How much of the phenomenon does it explain? **Scope:** What range of observations does it cover? **Consistency:** Does it align with established principles? **Novelty:** Does it offer new insights beyond existing explanations? Explicitly note the strengths and weaknesses of each hypothesis. ### 6. Design Experimental Tests For each viable hypothesis, propose specific experiments or studies to test it. Consult `references/experimental_design_patterns.md` for common approaches: **Experimental design elements:** - What would be measured or observed? - What comparisons or controls are needed? - What methods or techniques would be used? - What sample sizes or statistical approaches are appropriate? - What are potential confounds and how to address them? **Consider multiple approaches:** - Laboratory experiments (in vitro, in vivo, computational) - Observational studies (cross-sectional, longitudinal, case-control) - Clinical trials (if applicable) - Natural experiments or quasi-experimental designs ### 7. Formulate Testable Predictions For each hypothesis, generate specific, quantitative predictions: - State what should be observed if the hypothesis is correct - Specify expected direction and magnitude of effects when possible - Identify conditions under which predictions should hold - Distinguish predictions between competing hypotheses - Note predictions that would falsify the hypothesis ### 8. Present Structured Output Use the template in `assets/hypothesis_output_template.md` to present hypotheses in a clear, consistent format: **Standard structure:** 1. **Background & Context** - Phenomenon and literature summary 2. **Competing Hypotheses** - Enumerated hypotheses with mechanistic explanations 3. **Quality Assessment** - Evaluation of each hypothesis 4. **Experimental Designs** - Proposed tests for each hypothesis 5. **Testable Predictions** - Specific, measurable predictions 6. **Critical Comparisons** - How to distinguish between hypotheses ## Quality Standards Ensure all generated hypotheses meet these standards: - **Evidence-based:** Grounded in existing literature with citations - **Testable:** Include specific, measurable predictions - **Mechanistic:** Explain how/why, not just what - **Comprehensive:** Consider alternative explanations - **Rigorous:** Include experimental designs to test predictions ## Resources ### references/ - `hypothesis_quality_criteria.md` - Framework for evaluating hypothesis quality (testability, falsifiability, parsimony, explanatory power, scope, consistency) - `experimental_design_patterns.md` - Common experimental approaches across domains (RCTs, observational studies, lab experiments, computational models) - `literature_search_strategies.md` - Effective search techniques for PubMed and general scientific sources ### assets/ - `hypothesis_output_template.md` - Structured format for presenting hypotheses consistently with all required sections
Related Skills
zinc-database
Access ZINC (230M+ purchasable compounds). Search by ZINC ID/SMILES, similarity searches, 3D-ready structures for docking, analog discovery, for virtual screening and drug discovery.
zarr-python
Chunked N-D arrays for cloud storage. Compressed arrays, parallel I/O, S3/GCS integration, NumPy/Dask/Xarray compatible, for large-scale scientific computing pipelines.
vaex
Use this skill for processing and analyzing large tabular datasets (billions of rows) that exceed available RAM. Vaex excels at out-of-core DataFrame operations, lazy evaluation, fast aggregations, efficient visualization of big data, and machine learning on large datasets. Apply when users need to work with large CSV/HDF5/Arrow/Parquet files, perform fast statistics on massive datasets, create visualizations of big data, or build ML pipelines that don't fit in memory.
uspto-database
Access USPTO APIs for patent/trademark searches, examination history (PEDS), assignments, citations, office actions, TSDR, for IP analysis and prior art searches.
uniprot-database
Direct REST API access to UniProt. Protein searches, FASTA retrieval, ID mapping, Swiss-Prot/TrEMBL. For Python workflows with multiple databases, prefer bioservices (unified interface to 40+ services). Use this for direct HTTP/REST work or UniProt-specific control.
umap-learn
UMAP dimensionality reduction. Fast nonlinear manifold learning for 2D/3D visualization, clustering preprocessing (HDBSCAN), supervised/parametric UMAP, for high-dimensional data.
transformers
This skill should be used when working with pre-trained transformer models for natural language processing, computer vision, audio, or multimodal tasks. Use for text generation, classification, question answering, translation, summarization, image classification, object detection, speech recognition, and fine-tuning models on custom datasets.
torchdrug
Graph-based drug discovery toolkit. Molecular property prediction (ADMET), protein modeling, knowledge graph reasoning, molecular generation, retrosynthesis, GNNs (GIN, GAT, SchNet), 40+ datasets, for PyTorch-based ML on molecules, proteins, and biomedical graphs.
torch-geometric
Graph Neural Networks (PyG). Node/graph classification, link prediction, GCN, GAT, GraphSAGE, heterogeneous graphs, molecular property prediction, for geometric deep learning.
tooluniverse
Use this skill when working with scientific research tools and workflows across bioinformatics, cheminformatics, genomics, structural biology, proteomics, and drug discovery. This skill provides access to 600+ scientific tools including machine learning models, datasets, APIs, and analysis packages. Use when searching for scientific tools, executing computational biology workflows, composing multi-step research pipelines, accessing databases like OpenTargets/PubChem/UniProt/PDB/ChEMBL, performing tool discovery for research tasks, or integrating scientific computational resources into LLM workflows.
sympy
Use this skill when working with symbolic mathematics in Python. This skill should be used for symbolic computation tasks including solving equations algebraically, performing calculus operations (derivatives, integrals, limits), manipulating algebraic expressions, working with matrices symbolically, physics calculations, number theory problems, geometry computations, and generating executable code from mathematical expressions. Apply this skill when the user needs exact symbolic results rather than numerical approximations, or when working with mathematical formulas that contain variables and parameters.
string-database
Query STRING API for protein-protein interactions (59M proteins, 20B interactions). Network analysis, GO/KEGG enrichment, interaction discovery, 5000+ species, for systems biology.