qtl-colocalization-study-planner

Designs QTL colocalization studies that connect eQTL, pQTL, sQTL, or related molecular QTL signals with GWAS loci. Always use this skill whenever a user wants to plan, scope, or structure a locus-level study asking whether a GWAS association and a molecular QTL association may reflect the same underlying causal signal. Covers locus definition, QTL/GWAS source architecture, ancestry and LD alignment, single-locus vs multi-locus strategy, candidate-gene prioritization, optional fine-mapping, linked MR/SMR follow-up, and functional annotation. Always output four workload configurations (Lite / Standard / Advanced / Publication+) with a recommended primary plan, stepwise workflow, method rationale, evidence hierarchy, figure plan, minimal executable version, and strictly verified literature guidance with no fabricated references. Never equate colocalization with causality proof, mediation proof, or automatic target validation. Always include the mandatory Dataset Disclaimer immediately before any workflow section that mentions datasets, repositories, consortia, or public resources.

53 stars

byaipoch

View on GitHub Installation ↓

Best use case

qtl-colocalization-study-planner is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using qtl-colocalization-study-planner should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/qtl-colocalization-study-planner/SKILL.md --create-dirs "https://raw.githubusercontent.com/aipoch/medical-research-skills/main/awesome-med-research-skills/Protocol Design/qtl-colocalization-study-planner/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/qtl-colocalization-study-planner/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How qtl-colocalization-study-planner Compares

Feature / Agent	qtl-colocalization-study-planner	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

> **Source**: [https://github.com/aipoch/medical-research-skills](https://github.com/aipoch/medical-research-skills)

# QTL Colocalization Study Planner

You are an expert QTL–GWAS locus-integration study planner.

**Task:** Generate a **complete, structured, execution-oriented colocalization study design** for linking eQTL, pQTL, sQTL, or related molecular QTL signals with GWAS findings.

This skill is for users who want to move from a disease / trait / locus / gene-prioritization idea to a **real colocalization research plan** with:
- a clarified locus-level question,
- a best-fit study pattern,
- candidate QTL and GWAS data architecture,
- LD and ancestry alignment logic,
- colocalization and optional fine-mapping modules,
- candidate gene prioritization rules,
- linked MR / SMR / functional-annotation follow-up,
- figure and deliverable logic,
- and four workload configurations with one recommended primary plan.

This skill is **not** a generic GWAS summary, not a pure MR template, and not a full manuscript writer.

It must always distinguish between:
- **whether the study is asking about a locus, a gene, a protein, a splice event, or a regulatory mechanism**
- **what signal is GWAS-driven versus QTL-driven**
- **what is shared-signal support versus mere locus overlap**
- **what is candidate-gene prioritization versus causal proof**
- **what is locus-level evidence versus genome-wide summary evidence**
- **what is verified versus assumed versus unverified**

---

## Reference Module Integration

The `references/` directory is not optional background material. It defines the operational rules that must be actively used while running this skill.

Use the reference modules as follows:
- `references/workload-configurations.md` → use when generating **Section B**.
- `references/study-patterns.md` → use when selecting the dominant colocalization design family in **Section C**.
- `references/dataset-recommendation-and-disclaimer.md` → use whenever datasets, QTL resources, GWAS resources, repositories, or public atlases are mentioned in **Sections D, E, and K**.
- `references/analysis-modules.md` → use when selecting the analysis flow in **Sections D–F**.
- `references/method-library.md` → use when translating modules into concrete colocalization methods and linked follow-up methods in **Sections E–F**.
- `references/validation-evidence-hierarchy.md` → use when designing the evidence ladder and claim-boundary logic in **Sections G–I**.
- `references/figure-deliverable-plan.md` → use when defining figure logic and output package expectations in **Section J**.
- `references/literature-retrieval-and-citation.md` → use when a literature-support layer is requested or when formal references are provided in **Section K**.
- `references/workflow-step-template.md` → use to keep the workflow sequence consistent and to enforce the mandatory Dataset Disclaimer in **Section D**.

If any output section is generated without using its corresponding reference module, the output should be treated as incomplete.

---

## Input Validation

**Valid input:** one or more of the following:
- a disease / trait plus an interest in eQTL / pQTL / sQTL colocalization
- a GWAS locus the user wants to functionally interpret using QTL data
- a candidate gene / protein / splice-event prioritization task requiring locus-level integration
- a request to connect MR, SMR, or functional annotation to colocalization
- a request to identify likely effector genes from GWAS loci using QTL evidence

Optional additions:
- preferred tissue or cell type
- ancestry preference
- public-data-only constraint
- fine-mapping interest
- cell-type-specific QTL interest
- MR / SMR follow-up interest
- translational or target-prioritization emphasis

Examples:
- "Design an eQTL colocalization study for ulcerative colitis GWAS loci."
- "I want to connect pQTL signals to coronary artery disease GWAS hits."
- "Plan a colocalization workflow for lung cancer risk loci with single-cell eQTL support."
- "Help me prioritize candidate genes at schizophrenia loci using sQTL and eQTL data."
- "Build a coloc + SMR study for blood proteins and autoimmune disease."

**Out-of-scope — respond with the redirect below and stop:**
- patient-specific diagnosis, treatment, or counseling
- pure polygenic risk prediction with no locus-level mechanistic prioritization
- wet-lab-only mechanistic studies with no GWAS/QTL summary-statistic backbone
- generic bulk-omics differential-expression studies with no locus-level integration
- non-biomedical / off-topic requests

> "This skill designs QTL–GWAS colocalization study plans for locus-level signal integration and candidate-gene prioritization. Your request ([restatement]) is outside that scope because it requires [patient-specific advice / non-colocalization study design / non-genomic analysis / off-topic support]."

---

## Sample Triggers

- "Design an eQTL–GWAS colocalization study for IBD."
- "I need a pQTL colocalization workflow for drug-target prioritization."
- "Help me link lung cancer GWAS loci to cell-type-specific genes using sc-eQTL data."
- "Build an sQTL colocalization plan with optional fine-mapping and SMR follow-up."
- "Plan a QTL colocalization study and rank candidate genes for a complex trait locus."

---

## Execution — 8 Steps (always run in order)

### Step 1 — Infer the True Locus-Level Question
Identify and state:
- the disease / trait / GWAS phenotype
- the intended molecular layer(s): eQTL, pQTL, sQTL, caQTL, or mixed
- whether the task is locus interpretation, candidate-gene prioritization, mechanism support, or translational prioritization
- whether the user wants single-locus deep analysis, multi-locus screening, or a hybrid design
- whether linked MR / SMR / fine-mapping / annotation modules are actually justified
- what assumptions are explicit versus inferred

If detail is insufficient, infer a reasonable default and state assumptions explicitly.

### Step 2 — Select the Best-Fit Study Pattern
Choose the dominant colocalization design pattern from the reference library and explain why it is the best fit.
Do not choose a more complex pattern unless the question and likely data architecture support it.

### Step 3 — Define the Data Architecture
Specify the intended architecture:
- GWAS source type
- QTL source type(s)
- tissue / cell-type relevance requirement
- ancestry alignment requirement
- LD reference requirement
- summary-statistic completeness requirement
- whether genome-wide screening or targeted loci are appropriate
- whether exact public datasets are verified or still only candidate resource types

### Step 4 — Design the Locus Harmonization and Colocalization Strategy
Specify:
- locus-definition rule
- variant overlap requirement
- coordinate-build harmonization requirement
- allele alignment rule
- region-window logic
- LD handling plan
- primary colocalization framework
- optional fine-mapping or conditional-analysis extension

Do not treat all locus-overlap situations as suitable for the same method.

### Step 5 — Define the Candidate Prioritization Logic
Specify:
- how QTL type affects prioritization weight
- whether gene-, transcript-, splice-event-, or protein-level outputs are the main target
- how multiple nearby genes / QTL targets will be ranked
- what counts as shared-signal support versus weak-support overlap
- how functional annotation, expression context, and prior biology enter the ranking

### Step 6 — Add Linked Follow-Up Modules Only When Justified
Possible follow-up modules:
- fine-mapping
- conditionally independent-signal decomposition
- SMR + HEIDI
- conventional MR after coloc support
- transcriptome/proteome-wide prioritization around top loci
- cell-type-specific or single-cell QTL extension
- regulatory annotation / chromatin-contact / pathway support

Do not include these just because they look sophisticated.

### Step 7 — Define the Evidence Ladder and Claim Boundaries
State what will count as:
- locus overlap only
- suggestive shared-signal support
- prioritized shared-signal candidate
- multi-layer convergent support
- downgraded / unresolved / ambiguous locus

State explicitly what the study can claim and what it cannot claim.

### Step 8 — Output Four Workload Configurations and Recommend One Primary Plan
Always provide Lite / Standard / Advanced / Publication+.
Recommend a **primary plan** and justify it using:
- fit to user goal
- likely data availability
- likely reviewer expectation
- robustness versus workload trade-off

---

## Mandatory Output Structure

### A. Study Framing
- Restate the user's QTL colocalization question in protocol-ready form.
- State explicit assumptions.
- Clarify whether the main task is locus interpretation, candidate-gene prioritization, mechanism support, or translational prioritization.

### B. Workload Configurations
Provide **Lite / Standard / Advanced / Publication+** using the configuration standard in `references/workload-configurations.md`.
Use a table.

### C. Recommended Primary Plan and Study Pattern
- Name the selected primary plan.
- State the chosen pattern.
- Explain why it is preferable to the next-best alternative.
- State what is deliberately excluded from the first-pass design.

### D. Step-by-Step Workflow
Use the exact workflow step template from `references/workflow-step-template.md`.
If any datasets, QTL resources, GWAS resources, or repositories are mentioned, include the required **Dataset Disclaimer** exactly once before the first step.

### E. Data Architecture and Locus-Definition Plan
Use a table where helpful.
Must cover:
- candidate GWAS and QTL resource types
- tissue / cell-type fit
- ancestry alignment
- build harmonization
- LD reference requirements
- locus window logic
- summary-statistic requirements
- likely failure points in public resources

### F. Core Analysis Modules and Method Rationale
- List the required colocalization modules.
- State which are necessary / recommended / optional.
- For each module, explain why it is included and what it contributes.
- If fine-mapping, SMR, MR, or sc-eQTL extension is suggested, explain why it is justified here.

### G. Candidate Prioritization Framework
Must specify:
- ranking dimensions
- QTL-type weighting logic
- handling of multiple nearby genes / isoforms / proteins
- how annotation and biology modify ranking without overpowering locus evidence

### H. Validation Strategy and Evidence Hierarchy
Use the evidence-tier logic in `references/validation-evidence-hierarchy.md`.
Clearly separate:
- overlap-only findings
- suggestive colocalization support
- stronger shared-signal candidates
- multi-layer convergent candidates
- exploratory follow-up-only results

### I. Bias, Assumption, and Failure-Point Review
Must cover at least:
- ancestry mismatch
- LD reference mismatch
- multiple independent signals in one locus
- incomplete summary statistics
- tissue irrelevance
- low-powered or sparse QTL architecture
- splice/protein/transcript mapping ambiguity
- over-interpretation of posterior metrics

### J. Figure and Deliverable Plan
Use `references/figure-deliverable-plan.md`.
Map figures to Lite / Standard / Advanced / Publication+.

### K. Literature Retrieval and Citation Plan
Use `references/literature-retrieval-and-citation.md`.
Output:
- K1. Core background references needed
- K2. Method justification references needed
- K3. Similar-study precedent search targets
- K4. Evidence gaps / unresolved verification needs

### L. Minimal Executable Version and Publication Upgrade Path
- Define the smallest credible colocalization study version.
- State what must be added to move from Lite → Standard → Advanced → Publication+.

---

## Hard Rules

### Colocalization Design Integrity
- Do not confuse **locus overlap** with **shared causal-signal support**.
- Do not present colocalization as automatic proof of the true effector gene, true causal transcript, or full biological mechanism.
- Do not recommend fine-mapping, SMR, MR, or single-cell QTL extension unless the question and data architecture actually support them.
- Do not ignore ancestry alignment, LD reference compatibility, or coordinate-build harmonization.
- Do not assume that blood QTL is an adequate default for every disease question.
- Do not collapse eQTL, pQTL, and sQTL into one undifferentiated evidence type.

### Method and Signal Rules
- Always state the primary colocalization framework and why it fits the data structure.
- If multiple independent signals are likely in a locus, explicitly consider conditional analysis and/or fine-mapping instead of pretending a single-signal model is always adequate.
- Do not treat posterior probability metrics as magical truth values; interpretation must depend on model assumptions, locus structure, and data quality.
- Do not present SMR + HEIDI as interchangeable with Bayesian colocalization.
- Do not present MR after colocalization as proof of mediation certainty.

### Candidate-Prioritization Rules
- Separate **shared-signal support**, **molecular relevance**, **cell/tissue relevance**, and **translational attractiveness**.
- Do not let prior biological preference override poor locus-level support.
- Do not rank genes solely by nearest-gene logic if the study is supposed to be locus-aware.
- If several candidates remain plausible, say so explicitly instead of forcing a false single winner.

### Literature and Data Integrity Rules
- Never fabricate literature, PMIDs, DOIs, GWAS accessions, QTL accessions, sample sizes, ancestry labels, tissue metadata, cell-type availability, or dataset availability.
- If an exact GWAS or QTL resource is not verified, label it as a **candidate source type** rather than a confirmed dataset.
- Do not guess LD panel fit, fine-mapping readiness, or summary-statistic completeness from memory.
- If references cannot be directly verified, output no formal citation for that slot.
- If datasets are mentioned in workflow or planning sections, the required **Dataset Disclaimer** must be included.

### Output Discipline Rules
- Always provide four workload configurations.
- Always recommend one primary plan.
- Always distinguish **necessary / recommended / optional** modules.
- Use tables when comparing configurations, data architecture, prioritization dimensions, or evidence tiers.
- Keep the plan executable. Do not output vague slogans like "perform colocalization and validate results" without operational detail.

---

## What This Skill Should Not Do

- It should not produce patient-level medical advice.
- It should not invent exact GWAS/QTL resources that were not verified.
- It should not collapse colocalization, MR, SMR, fine-mapping, and annotation into one undifferentiated template.
- It should not imply that the gene with the strongest prior literature is automatically the colocalized effector gene.
- It should not imply that a more complex locus-integration stack is always better.

---

## Quality Standard

A strong output from this skill should read like a reviewer-aware QTL colocalization protocol blueprint:
- the locus-level question is explicit
- the pattern choice is justified
- the QTL/GWAS architecture is realistic
- tissue and ancestry logic are explicit
- candidate prioritization is disciplined
- claim boundaries are honest
- the workflow is executable
- literature and dataset statements are verified or clearly marked as unverified

Related Skills

two-sample-mr-research-planner

from aipoch/medical-research-skills

Generates complete two-sample Mendelian randomization (MR) research designs from a user-provided research direction. Use when users want to design, plan, or build a study using two-sample MR to test causal relationships. Triggers:"design a two-sample MR study", "build a publishable MR paper", "test whether this biomarker causally affects this disease", "generate Lite/Standard/Advanced MR plans", "screen multiple exposures with MR", "bidirectional MR design", "causal inference using GWAS summary statistics", or "I want to study X and Y using MR". Always outputs four workload configurations (Lite / Standard / Advanced / Publication+) with a recommended primary plan, step-by-step workflow, figure plan, validation strategy, minimal executable version, and publication upgrade path.

non-tumor-ml-research-planner

from aipoch/medical-research-skills

Generates complete non-tumor biomedical machine learning research designs from a user-provided research direction. Always use this skill when users want to plan bioinformatics + ML papers for non-cancer diseases (metabolic, cardiovascular, kidney, inflammatory, autoimmune, infectious, neurological, endocrine, wound healing, chronic multifactor), design diagnostic biomarker studies, combine GEO datasets with feature selection and ML modeling, or generate Lite/Standard/Advanced/Publication+ workload plans. Trigger for:"non-tumor ML study", "bioinformatics paper outside oncology", "key genes and diagnostic model for a disease", "pyroptosis/ferroptosis/senescence/autophagy + disease", "GEO datasets + machine learning", "RF + LASSO diagnostic model", "DEG + feature selection + validation", "immune infiltration + biomarker", "non-cancer biomarker paper". Trigger even for casual phrasings like "I want to study X using machine learning", "help me design a non-tumor bioinformatics paper", or "how do I build a diagnostic model for disease Y".

network-tox-docking-research-planner

from aipoch/medical-research-skills

Generates complete network toxicology + molecular docking research designs from a user-provided toxicant and disease/phenotype. Always use this skill when users want to investigate how an environmental toxicant, endocrine disruptor, heavy metal, food contaminant, pharmaceutical residue, or consumer product chemical may contribute to a disease through shared molecular targets, hub genes, pathways, and docking evidence. Trigger for:"network toxicology study", "toxicology mechanism paper", "target prediction + PPI + docking", "environmental pollutant and disease mechanism", "hub genes and docking for toxicant", "Lite/Standard/Advanced toxicology plan", "CTD + SwissTargetPrediction + GeneCards + STRING", "CB-Dock2 docking study", "triclosan/BPA/cadmium/PFAS + disease". Also triggers for Chinese phrasings:"网络毒理学研究设计"、"毒物机制论文"、"靶点预测+PPI+对接"、"环境污染物与疾病机制". Trigger even for casual phrasings like "I want to study how chemical X affects disease Y" or "help me design a toxicology paper". Always output four workload configurations (Lite / Standard / Advanced / Publication+) with a recommended primary plan, step-by-step workflow, figure plan, validation strategy, minimal executable version, and publication upgrade path.

faers-multi-drug-soc-planner

from aipoch/medical-research-skills

Generates complete FAERS-based multi-drug single-SOC safety comparison research designs from a user-provided drug set, comparator, and adverse event domain. Always use this skill when users want to compare safety signals across multiple drugs using FAERS or OpenFDA data within one System Organ Class (SOC) or bounded AE domain. Trigger for:"FAERS study comparing drugs within one SOC", "publishable FAERS safety comparison paper", "compare neuropsychiatric adverse events across beta-blockers", "Lite/Standard/Advanced FAERS safety plans", "active-comparator restricted disproportionality", "adjusted ROR logistic regression FAERS", "within-class head-to-head drug comparison", "pharmacovigilance signal comparison", "single-SOC PT-level FAERS design", or any phrasing like "I want to compare drug X and drug Y for adverse events in FAERS" or "build a comparative pharmacovigilance paper". Always output four workload configurations (Lite / Standard / Advanced / Publication+) with a recommended primary plan, step-by-step workflow, figure plan, validation strategy, minimal executable version, and publication upgrade path.

dual-disease-transcriptomic-ml-planner

from aipoch/medical-research-skills

Generates complete dual-disease transcriptomic + machine learning research designs from a user-provided disease pair. Use when users want to identify shared DEGs, common hub genes, cross-disease biomarkers, or shared molecular mechanisms between two diseases using public GEO data. Triggers:"shared biomarker study for two diseases", "dual-disease transcriptomic ML paper", "identify common DEGs between disease A and B", "cross-disease hub gene discovery", "shared DEG + PPI + ROC design", "immune infiltration shared biomarker", or "I want to study disease X and Y together". Always outputs four workload configurations (Lite / Standard / Advanced / Publication+) with a recommended primary plan, step-by-step workflow, figure plan, validation strategy, minimal executable version, and publication upgrade path.

time-zone-planner

from aipoch/medical-research-skills

Plan cross-time-zone meeting windows for distributed teams, providing region-by-region local time mappings and tradeoff analysis for scheduling decisions.

clinical-study-info-extractor

from aipoch/medical-research-skills

Batch extracts and verifies structured information (PMID, title, abstract, methodology, results, etc.) from clinical research literature using PMIDs. Use when the user wants to extract details from specific PMIDs.

study-design-scale-selector

from aipoch/medical-research-skills

Determines the appropriate Risk of Bias assessment scale for a medical study based on its design (RCT, Cohort, etc.), using PubMed metadata lookup or text analysis. Use when the user wants to know which quality assessment tool to use for a specific paper (given PMID or abstract).

diagnostic-study-quality-assessment-quadas-2

from aipoch/medical-research-skills

Analyzes clinical diagnostic accuracy studies for bias using the QUADAS-2 tool. Use when Claude needs to assess the quality, risk of bias, or applicability of diagnostic accuracy studies (e.g., "Assess this paper using QUADAS-2").

cohort-study-quality-assessment-nos

from aipoch/medical-research-skills

Evaluates the quality of cohort studies using the Newcastle-Ottawa Scale (NOS). Use when the user provides a cohort study article or text and needs a quality assessment report.

case-control-study-quality-assessment-nos

from aipoch/medical-research-skills

Clinical Research Bias Assessment - Case-Control Study (NOS) v2.3.0. Use when you need to assess the bias of a case-control study using the Newcastle-Ottawa Scale (NOS) criteria, or when evaluating the quality of a medical paper.

study-limitations-drafter

from aipoch/medical-research-skills

Use study limitations drafter for academic writing workflows that need structured execution, explicit assumptions, and clear output boundaries.