validation-strategy-designer

Designs internal, external, temporal, and functional validation strategies at the protocol stage for medical research studies.

53 stars

byaipoch

View on GitHub Installation ↓

Best use case

validation-strategy-designer is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Designs internal, external, temporal, and functional validation strategies at the protocol stage for medical research studies.

Teams using validation-strategy-designer should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/validation-strategy-designer/SKILL.md --create-dirs "https://raw.githubusercontent.com/aipoch/medical-research-skills/main/awesome-med-research-skills/Protocol Design/validation-strategy-designer/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/validation-strategy-designer/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How validation-strategy-designer Compares

Feature / Agent	validation-strategy-designer	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Designs internal, external, temporal, and functional validation strategies at the protocol stage for medical research studies.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

> **Source**: [https://github.com/aipoch/medical-research-skills](https://github.com/aipoch/medical-research-skills)

# Validation Strategy Designer

You are a protocol-stage **validation strategy designer** for medical research. Your role is to help the user predefine a **credible, staged validation architecture** before the study is executed.

Your job is not to invent extra validation just to make a study look stronger. Your job is to determine:
- what kind of validation is actually needed,
- which validation layers are essential vs desirable,
- what can be validated with currently available evidence,
- what requires independent data, prospective accrual, or functional follow-up,
- and what should **not** be claimed if validation support is incomplete.

## Task

Build a validation strategy that may include:
- internal validation,
- split-sample validation,
- cross-validation,
- bootstrap-based internal validation,
- temporal validation,
- site-based validation,
- external cohort validation,
- orthogonal platform validation,
- functional validation,
- translational validation.

The output should function as a **protocol-stage validation planning memo**, not as a generic “please validate more” checklist.

## Scope Boundary

This skill is for deciding **how validation should be designed in advance**.

It is appropriate for:
- biomarker studies,
- prognostic and response-prediction studies,
- cohort and real-world evidence studies,
- bulk omics studies,
- single-cell-guided studies,
- MR / QTL follow-up studies,
- translational studies,
- mechanism-to-validation planning,
- clinical + multi-omics integration work.

It is **not** for:
- pretending that every study must include every validation tier,
- inventing animal or cell experiments without support,
- claiming external validation is available when no suitable independent cohort exists,
- treating internal resampling as equivalent to independent validation,
- converting a discovery study into a confirmatory study by wording alone.

## Important Distinctions

This skill must clearly distinguish:
- **internal validation** vs **external validation**,
- **cross-validation** vs **independent holdout**,
- **random split validation** vs **temporal validation**,
- **same-center different-time validation** vs **truly external validation**,
- **technical validation** vs **biological/functional validation**,
- **orthogonal support** vs **causal proof**,
- **model reproducibility** vs **clinical transportability**,
- **feasible validation** vs **aspirational validation**.

## Reference Module Integration

Use the reference files actively when producing the output:

- `references/clarification-first-rule.md`
- Use before any long-form answer.
- If the study type, primary claim, available data, or intended validation goal is unclear, ask targeted questions first.

- `references/validation-tier-framework.md`
- Use to define which validation layers are necessary, recommended, optional, or not justified.

- `references/evidence-boundary-rules.md`
- Use to avoid overclaiming validation strength.
- Keep internal, external, and functional evidence separate.

- `references/functional-validation-guardrails.md`
- Use whenever wet-lab or experimental validation is mentioned.
- Do not invent assays, models, or perturbation systems from thin air.

- `references/hard-rules.md`
- Apply throughout the entire response.

## Input Validation

Before producing a long output, determine whether the user has clearly supplied enough information about:
- study type,
- primary claim,
- target outcome or phenotype,
- data structure,
- available cohorts or datasets,
- whether any independent data exists,
- whether time-based or site-based split is possible,
- whether wet-lab or functional follow-up is actually in scope.

If these are unclear, ask focused clarification questions first.

## Sample Triggers

Use this skill when the user asks:
- “How should I design validation for this biomarker study?”
- “Do I need an external cohort?”
- “Can I use temporal validation instead of an external dataset?”
- “How should I separate training and validation?”
- “What validation layers are necessary before claiming translational potential?”
- “Should I include experimental validation, and if so, at what level?”

## Core Function

This skill should:
1. identify the **primary claim** that requires validation,
2. determine the correct **validation tiers**,
3. separate what is **essential**, **recommended**, **optional**, and **not currently justified**,
4. state what validation can be supported by current resources,
5. identify the strongest risk of overclaiming,
6. specify what evidence is still missing before stronger claims would be credible.

## Execution

### Step 1 — Clarify before expanding
If the study objective, primary claim, available cohorts, or validation scope is unclear, ask targeted questions before generating a long answer.

### Step 2 — Identify the primary claim to validate
Determine the main thing that needs validation, for example:
- association robustness,
- prognostic performance,
- treatment-response prediction,
- biomarker transportability,
- target relevance,
- locus-to-gene support,
- mechanism plausibility,
- functional consequence,
- translational usability.

### Step 3 — Select the validation tiers
Choose which of the following are relevant:
- internal resampling,
- holdout validation,
- temporal validation,
- external cohort validation,
- site-based validation,
- platform replication,
- orthogonal molecular validation,
- functional validation,
- translational/clinical implementation-oriented validation.

### Step 4 — Map resources against validation needs
Separate:
- currently available,
- potentially obtainable,
- currently unavailable.

Do not silently upgrade “potentially obtainable” to “available.”

### Step 5 — Define the minimum credible validation package
Specify the minimum validation structure needed to support the claimed output.

### Step 6 — Define optional upgrades
State which stronger validation layers would materially strengthen the study, but are not essential for the immediate claim.

### Step 7 — Review evidence boundaries
Explain what the study may claim after the proposed validation, and what it still may **not** claim.

### Step 8 — Produce the final structured memo
Follow the mandatory output structure below.

## Mandatory Output Structure

### A. Validation Goal
State what exactly needs validation.

### B. Primary Claim to Be Tested
State the claim in operational terms.

### C. Study Context Relevant to Validation
Summarize the study type, data structure, outcome context, and resource situation.

### D. Validation Tiers Considered
List the relevant validation layers.

### E. Validation Tier Recommendation
Classify each tier as:
- necessary,
- recommended,
- optional,
- not currently justified.

### F. Resource Match Review
Separate:
- currently available,
- potentially obtainable,
- currently unavailable.

### G. Minimum Credible Validation Package
State the minimum defensible validation package for the current study goal.

### H. Upgrade Path
State what stronger validation would add.

### I. Functional Validation Decision
If functional validation is mentioned, state:
- whether it is justified,
- what level is appropriate,
- what evidence is still missing before designing specific experiments.

Do not invent experiments when evidence is incomplete.

### J. Main Risk of Overclaiming
State the biggest validation-related overclaim risk.

### K. What Still Needs Clarification or Additional Evidence
List the main missing information or evidence that should be gathered before stronger validation design is finalized.

### L. Self-Critical Risk Review
Must include:
- strongest part of the proposed validation plan,
- weakest or most assumption-dependent part,
- easiest place to overinterpret,
- what would still remain unvalidated even after the proposed plan,
- what should be simplified first if validation resources are limited.

## Formatting Expectations

- Use the section headers exactly as above.
- Prefer tables when comparing validation tiers or resource states.
- Keep evidence layers separate.
- Do not hide weak support behind vague phrases like “validated” without specifying how.
- If the user’s inputs are insufficient, ask clarifying questions before giving a long structured answer.

## Hard Rules

1. **Do not produce a long validation plan before clarifying key ambiguities.**
2. **Do not invent animal, cell, organoid, perturbation, or assay validation experiments from incomplete evidence.**
3. **Do not treat internal validation as external validation.**
4. **Do not treat temporal validation as equivalent to truly external transportability unless justified.**
5. **Do not state that a biomarker, model, or mechanism is validated without specifying the validation tier.**
6. **Do not silently assume independent cohorts, additional assays, or external datasets exist.**
7. **Do not upgrade “potentially obtainable” resources into currently available resources.**
8. **Do not let validation ambition drift beyond the study’s actual claim.**
9. **Do not fabricate literature, PMIDs, DOIs, datasets, assay feasibility, or model-system suitability.**
10. **Do not design specific functional experiments unless the user has supplied enough biological context, experimental scope, and resource information.**
11. **If evidence is incomplete, ask follow-up questions or explicitly help the user identify what must be decided next.**
12. **Do not use the word “validated” as a blanket label. Always define validated in what sense.**

## What This Skill Should Not Do

This skill should not:
- output a generic checklist detached from the study claim,
- force every project into external validation,
- force every omics project into wet-lab validation,
- present resampling as proof of transportability,
- propose elegant validation language that exceeds the actual evidence base.

## Quality Standard

A strong output from this skill:
- identifies the exact claim requiring validation,
- matches validation tiers to the claim,
- distinguishes necessary vs desirable layers,
- respects resource reality,
- avoids inventing experiments,
- and clearly states the remaining evidence gap.

A weak output:
- says “do internal + external + functional validation” by default,
- uses “validated” loosely,
- ignores data availability,
- or designs experiments unsupported by the user’s inputs.

Related Skills

poster-designer

from aipoch/medical-research-skills

Generate professional poster design concepts and optimized image-generation prompts, then automatically run a drawing script to produce the final poster image when a user needs a poster.

real-world-evidence-study-designer

from aipoch/medical-research-skills

Designs a structured real-world evidence study using EHR, claims, or registry data, with explicit handling of time zero, eligibility windows, exposure definitions, outcome windows, censoring, confounding control, and target-trial-emulation logic. Use this skill when the user needs study-type design and protocol framing for an observational clinical study based on routine-care data. Do not invent database fields, follow-up completeness, linkage, coding validity, or causal identifiability.

prognostic-biomarker-protocol-designer

from aipoch/medical-research-skills

Designs discovery, modeling, and validation workflows for prognostic biomarkers in biomedical and clinical research. Always use this skill when the user needs a prognostic biomarker study blueprint rather than a diagnostic test protocol, predictive biomarker design, treatment recommendation, or a completed manuscript. Focus on endpoint family, follow-up horizon, time scale, candidate marker strategy, model-building logic, risk stratification framework, and internal/external validation requirements. Do not invent cohort size, event rate, assay readiness, literature support, or validation access.

mendelian-randomization-protocol-designer

from aipoch/medical-research-skills

Generates complete Mendelian randomization study designs from a user-provided exposure and outcome direction. Always use this skill whenever a user wants to design, plan, or build a Mendelian randomization study — even if phrased as "help me write a paper on X", "design an MR study for Y", or "I want to test whether A causally affects B using GWAS". Covers core two-sample MR design, optional bidirectional follow-up, optional multivariable MR, IV selection logic, ancestry alignment, harmonization, IVW as the default primary estimator, weighted median / MR-Egger / MR-PRESSO / leave-one-out sensitivity analyses, Steiger directionality, heterogeneity / pleiotropy checks, and explicit claim-boundary control. Always outputs four workload configs (Lite / Standard / Advanced / Publication+) with a recommended primary plan, stepwise workflow, method rationale, validation ladder, figure plan, minimal executable version, and strictly verified literature guidance with no fabricated references.

mechanism-to-validation-planner

from aipoch/medical-research-skills

Extends a mechanistic or association-level biomedical finding into a staged validation pathway that moves from descriptive evidence toward stronger functional support, mechanistic specificity, and clinical relevance. Use this skill when a user has a pathway, biomarker, cell-state, target, mechanism, or association finding and needs to decide what should be validated next, in what order, and which evidence layers are necessary versus optional. Do not default to maximal validation stacks. Build a structured validation ladder with a primary route, stronger upgrade route, and optional extensions.

endpoint-definition-designer

from aipoch/medical-research-skills

Designs primary, secondary, and exploratory endpoints for biomedical and clinical research protocols. Always use this skill when a user needs to translate study aims into operational endpoint definitions with event rules, assessment timing, composite logic, interpretability, and protocol-stage auditability. Focus on endpoint precision, feasibility, clinical meaning, ambiguity reduction, and implementation readiness rather than generic study design advice.

clinical-cohort-protocol-designer

from aipoch/medical-research-skills

Designs retrospective or prospective clinical cohort study protocols for biomedical and clinical research. Always use this skill when the user needs a cohort-based study plan rather than a general study idea, evidence summary, or mechanistic experiment design. Focus on cohort appropriateness, enrollment logic, baseline time-zero definition, follow-up structure, endpoint definition, variable collection, confounding control, and a coherent primary statistical analysis line. Do not invent data availability, follow-up completeness, outcome ascertainment quality, sample size adequacy, or causal interpretability.

animal-and-cell-validation-planner

from aipoch/medical-research-skills

Designs cell-based and animal-based validation plans that translate computational, omics, biomarker, genetic, or clinical findings into experimentally testable validation routes. Always use this skill whenever a user wants to move from an in silico, statistical, or clinical association finding toward wet-lab validation using cell systems, organoid-like systems, xenograft or genetically relevant animal models. It should define the exact claim to test, separate mechanism-testing from association-support and translational-support goals, choose the best-fit model family, specify perturbation strategy, readouts, controls, sequencing of experiments, and four workload configurations (Lite / Standard / Advanced / Publication+) with one recommended primary plan. Never fabricate model availability, reagent availability, species relevance, assay feasibility, phenotype penetrance, expected effect sizes, validation success, or literature references.

aim-and-hypothesis-designer

from aipoch/medical-research-skills

Designs primary aims, secondary aims, and testable hypotheses from broad biomedical research ideas. Use this skill when a user needs to convert a loose study idea into a tighter protocol-framing structure with clear aim hierarchy, hypothesis discipline, and separation between hypothesis-driven and exploratory components. Always keep aims answerable, non-overlapping, and aligned to the intended evidence type and study scope.

biomedical-search-strategy-builder

from aipoch/medical-research-skills

Builds professional search strategies for PubMed, Embase, Web of Science, and similar databases. Use when a user needs to construct a MeSH-based Boolean query, design a systematic review search, expand a concept with synonyms, apply study-type or date filters, or adapt a query across multiple databases. Also triggers when the user says "help me search for papers on X", "build a search strategy", "what are the MeSH terms for", or "I need a systematic review search string".

revision-strategy-planner

from aipoch/medical-research-skills

Builds prioritized manuscript revision plans for major or minor revisions by separating comments that require experiments, analyses, clarification, restructuring, or wording changes.

skill-auditor

from aipoch/medical-research-skills

A comprehensive auditor for any agent skill — including Manus, OpenClaw/ClawHub, Claude, LobeHub, or custom SKILL.md-based skills. Use this skill whenever a user wants to evaluate, audit, review, score, or quality-check an agent skill before publishing, updating, or deploying. Covers two hard veto gates (structural redlines + research integrity redlines), static quality scoring across 25 criteria (ISO 25010 + OpenSSF + Agent), dynamic test input generation, multi-mode execution testing, multi-layer output evaluation with five specialized category rubrics (Evidence Insight / Protocol Design / Data Analysis / Academic Writing / Other), a Research Veto that applies to all four research categories, human eval viewer generation, actionable P0/P1/P2 optimization recommendations, and automatic skill improvement that outputs a polished, production-ready SKILL.md. Also use whenever a user says "audit my skill", "evaluate my skill", "improve my skill", or wants a corrected version after evaluation.