image-algorithm-validator

Medical image processing algorithm validation skill for segmentation, detection, and analysis algorithms

509 stars

Best use case

image-algorithm-validator is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Medical image processing algorithm validation skill for segmentation, detection, and analysis algorithms

Teams using image-algorithm-validator should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/image-algorithm-validator/SKILL.md --create-dirs "https://raw.githubusercontent.com/a5c-ai/babysitter/main/library/specializations/domains/science/biomedical-engineering/skills/image-algorithm-validator/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/image-algorithm-validator/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How image-algorithm-validator Compares

Feature / Agentimage-algorithm-validatorStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Medical image processing algorithm validation skill for segmentation, detection, and analysis algorithms

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Image Algorithm Validator Skill

## Purpose

The Image Algorithm Validator Skill supports validation of medical image processing algorithms, including segmentation, detection, and analysis algorithms, ensuring performance meets clinical requirements.

## Capabilities

- Ground truth dataset curation guidance
- Performance metric calculation (Dice, IoU, sensitivity, specificity)
- Inter-observer variability analysis
- Statistical comparison methods
- Validation dataset stratification
- Multi-reader multi-case study design
- FDA AI/ML guidance alignment
- Failure case analysis
- Edge case identification
- Performance boundary testing
- Cross-validation methodology

## Usage Guidelines

### When to Use
- Validating image analysis algorithms
- Curating validation datasets
- Designing reader studies
- Preparing regulatory submissions

### Prerequisites
- Algorithm development complete
- Ground truth established
- Validation dataset available
- Performance criteria defined

### Best Practices
- Use representative, diverse datasets
- Establish robust ground truth methodology
- Assess performance across subgroups
- Document failure modes

## Process Integration

This skill integrates with the following processes:
- Medical Image Processing Algorithm Development
- AI/ML Medical Device Development
- Clinical Evaluation Report Development
- Software Verification and Validation

## Dependencies

- SimpleITK library
- scikit-image
- MONAI framework
- Evaluation frameworks
- Statistical analysis tools

## Configuration

```yaml
image-algorithm-validator:
  algorithm-types:
    - segmentation
    - detection
    - classification
    - registration
    - quantification
  metrics:
    - Dice
    - IoU
    - sensitivity
    - specificity
    - AUC
    - Hausdorff-distance
  validation-methods:
    - holdout
    - cross-validation
    - external-validation
```

## Output Artifacts

- Dataset curation protocols
- Ground truth documentation
- Performance reports
- Statistical analyses
- Reader study results
- Failure mode catalogs
- Regulatory submission sections
- Validation summaries

## Quality Criteria

- Ground truth methodology validated
- Metrics appropriate for algorithm type
- Dataset representative of intended use
- Statistical analysis rigorous
- Subgroup performance assessed
- Documentation supports regulatory review

Related Skills

image-optimization

509
from a5c-ai/babysitter

Image formats, responsive images, lazy loading, and CDN integration.

responsive-image

509
from a5c-ai/babysitter

Generate responsive image sets with srcset, WebP/AVIF conversion, and art direction

design-system-validator

509
from a5c-ai/babysitter

Validate design system compliance in code and detect token usage violations

link-validator

509
from a5c-ai/babysitter

Comprehensive link checking and validation for documentation. Validate internal links, external URLs, anchors, detect redirects, monitor link rot, and generate sitemap validation reports.

code-sample-validator

509
from a5c-ai/babysitter

Extract, validate, and test code samples in documentation. Verify syntax, execute samples, check outputs, validate imports, and ensure code samples are up-to-date with current APIs.

openapi-validator

509
from a5c-ai/babysitter

Validate OpenAPI specifications for correctness, security, and best practices

k8s-validator

509
from a5c-ai/babysitter

Validate Kubernetes manifests for security, best practices, and resource limits

slam-algorithms

509
from a5c-ai/babysitter

Expert skill for SLAM algorithm selection, configuration, and tuning. Configure visual SLAM (ORB-SLAM3, RTAB-Map), LiDAR SLAM (Cartographer, LIO-SAM), tune parameters, evaluate accuracy, and optimize for real-time performance.

specialization-validator

509
from a5c-ai/babysitter

Validate specialization completeness across all 7 phases, score each phase, identify gaps, and generate validation reports.

process-validator

509
from a5c-ai/babysitter

Validate process JS files for correct SDK patterns, task definitions, syntax, and quality gate implementation.

checklist-validator

509
from a5c-ai/babysitter

Skill for validating research against reporting checklists

tem-image-analyzer

509
from a5c-ai/babysitter

Transmission Electron Microscopy image analysis skill for nanoparticle size, morphology, and crystallography assessment