simo-multiomics-integration-agent
AI-powered spatial integration of multi-omics datasets using probabilistic alignment for comprehensive tissue atlas construction and cellular state mapping.
Best use case
simo-multiomics-integration-agent is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
AI-powered spatial integration of multi-omics datasets using probabilistic alignment for comprehensive tissue atlas construction and cellular state mapping.
Teams using simo-multiomics-integration-agent should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/simo-multiomics-integration-agent/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How simo-multiomics-integration-agent Compares
| Feature / Agent | simo-multiomics-integration-agent | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
AI-powered spatial integration of multi-omics datasets using probabilistic alignment for comprehensive tissue atlas construction and cellular state mapping.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# SIMO Multiomics Integration Agent
The **SIMO Multiomics Integration Agent** performs spatial integration of multi-omics datasets through probabilistic alignment. Unlike previous tools limited to transcriptomics, SIMO integrates spatial transcriptomics with single-cell RNA-seq and expands to chromatin accessibility, DNA methylation, and proteomics data.
## When to Use This Skill
* When integrating spatial transcriptomics with single-cell multi-omics data.
* For constructing comprehensive tissue atlases with spatial context.
* To map epigenomic states (ATAC-seq, methylation) onto spatial coordinates.
* When analyzing multi-modal cellular phenotypes in tissue architecture.
* For spatial deconvolution combining multiple modalities.
## Core Capabilities
1. **Spatial-scRNA Integration**: Probabilistically align single-cell RNA-seq to spatial coordinates.
2. **Chromatin Accessibility Mapping**: Project scATAC-seq profiles onto spatial tissue locations.
3. **DNA Methylation Spatial Mapping**: Integrate single-cell methylation data with spatial context.
4. **Multi-Modal Fusion**: Combine transcriptomic, epigenomic, and proteomic layers.
5. **Probabilistic Cell-Type Assignment**: Assign cell types to spatial spots with uncertainty quantification.
6. **Spatial Niche Identification**: Discover cellular niches defined by multi-omic signatures.
## Supported Modalities
| Modality | Input Format | Spatial Reference |
|----------|--------------|-------------------|
| scRNA-seq | AnnData, Seurat | Visium, MERFISH, Xenium |
| scATAC-seq | SnapATAC2, ArchR | Visium, Slide-seq |
| scMethyl | Bismark, allcools | Any spatial modality |
| CITE-seq (protein) | AnnData | Spatial proteomics |
| Multi-ome (RNA+ATAC) | Muon, SnapATAC2 | All platforms |
## Integration Algorithm
| Step | Method | Purpose |
|------|--------|---------|
| Feature Selection | HVG + marker genes | Reduce dimensionality |
| Embedding | Variational autoencoder | Shared latent space |
| Alignment | Optimal transport | Probabilistic matching |
| Spatial Mapping | Gaussian processes | Smooth spatial predictions |
| Uncertainty | Posterior sampling | Confidence intervals |
## Workflow
1. **Input**: Spatial transcriptomics (Visium/MERFISH/Xenium), reference single-cell multi-omics.
2. **Preprocessing**: Normalize, select features, QC both datasets.
3. **Embedding**: Learn joint latent representation across modalities.
4. **Probabilistic Alignment**: Compute cell-to-spot assignment probabilities.
5. **Spatial Imputation**: Transfer modalities to spatial coordinates.
6. **Niche Analysis**: Identify spatial domains by multi-omic signatures.
7. **Output**: Integrated spatial multi-omics object, niche assignments, visualizations.
## Example Usage
**User**: "Integrate our scRNA-seq and scATAC-seq data with the spatial transcriptomics to understand chromatin states in different tissue regions."
**Agent Action**:
```bash
python3 Skills/Genomics/SIMO_Multiomics_Integration_Agent/simo_integration.py \
--spatial_data visium_data.h5ad \
--scrna_ref scrna_atlas.h5ad \
--scatac_ref scatac_atlas.h5ad \
--modalities rna,atac \
--n_spots_per_cell 5 \
--uncertainty_quantification true \
--output integrated_spatial_multiome.h5ad
```
## Output Components
| Output | Description | Format |
|--------|-------------|--------|
| Integrated Object | Multi-modal spatial data | AnnData/Muon |
| Cell Type Map | Spatial cell type assignments | GeoTIFF, CSV |
| Chromatin Accessibility Map | Spatial ATAC patterns | BigWig, CSV |
| Niche Assignments | Spatial domain labels | CSV, Zarr |
| Uncertainty Maps | Per-spot confidence | GeoTIFF |
| Gene Activity Scores | ATAC-derived gene activity | AnnData layer |
## Spatial Platforms Supported
| Platform | Resolution | Spots/Cells | Genes |
|----------|------------|-------------|-------|
| 10x Visium | 55 μm | ~5,000 | Whole transcriptome |
| 10x Visium HD | 8 μm | ~300,000 | Whole transcriptome |
| 10x Xenium | Subcellular | >100,000 | 300-5,000 panel |
| MERFISH | Subcellular | >1M | 100-10,000 panel |
| Slide-seq | 10 μm | ~60,000 | Whole transcriptome |
| CosMx | Subcellular | >1M | 1,000-6,000 panel |
## AI/ML Components
**Variational Integration**:
- Multi-modal VAE for joint embeddings
- Contrastive learning for modality alignment
- Batch correction across datasets
**Probabilistic Mapping**:
- Optimal transport with entropic regularization
- Gaussian process spatial smoothing
- Bayesian uncertainty estimation
**Niche Discovery**:
- Multi-view clustering
- Spatial autocorrelation (Moran's I)
- Graph neural networks for niche boundaries
## Prerequisites
* Python 3.10+
* Scanpy, Squidpy, Muon
* scvi-tools, SnapATAC2
* POT (Python Optimal Transport)
* PyTorch, GPyTorch
## Related Skills
* scGPT_Agent - For foundation model embeddings
* Spatial_Epigenomics_Agent - For spatial epigenomics analysis
* Cell_Cell_Communication - For ligand-receptor analysis
* Nicheformer_Spatial_Agent - For spatial niche modeling
## Special Considerations
1. **Batch Effects**: Pre-align datasets from different protocols
2. **Spot Deconvolution**: Lower resolution platforms need deconvolution
3. **Sparsity**: scATAC data requires aggregation strategies
4. **Compute**: Multi-modal integration is memory-intensive
5. **Validation**: Verify spatial patterns with known marker distributions
## Applications
| Application | Use Case |
|-------------|----------|
| Tumor Microenvironment | Map chromatin states of immune infiltrates |
| Development | Track lineage chromatin dynamics spatially |
| Neurodegeneration | Spatial mapping of epigenetic changes |
| Fibrosis | Understand spatial activation programs |
## Author
AI Group - Biomedical AI PlatformRelated Skills
omero-integration
Microscopy data management platform. Access images via Python, retrieve datasets, analyze pixels, manage ROIs/annotations, batch processing, for high-content screening and microscopy workflows.
jira-integration
Agent Skill: Comprehensive Jira integration through lightweight Python scripts. AUTOMATICALLY TRIGGER when user mentions Jira URLs like 'https://jira.*/browse/*', 'https://*.atlassian.net/browse/*', or issue keys like 'PROJ-123'. Use when searching issues (JQL), getting/updating issue details, creating issues, transitioning status, adding comments, logging worklogs, managing sprints and boards, creating issue links, or formatting Jira wiki markup. If authentication fails, offer to configure credentials interactively. Supports both Jira Cloud and Server/Data Center with automatic authentication detection. By Netresearch.
Directus AI Assistant Integration
Build AI-powered features in Directus: chat interfaces, content generation, smart suggestions, and copilot functionality
ai-integration
AI/LLM integration patterns - Claude API, fal.ai, streaming, tool use
accessibility-object-model-integration
Programmatic manipulation of the accessibility tree to support complex custom controls in React.
academic-data-integration
When the user needs to integrate multiple data sources (Canvas API, user memory, file systems) to create comprehensive academic reports. This skill combines course information, assignment details, submission status, and user context to generate actionable insights. Triggers include requests that involve cross-referencing multiple data sources or creating consolidated academic reports from disparate systems.
302ai-api-integration
ALWAYS use this skill when user needs ANY API functionality (AI models, image generation, video, audio, text processing, etc.). Automatically search 302.AI's 1400+ APIs and generate integration code. Use proactively whenever APIs or AI capabilities are mentioned.
1k-defi-module-integration
Interactive guide for integrating new DeFi modules or protocols into OneKey. Use when adding new DeFi features like staking protocols, lending markets, or entirely new DeFi modules. Triggers on DeFi, protocol, integration, Earn, Borrow, staking, lending, supply, borrow, withdraw, repay, claim, new module, Pendle, Aave, Compound.
workflow-integration-git
Git commit workflow with conventional commits, artifact cleanup, and optional push/PR creation
Project Setup and CCAGI Integration
Complete project initialization including Node.js/TypeScript setup, GitHub integration, and CCAGI framework integration. Use when creating new projects or integrating CCAGI components.
integrations
NextDNS integration guides for third-party platforms and services. This skill should be used when integrating NextDNS with routers, home automation systems, network management platforms, or other third-party services. Triggers on tasks involving Tailscale, Home Assistant, Ubiquiti, pfSense, OpenWrt, or other platform-specific NextDNS configurations.
integration-workflows
Cross-MCP workflows that coordinate multiple systems (Linear, GitHub, n8n, Slack) for end-to-end automation. Captures patterns that span tool boundaries.