ensembl-sequence-retrieval

Retrieve genomic sequences from Ensembl database using transcript or gene IDs to obtain nucleotide and protein sequences.

157 stars

Best use case

ensembl-sequence-retrieval is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Retrieve genomic sequences from Ensembl database using transcript or gene IDs to obtain nucleotide and protein sequences.

Teams using ensembl-sequence-retrieval should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/ensembl-sequence-retrieval/SKILL.md --create-dirs "https://raw.githubusercontent.com/InternScience/DrClaw/main/drclaw/agent_hub/templates/genomics/skills/ensembl-sequence-retrieval/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/ensembl-sequence-retrieval/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How ensembl-sequence-retrieval Compares

Feature / Agentensembl-sequence-retrievalStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Retrieve genomic sequences from Ensembl database using transcript or gene IDs to obtain nucleotide and protein sequences.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Ensembl Sequence Retrieval

## Usage

### 1. MCP Server Definition

```python
import asyncio
import json
from mcp.client.streamable_http import streamablehttp_client
from mcp import ClientSession

class OrigeneClient:
    """Origene-Ensembl MCP Client"""

    def __init__(self, server_url: str, api_key: str):
        self.server_url = server_url
        self.api_key = api_key
        self.session = None

    async def connect(self):
        try:
            self.transport = streamablehttp_client(
                url=self.server_url,
                headers={"SCP-HUB-API-KEY": self.api_key}
            )
            self.read, self.write, self.get_session_id = await self.transport.__aenter__()
            self.session_ctx = ClientSession(self.read, self.write)
            self.session = await self.session_ctx.__aenter__()
            await self.session.initialize()
            return True
        except Exception as e:
            print(f"✗ connect failure: {e}")
            return False

    async def disconnect(self):
        try:
            if self.session:
                await self.session_ctx.__aexit__(None, None, None)
            if hasattr(self, 'transport'):
                await self.transport.__aexit__(None, None, None)
        except Exception as e:
            print(f"✗ disconnect error: {e}")

    def parse_result(self, result):
        if isinstance(result, dict):
            content_list = result.get("content") or []
        else:
            content_list = getattr(result, "content", []) or []
        texts = []
        for item in content_list:
            if isinstance(item, dict):
                if item.get("type") == "text":
                    texts.append(item.get("text") or "")
            else:
                if getattr(item, "type", None) == "text":
                    texts.append(getattr(item, "text", "") or "")
        return "".join(texts)
```

### 2. Sequence Retrieval Workflow

**Implementation:**

```python
## Initialize client
client = OrigeneClient(
    "https://scp.intern-ai.org.cn/api/v1/mcp/12/Origene-Ensembl",
    "<your-api-key>"
)

if not await client.connect():
    print("connection failed")
    exit()

## Retrieve sequence by Ensembl ID
result = await client.session.call_tool(
    "get_sequence_id",
    arguments={
        "id": "ENST00000380152"
    }
)

result_data = client.parse_result(result)
print(result_data)

await client.disconnect()
```

### Tool Descriptions

**Origene-Ensembl Server:**
- `get_sequence_id`: Retrieve sequence by Ensembl ID
  - Args:
    - `id` (str): Ensembl transcript or gene ID
  - Returns: Genomic or protein sequence

### Use Cases

- Genome annotation
- Comparative genomics
- Transcript analysis
- Protein sequence retrieval

Related Skills

ncbi-gene-retrieval

157
from InternScience/DrClaw

Retrieve gene information from NCBI Gene database by gene IDs to obtain genomic details, function, and expression data.

uniprot-protein-retrieval

157
from InternScience/DrClaw

Retrieve protein sequences and functional information from UniProt database by protein name, enabling protein analysis and bioinformatics workflows.

dna-rna-sequence-analysis

157
from InternScience/DrClaw

Analyze DNA and RNA sequences including molecular weight calculation, reverse complement generation, and oligonucleotide properties.

compound-name-retrieval

154
from InternScience/DrClaw

Retrieve SMILES strings from PubChem database using compound names to obtain molecular structures from common chemical names.

acpx

157
from InternScience/DrClaw

Use the ACPX CLI through DrClaw's existing exec/long_exec tools to run Codex in the current project workspace.

ui-ux-pro-max

157
from InternScience/DrClaw

[Frontend] Frontend UI/UX design intelligence - activate FIRST when user requests beautiful, stunning, gorgeous, or aesthetic interfaces. 50 styles, 21 palettes, 50 font pairings, 20 charts, 8 stacks. Triggers on ui design, ux design, design system, color palette, typography, glassmorphism, claymorphism, neumorphism, bento grid, font pairing, ui-ux-pro-max, stunning interface, beautiful ui.

fetch

157
from InternScience/DrClaw

Fetch metadata and links from arXiv for a given query.

web_literature_mining

157
from InternScience/DrClaw

Scientific Literature Mining - Mine scientific literature: PubMed search, arXiv search, web search, and Tavily deep search. Use this skill for scientific informatics tasks involving pubmed search search literature search web tavily search. Combines 4 tools from 2 SCP server(s).

uniprot_deep_analysis

157
from InternScience/DrClaw

UniProt Deep Protein Analysis - Deep UniProt analysis: entry data, UniRef clusters, UniParc cross-references, and gene-centric view. Use this skill for protein science tasks involving get uniprotkb entry by accession get uniref cluster by id get uniparc entry by upi get gene centric by accession. Combines 4 tools from 1 SCP server(s).

synthetic_biology_design

157
from InternScience/DrClaw

Synthetic Biology Design - Design synthetic biology construct: gene lookup, codon optimization, protein property prediction, and structure prediction. Use this skill for synthetic biology tasks involving get sequence id DegenerateCodonCalculatorbyAminoAcid calculate protein sequence properties pred protein structure esmfold. Combines 4 tools from 4 SCP server(s).

structural_homology_modeling

157
from InternScience/DrClaw

Structural Homology & Evolution Analysis - Analyze protein evolution: get gene tree from Ensembl, find homologs, compare sequences, and predict structure. Use this skill for evolutionary biology tasks involving get homology symbol get genetree member symbol calculate protein sequence properties pred protein structure esmfold. Combines 4 tools from 3 SCP server(s).

proteome_analysis

157
from InternScience/DrClaw

Proteome-Level Analysis - Analyze at proteome level: get proteome from UniProt, gene-centric view, functional annotation from STRING. Use this skill for proteomics tasks involving get proteome by id get gene centric by proteome get functional annotation. Combines 3 tools from 2 SCP server(s).