devtu-auto-discover-apis

Automatically discover life science APIs online, create ToolUniverse tools, validate them, and prepare integration PRs. Performs gap analysis to identify missing tool categories, web searches for APIs, automated tool creation using devtu-create-tool patterns, validation with devtu-fix-tool, and git workflow management. Use when expanding ToolUniverse coverage, adding new API integrations, or systematically discovering scientific resources.

1,202 stars

bymims-harvard

View on GitHub Installation ↓

Best use case

devtu-auto-discover-apis is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using devtu-auto-discover-apis should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/devtu-auto-discover-apis/SKILL.md --create-dirs "https://raw.githubusercontent.com/mims-harvard/ToolUniverse/main/skills/devtu-auto-discover-apis/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/devtu-auto-discover-apis/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How devtu-auto-discover-apis Compares

Feature / Agent	devtu-auto-discover-apis	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

AI Agents for Coding

Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.

Cursor vs Codex for AI Workflows

Compare Cursor and Codex for AI coding workflows, repository assistance, debugging, refactoring, and reusable developer skills.

AI Agent for Product Research

Browse AI agent skills for product research, competitive analysis, customer discovery, and structured product decision support.

SKILL.md Source

# Automated Life Science API Discovery & Tool Creation

Discover, create, validate, and integrate life science APIs into ToolUniverse.

## Four-Phase Workflow

```
Gap Analysis → API Discovery → Tool Creation → Validation → Integration
     ↓              ↓               ↓              ↓            ↓
  Coverage      Web Search      devtu-create   devtu-fix    Git PR
```

Human approval gates after: discovery, creation, validation, and before PR.

---

## Phase 1: Discovery & Gap Analysis

### 1.1 Analyze Current Coverage
Load ToolUniverse, categorize tools by domain (genomics, proteomics, drug discovery, clinical, omics, imaging, literature, pathways, systems biology). Count per category.

### 1.2 Identify Gap Domains
- **Critical Gap**: <5 tools in category
- **Moderate Gap**: 5-15 tools, missing key subcategories
- **Emerging Gap**: New technologies not represented

Common gaps: single-cell genomics, metabolomics, patient registries, microbial genomics, multi-omics integration, synthetic biology, toxicology.

### 1.3 Web Search for APIs
For each gap domain, run multiple queries:
1. `"[domain] API REST JSON"` — direct API search
2. `"[domain] public database"` — database discovery
3. `"[domain] API 2025 OR 2026"` — recent releases
4. `"[domain] database" site:nar.oxfordjournals.org` — NAR Database Issue

Extract: base URL, endpoints, auth method, parameter schemas, rate limits.

### 1.4 Score and Prioritize

| Criterion | Max Points |
|-----------|------------|
| Documentation Quality | 20 |
| API Stability | 15 |
| Authentication Simplicity | 15 |
| Coverage | 15 |
| Maintenance | 10 |
| Community | 10 |
| License | 10 |
| Rate Limits | 5 |

High priority (>=70), Medium (50-69), Low (<50).

### 1.5 Generate Discovery Report
Coverage analysis, prioritized candidates with scores, implementation roadmap.

---

## Phase 2: Tool Creation

For each API, use `Skill(skill="devtu-create-tool")` or follow these patterns.

### Architecture Decision
- Multiple endpoints → multi-operation tool (single class, multiple JSON wrappers)
- Single endpoint → single-operation acceptable

### Key Steps
1. Design tool class following template — see [references/tool-templates.md](references/tool-templates.md)
2. Create JSON config with oneOf return_schema
3. Find real test examples (use List endpoint → extract IDs → verify)
4. Register in `default_config.py`

### Critical Requirements
- return_schema MUST have `oneOf` (success + error schemas)
- test_examples MUST use real IDs (NO placeholders)
- Tool name <= 55 characters
- NEVER raise exceptions in `run()` — return error dict
- Set timeout on all HTTP requests (30s)

---

## Phase 3: Validation

Full guide: [references/validation-guide.md](references/validation-guide.md)

### Quick Validation Checklist
1. **Schema**: oneOf structure, data wrapper, error field
2. **Placeholders**: No TEST/DUMMY/PLACEHOLDER in test_examples
3. **Loading**: 3-step check (class registered, config registered, wrappers generated)
4. **Integration tests**: `python scripts/test_new_tools.py [api_name] -v` → 100% pass

Fix failures with `Skill(skill="devtu-fix-tool")`.

---

## Phase 4: Integration

Use `Skill(skill="devtu-github")` or:
1. Create branch: `feature/add-[api-name]-tools`
2. Stage tool files + default_config.py
3. Commit with descriptive message
4. Push and create PR with validation results

---

## Processing Patterns

| Pattern | When to Use |
|---------|------------|
| **Batch** (multiple APIs → single PR) | Same domain, similar structure |
| **Iterative** (one API at a time) | Complex auth, novel patterns |
| **Discovery-only** (report, no tools) | Planning roadmap |
| **Validation-only** (audit existing) | PR review, quality check |

---

## References

- **Tool templates** (Python class + JSON config): [references/tool-templates.md](references/tool-templates.md)
- **Validation & integration guide**: [references/validation-guide.md](references/validation-guide.md)

Related Skills

tooluniverse-small-molecule-discovery

1202

from mims-harvard/ToolUniverse

Find, characterize, and source small molecules for chemical biology and drug discovery. Covers compound identification (PubChem, ChEMBL), structure search, binding affinity data, ADMET/drug-likeness prediction, and commercial availability (eMolecules, Enamine). Use when asked to find compounds, assess drug-likeness, search by structure, retrieve binding affinities, or source chemicals.

tooluniverse-gwas-drug-discovery

1202

from mims-harvard/ToolUniverse

Transform GWAS signals into actionable drug targets and repurposing opportunities. Performs locus-to-gene mapping, target druggability assessment, existing drug identification, safety profile evaluation, and clinical trial matching. Use when discovering drug targets from GWAS data, finding drug repurposing opportunities from genetic associations, or translating GWAS findings into therapeutic leads.

tooluniverse-dataset-discovery

1202

from mims-harvard/ToolUniverse

Find and evaluate research datasets for any scientific question. Teaches how to reason about data needs, search across public repositories, evaluate dataset fitness, and identify access requirements. Use whenever users ask to find data, search for datasets, identify cohort studies, or need data for analysis. Also use when users ask about a specific survey or cohort (NHANES, HRS, UK Biobank, TCGA, etc.), when they want to know what data exists for a research question, or when they need to compare available data sources. If the user mentions "where can I get data" or "is there a dataset for X", this is the right skill.

tooluniverse-binder-discovery

1202

from mims-harvard/ToolUniverse

Discover novel small molecule binders for protein targets using structure-based and ligand-based approaches. Creates actionable reports with candidate compounds, ADMET profiles, and synthesis feasibility. Use when users ask to find small molecules for a target, identify novel binders, perform virtual screening, or need hit-to-lead compound identification.

devtu-self-evolve

1202

from mims-harvard/ToolUniverse

Orchestrate the full ToolUniverse self-improvement cycle: discover APIs, create tools, test with researcher personas, fix issues, optimize skills, and push via git. References and dispatches to all other devtu skills. Use when asked to: run the self-improvement loop, do a debug/test round, expand tool coverage, improve tool quality, or evolve ToolUniverse.

devtu-optimize-skills

1202

from mims-harvard/ToolUniverse

Optimize ToolUniverse skills for better report quality, evidence handling, and user experience. Apply patterns like tool verification, foundation data layers, disambiguation-first, evidence grading, quantified completeness, and report-only output. Use when reviewing skills, improving existing skills, or creating new ToolUniverse research skills.

devtu-optimize-descriptions

1202

from mims-harvard/ToolUniverse

Optimize tool descriptions in ToolUniverse JSON configs for clarity and usability. Reviews descriptions for missing prerequisites, unexpanded abbreviations, unclear parameters, and missing usage guidance. Use when reviewing tool descriptions, improving API documentation, or when user asks to check if tools are easy to understand.

devtu-github

1202

from mims-harvard/ToolUniverse

GitHub workflow for ToolUniverse - push code safely by moving temp files, activating pre-commit hooks, running tests, and cleaning staged files. Use when pushing to GitHub, fixing CI failures, or cleaning up before commits.

devtu-fix-tool

1202

from mims-harvard/ToolUniverse

Fix failing ToolUniverse tools by diagnosing test failures, identifying root causes, implementing fixes, and validating solutions. Use when ToolUniverse tools fail tests, return errors, have schema validation issues, or when asked to debug or fix tools in the ToolUniverse framework.

devtu-docs-quality

1202

from mims-harvard/ToolUniverse

TOP PRIORITY skill — find and immediately fix or remove every piece of wrong, outdated, or redundant information in ToolUniverse docs. Wrong code, broken links, incorrect counts, and overlapping instructions must be fixed or removed — never left in place. Runs five phases: (D) static method scan, (C) live code execution, (A) automated validation, (B) ToolUniverse audit, (E) less-is-more simplification. Core philosophy: each concept appears exactly once; remove don't add; no emojis; single setup entry point. Use when reviewing docs, before releases, after API changes, or when asked to audit, fix, or simplify documentation.

devtu-create-tool

1202

from mims-harvard/ToolUniverse

Create new scientific tools for ToolUniverse framework with proper structure, validation, and testing. Use when users need to add tools to ToolUniverse, implement new API integrations, create tool wrappers for scientific databases/services, expand ToolUniverse capabilities, or follow ToolUniverse contribution guidelines. Supports creating tool classes, JSON configurations, validation, error handling, and test examples.

devtu-code-optimization

1202

from mims-harvard/ToolUniverse

Code quality patterns and guidelines for ToolUniverse tool development. Apply when writing, fixing, or refactoring tool Python code in the ToolUniverse project. Encodes lessons from 80+ debug rounds. Use alongside devtu-fix-tool and devtu-self-evolve. Triggers: implementing tool fixes, writing new tool classes, reviewing tool code quality, checking schema correctness, looking up API-specific bug fixes.