FAIR Data Principles — Findable, Accessible, Interoperable, Reusable

## Overview

912 stars

Best use case

FAIR Data Principles — Findable, Accessible, Interoperable, Reusable is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

## Overview

Teams using FAIR Data Principles — Findable, Accessible, Interoperable, Reusable should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/fair-data/SKILL.md --create-dirs "https://raw.githubusercontent.com/wu-yc/LabClaw/main/skills/general/fair-data/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/fair-data/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How FAIR Data Principles — Findable, Accessible, Interoperable, Reusable Compares

Feature / AgentFAIR Data Principles — Findable, Accessible, Interoperable, ReusableStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

## Overview

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# FAIR Data Principles — Findable, Accessible, Interoperable, Reusable

## Overview
Guidelines for making scientific data FAIR: Findable, Accessible, Interoperable, and Reusable.

## Findable
- Assign globally unique persistent identifiers (DOIs) to datasets
- Rich metadata describing the dataset (title, authors, description, keywords, dates)
- Metadata registered in searchable resources (DataCite, re3data, FAIRsharing)
- Data indexed in domain-specific repositories

## Accessible
- Data retrievable by identifier using standardized protocol (HTTP, FTP)
- Metadata accessible even if data is restricted
- Authentication/authorization where necessary, clearly documented
- Long-term preservation plan (minimum 10 years for funded research)

## Interoperable
- Use formal, shared vocabularies (ontologies: GO, ChEBI, EFO, MeSH)
- Standard file formats (CSV, JSON, HDF5, NetCDF — not proprietary)
- Include references to related datasets and publications
- Machine-readable metadata (JSON-LD, Dublin Core, schema.org)

## Reusable
- Clear data usage license (CC-BY, CC0 recommended for scientific data)
- Detailed provenance (how data was collected, processed, quality controlled)
- Meet community standards (MIAME for microarrays, MINSEQE for sequencing)
- Version control for datasets that evolve

## Recommended Repositories
| Domain | Repository |
|--------|-----------|
| General | Zenodo, Figshare, Dryad |
| Genomics | GEO, SRA, ENA |
| Proteomics | PRIDE, MassIVE |
| Structures | PDB, EMDB |
| Clinical | ClinicalTrials.gov, YODA |
| Chemistry | ChEMBL, PubChem |
| Materials | NOMAD, Materials Cloud |

Related Skills

zinc-database

912
from wu-yc/LabClaw

Access ZINC (230M+ purchasable compounds). Search by ZINC ID/SMILES, similarity searches, 3D-ready structures for docking, analog discovery, for virtual screening and drug discovery.

pubchem-database

912
from wu-yc/LabClaw

Query PubChem via PUG-REST API/PubChemPy (110M+ compounds). Search by name/CID/SMILES, retrieve properties, similarity/substructure searches, bioactivity, for cheminformatics.

opentargets-database

912
from wu-yc/LabClaw

Query Open Targets Platform for target-disease associations, drug target discovery, tractability/safety data, genetics/omics evidence, known drugs, for therapeutic target identification.

fda-database

912
from wu-yc/LabClaw

Query openFDA API for drugs, devices, adverse events, recalls, regulatory submissions (510k, PMA), substance identification (UNII), for FDA regulatory data analysis and safety research.

drugbank-database

912
from wu-yc/LabClaw

Access and analyze comprehensive drug information from the DrugBank database including drug properties, interactions, targets, pathways, chemical structures, and pharmacology data. This skill should be used when working with pharmaceutical data, drug discovery research, pharmacology studies, drug-drug interaction analysis, target identification, chemical similarity searches, ADMET predictions, or any task requiring detailed drug and drug target information from DrugBank.

datamol

912
from wu-yc/LabClaw

Pythonic wrapper around RDKit with simplified interface and sensible defaults. Preferred for standard drug discovery including SMILES parsing, standardization, descriptors, fingerprints, clustering, 3D conformers, parallel processing. Returns native rdkit.Chem.Mol objects. For advanced control or custom parameters, use rdkit directly.

chembl-database

912
from wu-yc/LabClaw

Query ChEMBL bioactive molecules and drug discovery data. Search compounds by structure/properties, retrieve bioactivity data (IC50, Ki), find inhibitors, perform SAR studies, for medicinal chemistry.

uspto-database

912
from wu-yc/LabClaw

Access USPTO APIs for patent/trademark searches, examination history (PEDS), assignments, citations, office actions, TSDR, for IP analysis and prior art searches.

pubmed-database

912
from wu-yc/LabClaw

Direct REST API access to PubMed. Advanced Boolean/MeSH queries, E-utilities API, batch processing, citation management. For Python workflows, prefer biopython (Bio.Entrez). Use this for direct HTTP/REST work or custom API implementations.

openalex-database

912
from wu-yc/LabClaw

Query and analyze scholarly literature using the OpenAlex database. This skill should be used when searching for academic papers, analyzing research trends, finding works by authors or institutions, tracking citations, discovering open access publications, or conducting bibliometric analysis across 240M+ scholarly works. Use for literature searches, research output analysis, citation analysis, and academic database queries.

clinvar-database

912
from wu-yc/LabClaw

Query NCBI ClinVar for variant clinical significance. Search by gene/position, interpret pathogenicity classifications, access via E-utilities API or FTP, annotate VCFs, for genomic medicine.

clinicaltrials-database

912
from wu-yc/LabClaw

Query ClinicalTrials.gov via API v2. Search trials by condition, drug, location, status, or phase. Retrieve trial details by NCT ID, export data, for clinical research and patient matching.