literature-experiment-extract

Extract experimental models, experimental methods, and biomarker information from paper Markdown (typically produced by PDF-to-Markdown tools) when a user provides paper Markdown and needs a structured, evidence-backed summary (1 Markdown + 3 CSVs).

53 stars

byaipoch

View on GitHub Installation ↓

Best use case

literature-experiment-extract is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using literature-experiment-extract should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/literature-experiment-extract/SKILL.md --create-dirs "https://raw.githubusercontent.com/aipoch/medical-research-skills/main/scientific-skills/Evidence Insight/literature-experiment-extract/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/literature-experiment-extract/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How literature-experiment-extract Compares

Feature / Agent	literature-experiment-extract	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

> **Source**: [https://github.com/aipoch/medical-research-skills](https://github.com/aipoch/medical-research-skills)

## When to Use

- You have a paper converted to Markdown (e.g., via PDF-to-Markdown) and need to extract **cell/animal models** used in experiments.
- You need a structured list of **experimental methods/protocols** described in the paper, with traceable evidence.
- You want to compile **biomarkers / detection indicators** (e.g., genes, proteins, assays, readouts) reported in the study.
- You need standardized outputs for downstream analysis: **one Markdown summary plus three CSV tables**.
- The paper Markdown includes page markers (e.g., `## Page XX`) and you want evidence organized **by page**.

## Key Features

- Extracts three entity groups from paper Markdown:
  - **Experimental models** (cell lines, animal models, strains, genotypes, etc.)
  - **Experimental methods** (assays, protocols, instruments, conditions)
  - **Biomarkers / indicators** (targets, readouts, measured variables)
- Produces **evidence-backed** results (citations/excerpts preserved and traceable to the source).
- Supports **page-aware evidence organization** when the input includes pagination headers like `## Page XX`.
- Outputs are fixed and standardized:
  - **1 Markdown summary**
  - **3 CSV files**: models / methods / biomarkers
- Uses a predefined template and extraction rules:
  - Requirements and consistency rules: `references/guide.md`
  - Output template: `assets/template.md`

## Dependencies

- None (documentation-driven workflow).
- Input assumption: paper content is available as **Markdown**, typically generated by a **PDF-to-Markdown** tool.

## Example Usage

### Input

A paper converted to Markdown, ideally with page headers:

```md
## Page 1
... text describing "C57BL/6 mice" and "Western blot" ...

## Page 2
... text describing "ELISA" and "IL-6 levels" ...
```

### Steps

1. Open the paper Markdown (typically produced by PDF-to-Markdown tools).
2. Extract **models**, **methods**, and **biomarkers** page by page.
3. Follow:
   - Extraction rules and evidence requirements: `references/guide.md`
   - Output template: `assets/template.md`
4. Output **exactly**:
   - `outputs/{Paper Abbreviation}-experiment-summary.md`
   - `outputs/{Paper Abbreviation}-models.csv`
   - `outputs/{Paper Abbreviation}-methods.csv`
   - `outputs/{Paper Abbreviation}-biomarkers.csv`

### Output (required)

- All final outputs must be **UTF-8** encoded.
- Output must be produced **directly** (no confirmation steps or optional branches).
- Evidence excerpts must remain in the **original language** of the source literature.

## Implementation Details

- **Input parsing**
  - Read the paper Markdown as the sole input source.
  - If pagination headers like `## Page XX` exist, prioritize attaching evidence to the corresponding page.

- **Extraction rules**
  - Apply entity definitions, allowed/expected fields, normalization rules, and evidence formatting as specified in `references/guide.md`.

- **Output formatting**
  - Generate outputs using `assets/template.md` as the canonical structure.
  - Add rows as needed while preserving evidence citations/excerpts.
  - The output set is fixed: **1 Markdown summary + 3 CSVs** (models/methods/biomarkers).

- **Paths and naming**
  - Default output directory: `outputs/`
  - Naming:
    - Markdown: `outputs/{Paper Abbreviation}-experiment-summary.md`
    - CSVs:
      - `outputs/{Paper Abbreviation}-models.csv`
      - `outputs/{Paper Abbreviation}-methods.csv`
      - `outputs/{Paper Abbreviation}-biomarkers.csv`

- **Language**
  - Output language should be **Chinese by default** (or the user-requested language if specified).
  - Evidence excerpts must remain in the **original language** of the source text.

Related Skills

pdf-extract

from aipoch/medical-research-skills

Extract PDF selectable text and full-page or segmented page images (including tables) into Markdown with per-page headings and image links; use when you need both readable text and page visuals for PPT creation, review, or analysis.

literatureimages-interpretation

from aipoch/medical-research-skills

Interpret figures in academic papers and their captions when the input is a PDF-to-Markdown document with page markers and image links, producing a structured Markdown report for extracting variables, trends, and conclusions.

literature-statistics

from aipoch/medical-research-skills

Generate statistics for publication-year and journal distributions from local references or PDFs; use when you need standardized Year/Journal tables and a summary without any network access.

literature-management

from aipoch/medical-research-skills

Import local literature into a managed library; trigger when you need offline deduplication, tagging, and a searchable index.

experiment-detail-comparator

from aipoch/medical-research-skills

Compare experimental method details between two Zotero PDF papers, identify protocol differences (ratios, dosages, timing, conditions), search supporting literature to explain why they differ, and generate an HTML report. Use when you need a parameter-level comparison of two methods and evidence-backed reasons for discrepancies.

pdf-extract-experimental-materials

from aipoch/medical-research-skills

Extract experimental materials and instrument information from PDFs (or PDF-derived text/Markdown) into three CSV tables; use when a paper/report contains sections like Materials and Methods, Key Resources Table, Reagents, Antibodies, Consumables, Software, Equipment, Instruments, or Reagent Preparation.

methodology-extractor

from aipoch/medical-research-skills

Batch extraction of experimental methods from multiple papers for protocol.

literature-filtering

from aipoch/medical-research-skills

Filter literature by publication year, journal, and predefined screening rules to produce inclusion/exclusion lists; use when conducting preliminary screening or systematic review screening to narrow the literature scope.

literature-extensive-read

from aipoch/medical-research-skills

Rapidly skim and summarize academic papers (default:PDF-to-Markdown full text with `## Page XX` pagination and image references) and output a structured extensive-reading summary in Markdown when you need to quickly understand research questions, methods, key results, conclusions, and decide whether intensive reading is worthwhile.

literature-close-read

from aipoch/medical-research-skills

Produce a structured close-reading report from a paper's full PDF-to-Markdown text (with `## Page XX` pagination and image references) when you need to systematically extract background, research questions, methods, results, limitations, and reproducible experimental details.

clinical-study-info-extractor

from aipoch/medical-research-skills

Batch extracts and verifies structured information (PMID, title, abstract, methodology, results, etc.) from clinical research literature using PMIDs. Use when the user wants to extract details from specific PMIDs.

outcome-extraction-for-clinical-trials

from aipoch/medical-research-skills

Clinical research outcome extraction for meta-analysis. Use when users need to extract outcome measures (binary, continuous, or survival data) from clinical research papers for systematic review and meta-analysis. Handles both database lookup by PMID and real-time LLM extraction.