cohort-study-quality-assessment-nos
Evaluates the quality of cohort studies using the Newcastle-Ottawa Scale (NOS). Use when the user provides a cohort study article or text and needs a quality assessment report.
Best use case
cohort-study-quality-assessment-nos is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Evaluates the quality of cohort studies using the Newcastle-Ottawa Scale (NOS). Use when the user provides a cohort study article or text and needs a quality assessment report.
Teams using cohort-study-quality-assessment-nos should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/cohort-study-quality-assessment-nos/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How cohort-study-quality-assessment-nos Compares
| Feature / Agent | cohort-study-quality-assessment-nos | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Evaluates the quality of cohort studies using the Newcastle-Ottawa Scale (NOS). Use when the user provides a cohort study article or text and needs a quality assessment report.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
> **Source**: [https://github.com/aipoch/medical-research-skills](https://github.com/aipoch/medical-research-skills)
# Cohort Study Quality Assessment (NOS)
This skill evaluates the quality of a cohort study based on the Newcastle-Ottawa Scale (NOS). It analyzes Selection, Comparability, and Outcome categories and generates a scored report.
## When to Use
- Use this skill when the request matches its documented task boundary.
- Use it when the user can provide the required inputs and expects a structured deliverable.
- Prefer this skill for repeatable, checklist-driven execution rather than open-ended brainstorming.
## Key Features
- Scope-focused workflow aligned to: Evaluates the quality of cohort studies using the Newcastle-Ottawa Scale (NOS). Use when the user provides a cohort study article or text and needs a quality assessment report.
- Packaged executable path(s): `scripts/calculate_nos_score.py` plus 1 additional script(s).
- Reference material available in `references/` for task-specific guidance.
- Structured execution path designed to keep outputs consistent and reviewable.
## Dependencies
See `## Prerequisites` above for related details.
- `Python`: `3.10+`. Repository baseline for current packaged skills.
- `Third-party packages`: `not explicitly version-pinned in this skill package`. Add pinned versions if this skill needs stricter environment control.
## Example Usage
See `## Usage` above for related details.
```bash
cd "20260316/scientific-skills/Data Analytics/cohort-study-quality-assessment-nos"
python -m py_compile scripts/calculate_nos_score.py
python scripts/calculate_nos_score.py --help
```
Example run plan:
1. Confirm the user input, output path, and any required config values.
2. Edit the in-file `CONFIG` block or documented parameters if the script uses fixed settings.
3. Run `python scripts/calculate_nos_score.py` with the validated inputs.
4. Review the generated output and return the final artifact with any assumptions called out.
## Implementation Details
See `## Workflow` above for related details.
- Execution model: validate the request, choose the packaged workflow, and produce a bounded deliverable.
- Input controls: confirm the source files, scope limits, output format, and acceptance criteria before running any script.
- Primary implementation surface: `scripts/calculate_nos_score.py` with additional helper scripts under `scripts/`.
- Reference guidance: `references/` contains supporting rules, prompts, or checklists.
- Parameters to clarify first: input path, output path, scope filters, thresholds, and any domain-specific constraints.
- Output discipline: keep results reproducible, identify assumptions explicitly, and avoid undocumented side effects.
## Prerequisites
- Python 3.x
- PyPDF2 library (`pip install PyPDF2`)
- Access to `references/nos_criteria.md` for detailed evaluation criteria
## Usage
### Method 1: Direct Text Input
1. **Input**: The full text of the cohort study article (copy-pasted or provided by user).
2. **Process**:
* Extract study metadata (Author, Year).
* Evaluate the study against NOS criteria (Selection, Comparability, Outcome).
* Calculate the total score (number of stars).
3. **Output**: A Markdown table summarizing the assessment for each criterion and the overall score.
### Method 2: PDF File Input
1. **Input**: Path to a PDF file containing the cohort study.
2. **Process**:
* Use `scripts/extract_pdf.py` to extract text from the PDF.
* Review the extracted text for completeness.
* Proceed with Method 1 steps.
3. **Output**: Same as Method 1.
## Workflow
### Step 1: Extract Content
**If input is PDF:**
```bash
cd "D:\helix\test record\\262\skills_\skills\cohort-study-quality-assessment-nos"
python scripts/extract_pdf.py "path/to/your/file.pdf"
```
**Note**: The extracted text will be saved to `extracted_text.txt` in the current directory.
### Step 2: Analyze the Text
You must analyze the input text to extract information and evaluate it against the criteria defined in `references/nos_criteria.md`.
**1. Extract Metadata:**
* **Study**: First Author, Year (e.g., "Wang, 2018").
**2. Evaluate Selection (4 items):**
* **D1**: Representativeness of the Exposed Cohort (* or -)
- Look for: study design (retrospective/prospective), data source (hospital registry/population-based), inclusion/exclusion criteria clarity
- Give star if: representative sample from a defined population with clear criteria
* **D2**: Selection of the Non-Exposed Cohort (* or -)
- Look for: comparison group source (same community/hospital/different source)
- Give star if: drawn from the same source as exposed cohort
* **D3**: Ascertainment of Exposure (* or -)
- Look for: how exposure was determined (medical records/interview/self-report)
- Give star if: secure record (surgical records, pharmacy records, EMR) or structured interview
* **D4**: Outcome not present at start (* or -)
- Look for: exclusion of patients with outcome at baseline
- Give star if: demonstrated that outcome was not present at study start
**3. Evaluate Comparability (2 items):**
* **D5**: Age comparability (* or -)
- Look for: age matching or statistical adjustment for age
- Give star if: matched in design OR adjusted in analysis (not just "no difference" statement)
* **D6**: Additional comparability (* or -)
- Look for: matching/adjustment for other key confounders (BMI, disease severity, comorbidities)
- Give star if: matched in design OR adjusted in analysis for important factors beyond age
**4. Evaluate Outcome (3 items):**
* **D7**: Assessment of Outcome (* or -)
- Look for: outcome measurement method (blind assessment/secure records/self-report)
- Give star if: independent blind assessment, secure records (lab results, imaging), or record linkage
* **D8**: Enough follow-up (* or -)
- Look for: duration of follow-up
- Give star if: follow-up long enough for outcomes to occur (depends on disease/condition)
* **D9**: Adequacy of follow up (* or -)
- Look for: follow-up completion rate, description of losses
- Give star if: complete follow-up OR ≥80% follow-up with description of losses
- **Important**: If follow-up rate not reported or <80%, do NOT give star
For each item, determine if it meets the criteria for a star (*). If not, or if uncertain, mark as (-).
### Step 3: Format Data
Construct a JSON object with the results:
```json
{
"Study": "Wang, 2018",
"D1": "*",
"D2": "-",
"D3": "*",
"D4": "*",
"D5": "*",
"D6": "-",
"D7": "*",
"D8": "*",
"D9": "-"
}
```
**Scoring Notes:**
- Be conservative: If uncertain about meeting criteria, use "-"
- For D9 (follow-up adequacy): Common issue is lack of reported completion rate
- Document your reasoning for each item
### Step 4: Calculate Score and Generate Report
Run the python script to generate the final table:
```bash
python scripts/calculate_nos_score.py '<json_string>'
```
**Example:**
```bash
python scripts/calculate_nos_score.py "{\"Study\": \"Wei et al., 2026\", \"D1\": \"*\", \"D2\": \"*\", \"D3\": \"*\", \"D4\": \"*\", \"D5\": \"*\", \"D6\": \"*\", \"D7\": \"*\", \"D8\": \"*\", \"D9\": \"-\"}"
```
**Important**: JSON string must be properly escaped when passed via command line.
### Step 5: Generate Final Report
Return to user:
1. The generated Markdown table
2. Brief explanation of each scoring decision
3. Summary of study quality (High: ≥7 stars, Moderate: 4-6 stars, Low: <4 stars)
4. Key strengths and limitations
## Helper Scripts
### PDF Text Extraction
When the user provides a PDF file path, use `scripts/extract_pdf.py` to extract the text content before assessment:
**Features:**
- Extracts text from all pages
- Saves output to `extracted_text.txt`
- Handles path issues with spaces
- Provides progress feedback
**Usage:**
```bash
python scripts/extract_pdf.py "path/to/file.pdf"
```
**Output:**
- Console: Extraction progress and statistics
- File: `extracted_text.txt` in current working directory
## Quality Interpretation
| Score | Quality Level | Recommendation |
|-------|---------------|----------------|
| 9 stars | Excellent | Low risk of bias, high confidence |
| 7-8 stars | High quality | Acceptable for meta-analysis |
| 4-6 stars | Moderate quality | Consider in sensitivity analyses |
| <4 stars | Low quality | High risk of bias, use caution |
## Common Issues and Solutions
1. **PDF extraction fails**: Check if file exists and is not corrupted; try different PDF library (PyMuPDF)
2. **JSON parsing error**: Ensure proper escaping of quotes in command line
3. **Uncertain criteria**: When in doubt, be conservative and assign "-"
4. **Missing information**: Note in report that certain items could not be assessed
## References
- Detailed criteria: `references/nos_criteria.md`
- Wells GA, et al. The Newcastle-Ottawa Scale (NOS) for assessing the quality of nonrandomised studies in meta-analyses
## When Not to Use
- Do not use this skill when the required source data, identifiers, files, or credentials are missing.
- Do not use this skill when the user asks for fabricated results, unsupported claims, or out-of-scope conclusions.
- Do not use this skill when a simpler direct answer is more appropriate than the documented workflow.
## Required Inputs
- A clearly specified task goal aligned with the documented scope.
- All required files, identifiers, parameters, or environment variables before execution.
- Any domain constraints, formatting requirements, and expected output destination if applicable.
## Output Contract
- Return a structured deliverable that is directly usable without reformatting.
- If a file is produced, prefer a deterministic output name such as `cohort_study_quality_assessment_nos_result.md` unless the skill documentation defines a better convention.
- Include a short validation summary describing what was checked, what assumptions were made, and any remaining limitations.
## Validation and Safety Rules
- Validate required inputs before execution and stop early when mandatory fields or files are missing.
- Do not fabricate measurements, references, findings, or conclusions that are not supported by the provided source material.
- Emit a clear warning when credentials, privacy constraints, safety boundaries, or unsupported requests affect the result.
- Keep the output safe, reproducible, and within the documented scope at all times.
## Failure Handling
- If validation fails, explain the exact missing field, file, or parameter and show the minimum fix required.
- If an external dependency or script fails, surface the command path, likely cause, and the next recovery step.
- If partial output is returned, label it clearly and identify which checks could not be completed.
## Quick Validation
Run this minimal verification path before full execution when possible:
```bash
python scripts/calculate_nos_score.py --help
```
Expected output format:
```text
Result file: cohort_study_quality_assessment_nos_result.md
Validation summary: PASS/FAIL with brief notes
Assumptions: explicit list if any
```Related Skills
clinical-study-info-extractor
Batch extracts and verifies structured information (PMID, title, abstract, methodology, results, etc.) from clinical research literature using PMIDs. Use when the user wants to extract details from specific PMIDs.
study-design-scale-selector
Determines the appropriate Risk of Bias assessment scale for a medical study based on its design (RCT, Cohort, etc.), using PubMed metadata lookup or text analysis. Use when the user wants to know which quality assessment tool to use for a specific paper (given PMID or abstract).
rct-bias-assessment-rob2
Automates Risk of Bias 2 (ROB2) assessment for RCT papers by analyzing text against specific domains and synthesizing a report. Use when you need to assess the quality of a clinical trial paper or evaluate risk of bias.
quapas-quality-assessment-for-prognosis-studies
Evaluates bias in medical literature (prognosis studies) using QUAPAS criteria. Use when the user wants to assess the quality or risk of bias of a medical paper text.
quality-assessment
Automates critical appraisal and quality assessment for research papers by analyzing text against established methodological standards (such as risk of bias tools, quality checklists, or reporting guidelines) and synthesizing a structured evaluation report. Use when you need to assess the methodological quality, internal validity, or reporting completeness of any type of study—including RCTs, observational studies, systematic reviews, qualitative research, or diagnostic accuracy studies.
probast-quality-assessment-for-prediction-model-studies
Assess bias in medical prediction model studies using PROBAST tool. Use when user wants to evaluate the quality or risk of bias of a medical paper (text or PDF).
diagnostic-study-quality-assessment-quadas-2
Analyzes clinical diagnostic accuracy studies for bias using the QUADAS-2 tool. Use when Claude needs to assess the quality, risk of bias, or applicability of diagnostic accuracy studies (e.g., "Assess this paper using QUADAS-2").
case-control-study-quality-assessment-nos
Clinical Research Bias Assessment - Case-Control Study (NOS) v2.3.0. Use when you need to assess the bias of a case-control study using the Newcastle-Ottawa Scale (NOS) criteria, or when evaluating the quality of a medical paper.
study-limitations-drafter
Use study limitations drafter for academic writing workflows that need structured execution, explicit assumptions, and clear output boundaries.
translational-study-blueprint
Designs a translational blueprint for moving a biomedical finding toward diagnosis, prognosis, treatment response prediction, patient stratification, or therapeutic development, with explicit translational milestones, validation thresholds, and feasibility-sensitive route framing.
study-objective-refiner
Refines broad, vague, or aspirational biomedical research objectives into clear, bounded, measurable, executable, and downstream-ready study objective statements. Always use this skill when a user has a general aim such as “explore a mechanism,” “study prognosis,” “investigate biomarkers,” or “look at treatment response,” but the objective is still too broad, non-operational, or too ambiguous to support protocol framing, design selection, analysis planning, or hypothesis design. Never assume that polished wording alone means the objective is actionable. Focus first on objective type, missing operational elements, scope discipline, and downstream-ready formulation.
real-world-evidence-study-designer
Designs a structured real-world evidence study using EHR, claims, or registry data, with explicit handling of time zero, eligibility windows, exposure definitions, outcome windows, censoring, confounding control, and target-trial-emulation logic. Use this skill when the user needs study-type design and protocol framing for an observational clinical study based on routine-care data. Do not invent database fields, follow-up completeness, linkage, coding validity, or causal identifiability.