bio-proteomics-quantification
Protein quantification from mass spectrometry data including label-free (LFQ, intensity-based), isobaric labeling (TMT, iTRAQ), and metabolic labeling (SILAC) approaches. Use when extracting protein abundances from MS data for differential analysis.
Best use case
bio-proteomics-quantification is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Protein quantification from mass spectrometry data including label-free (LFQ, intensity-based), isobaric labeling (TMT, iTRAQ), and metabolic labeling (SILAC) approaches. Use when extracting protein abundances from MS data for differential analysis.
Teams using bio-proteomics-quantification should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/bio-proteomics-quantification/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How bio-proteomics-quantification Compares
| Feature / Agent | bio-proteomics-quantification | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Protein quantification from mass spectrometry data including label-free (LFQ, intensity-based), isobaric labeling (TMT, iTRAQ), and metabolic labeling (SILAC) approaches. Use when extracting protein abundances from MS data for differential analysis.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
## Version Compatibility
Reference examples tested with: MSnbase 2.28+, numpy 1.26+, pandas 2.2+
Before using code patterns, verify installed versions match. If versions differ:
- Python: `pip show <package>` then `help(module.function)` to check signatures
- R: `packageVersion('<pkg>')` then `?function_name` to verify parameters
If code throws ImportError, AttributeError, or TypeError, introspect the installed
package and adapt the example to match the actual API rather than retrying.
# Protein Quantification
**"Quantify proteins from my mass spec data"** → Extract protein abundances from MS data using label-free (LFQ, spectral counting), isobaric labeling (TMT, iTRAQ), or metabolic labeling (SILAC) approaches.
- R: `MSstats::dataProcess()` for feature-to-protein summarization
- Python: `pandas` for MaxLFQ-style normalization and ratio calculation
- R: `MSnbase` for isobaric tag reporter ion extraction
## Label-Free Quantification (LFQ)
### Intensity-Based (MaxLFQ Algorithm)
```python
import pandas as pd
import numpy as np
def maxlfq_normalize(intensities):
'''Simplified MaxLFQ normalization'''
log_int = np.log2(intensities.replace(0, np.nan))
# Median centering per sample
sample_medians = log_int.median(axis=0)
global_median = sample_medians.median()
normalized = log_int - sample_medians + global_median
return normalized
```
### Spectral Counting
```python
def spectral_count_normalize(counts, total_spectra):
'''Normalized spectral abundance factor (NSAF)'''
# Divide by protein length, then by total
nsaf = counts / total_spectra
return nsaf / nsaf.sum()
```
## TMT/iTRAQ Quantification
```r
library(MSnbase)
# Load reporter ion data
tmt_data <- readMSnSet('tmt_data.txt')
# Normalize with reference channel
tmt_normalized <- normalize(tmt_data, method = 'center.median')
# Summarize to protein level
protein_data <- combineFeatures(tmt_normalized, groupBy = fData(tmt_data)$protein,
fun = 'median')
```
### Python TMT Processing
```python
def extract_tmt_intensities(spectrum, reporter_mz, tolerance=0.003):
'''Extract TMT reporter ion intensities'''
mz, intensity = spectrum.get_peaks()
tmt_intensities = {}
for channel, target_mz in reporter_mz.items():
mask = np.abs(mz - target_mz) < tolerance
if mask.any():
tmt_intensities[channel] = intensity[mask].max()
else:
tmt_intensities[channel] = 0
return tmt_intensities
TMT_10PLEX = {'126': 126.127726, '127N': 127.124761, '127C': 127.131081,
'128N': 128.128116, '128C': 128.134436, '129N': 129.131471,
'129C': 129.137790, '130N': 130.134825, '130C': 130.141145,
'131': 131.138180}
```
## SILAC Quantification
```python
def calculate_silac_ratio(heavy_intensity, light_intensity):
'''Calculate SILAC H/L ratio'''
if light_intensity > 0 and heavy_intensity > 0:
return np.log2(heavy_intensity / light_intensity)
return np.nan
# Typical mass shifts
SILAC_SHIFTS = {
'Arg10': 10.008269, # 13C6 15N4 Arginine
'Lys8': 8.014199, # 13C6 15N2 Lysine
'Arg6': 6.020129, # 13C6 Arginine
'Lys6': 6.020129 # 13C6 Lysine
}
```
## MSstats Workflow (R)
**Goal:** Convert MaxQuant output into normalized protein-level abundance estimates using MSstats feature-to-protein summarization.
**Approach:** Reformat MaxQuant evidence and proteinGroups files into MSstats input format, then apply median equalization normalization with Tukey's median polish for protein-level summarization.
```r
library(MSstats)
# Prepare input from MaxQuant
maxquant_input <- MaxQtoMSstatsFormat(
evidence = read.table('evidence.txt', sep = '\t', header = TRUE),
proteinGroups = read.table('proteinGroups.txt', sep = '\t', header = TRUE),
annotation = read.csv('annotation.csv')
)
# Process and normalize
processed <- dataProcess(maxquant_input, normalization = 'equalizeMedians',
summaryMethod = 'TMP', censoredInt = 'NA')
# Protein-level summary
protein_summary <- quantification(processed)
```
## Related Skills
- data-import - Load MS data before quantification
- differential-abundance - Statistical testing after quantification
- expression-matrix/counts-ingest - Similar matrix handlingRelated Skills
tooluniverse-proteomics-analysis
Analyze mass spectrometry proteomics data including protein quantification, differential expression, post-translational modifications (PTMs), and protein-protein interactions. Processes MaxQuant, Spectronaut, DIA-NN, and other MS platform outputs. Performs normalization, statistical analysis, pathway enrichment, and integration with transcriptomics. Use when analyzing proteomics data, comparing protein abundance between conditions, identifying PTM changes, studying protein complexes, integrating protein and RNA data, discovering protein biomarkers, or conducting quantitative proteomics experiments.
bio-splicing-quantification
Quantifies alternative splicing events (PSI/percent spliced in) from RNA-seq using SUPPA2 from transcript TPM or rMATS-turbo from BAM files. Calculates inclusion levels for skipped exons, alternative splice sites, mutually exclusive exons, and retained introns. Use when measuring splice site usage or isoform ratios from RNA-seq data.
bio-spatial-transcriptomics-spatial-proteomics
Analyzes spatial proteomics data from CODEX, IMC, and MIBI platforms including cell segmentation and protein colocalization. Use when working with multiplexed imaging data, analyzing protein spatial patterns, or integrating spatial proteomics with transcriptomics.
bio-proteomics-spectral-libraries
Build, manage, and search spectral libraries for proteomics. Use when creating or working with spectral libraries for DIA analysis. Covers DDA-based library generation, predicted libraries (Prosit, DeepLC), and library formats.
bio-proteomics-ptm-analysis
Post-translational modification analysis including phosphorylation, acetylation, and ubiquitination. Covers site localization, motif analysis, and quantitative PTM analysis. Use when analyzing phosphoproteomic data or other modification-enriched samples.
bio-proteomics-proteomics-qc
Quality control and assessment for proteomics data. Use when evaluating proteomics data quality before downstream analysis. Covers sample metrics, missing value patterns, replicate correlation, batch effects, and intensity distributions.
bio-proteomics-protein-inference
Protein grouping and inference from peptide identifications. Use when resolving protein ambiguity from shared peptides. Handles protein groups and protein-level FDR control using parsimony and probabilistic approaches.
bio-proteomics-peptide-identification
Peptide-spectrum matching and protein identification from MS/MS data. Use when identifying peptides from tandem mass spectra. Covers database searching, spectral library matching, and FDR estimation using target-decoy approaches.
bio-proteomics-differential-abundance
Statistical testing for differentially abundant proteins between conditions. Covers limma and MSstats workflows with multiple testing correction. Use when identifying proteins with significant abundance changes between experimental groups.
bio-proteomics-dia-analysis
Data-independent acquisition (DIA) proteomics analysis with DIA-NN and other tools. Use when analyzing DIA mass spectrometry data with library-free or library-based workflows for deep proteome profiling.
bio-proteomics-data-import
Load and parse mass spectrometry data formats including mzML, mzXML, and quantification tool outputs like MaxQuant proteinGroups.txt. Use when starting a proteomics analysis with raw or processed MS data. Handles contaminant filtering and missing value assessment.
zinc-database
Access ZINC (230M+ purchasable compounds). Search by ZINC ID/SMILES, similarity searches, 3D-ready structures for docking, analog discovery, for virtual screening and drug discovery.