xgboost

XGBoost gradient boosting library. Use for tabular ML.

7 stars

byG1Joshi

View on GitHub Installation ↓

Best use case

xgboost is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

XGBoost gradient boosting library. Use for tabular ML.

Teams using xgboost should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/xgboost/SKILL.md --create-dirs "https://raw.githubusercontent.com/G1Joshi/Agent-Skills/main/skills/ai-ml/xgboost/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/xgboost/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How xgboost Compares

Feature / Agent	xgboost	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

XGBoost gradient boosting library. Use for tabular ML.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# XGBoost

XGBoost is the winningest algorithm in Kaggle history for tabular data. v2.1 (2025) brings native **Blackwell** GPU support and Polars integration.

## When to Use

- **Tabular Data**: It usually beats Deep Learning on structured tables.
- **Speed**: Extremely optimized C++ backend.

## Core Concepts

### Gradient Boosting

Building extensive decision trees sequentially, each correcting the previous one's errors.

### DMatrix

Internal optimized data structure.

### Device Parameter

`device="cuda"` enables GPU acceleration.

## Best Practices (2025)

**Do**:

- **Use `device="cuda"`**: GPU training is 10x faster.
- **Use Early Stopping**: Stop training when validation error rises.
- **Pass Polars Dataframes**: No need to convert to Pandas/NumPy first.

**Don't**:

- **Don't use one-hot encoding**: Use native categorical support (`enable_categorical=True`).

## References

- [XGBoost Documentation](https://xgboost.readthedocs.io/)