haystack-pipeline

Haystack NLP pipeline configuration for document processing and QA

509 stars

bya5c-ai

View on GitHub Installation ↓

Best use case

haystack-pipeline is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Haystack NLP pipeline configuration for document processing and QA

Teams using haystack-pipeline should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/haystack-pipeline/SKILL.md --create-dirs "https://raw.githubusercontent.com/a5c-ai/babysitter/main/library/specializations/ai-agents-conversational/skills/haystack-pipeline/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/haystack-pipeline/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How haystack-pipeline Compares

Feature / Agent	haystack-pipeline	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Haystack NLP pipeline configuration for document processing and QA

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Haystack Pipeline Skill

## Capabilities

- Configure Haystack pipeline components
- Set up document stores and retrievers
- Implement reader/generator models
- Design custom pipeline graphs
- Configure preprocessing pipelines
- Implement evaluation pipelines

## Target Processes

- rag-pipeline-implementation
- intent-classification-system

## Implementation Details

### Core Components

1. **DocumentStores**: Elasticsearch, Weaviate, FAISS, etc.
2. **Retrievers**: BM25, Dense, Hybrid
3. **Readers/Generators**: Extractive and generative QA
4. **Preprocessors**: Document cleaning and splitting

### Pipeline Types

- Retrieval pipelines
- RAG pipelines
- Evaluation pipelines
- Indexing pipelines

### Configuration Options

- Component selection
- Pipeline graph design
- Document store backend
- Model selection
- Preprocessing settings

### Best Practices

- Modular pipeline design
- Proper preprocessing
- Evaluation integration
- Component versioning

### Dependencies

- haystack-ai
- farm-haystack (legacy)

Related Skills

cicd-pipeline-generator

509

from a5c-ai/babysitter

Generate CI/CD pipelines for SDK build and release

texture-pipeline

509

from a5c-ai/babysitter

Texture skill for compression, atlasing, and streaming.

nextflow-pipeline-executor

509

from a5c-ai/babysitter

Nextflow workflow management skill for reproducible bioinformatics pipelines

cicd-pipelines

509

from a5c-ai/babysitter

Multi-platform CI/CD pipeline expertise. Generate GitHub Actions, GitLab CI, Jenkins, and Azure Pipelines configurations. Analyze failures, optimize execution time, validate syntax, and configure matrix builds and caching strategies.

kubeflow-pipeline-executor

509

from a5c-ai/babysitter

Kubeflow Pipelines skill for ML workflow orchestration, component management, and Kubernetes-native ML.

etl-pipeline-builder

509

from a5c-ai/babysitter

Build and manage ETL pipelines for data migration with transformation, CDC, and monitoring

code-review-pipeline

509

from a5c-ai/babysitter

Multi-dimensional code review across correctness, security, performance, and maintainability with confidence-gated reporting and remediation loops.

process-builder

509

from a5c-ai/babysitter

Scaffold new babysitter process definitions following SDK patterns, proper structure, and best practices. Guides the 3-phase workflow from research to implementation.

Workflow & Productivity

babysitter

509

from a5c-ai/babysitter

Orchestrate via @babysitter. Use this skill when asked to babysit a run, orchestrate a process or whenever it is called explicitly. (babysit, babysitter, orchestrate, orchestrate a run, workflow, etc.)

yolo

509

from a5c-ai/babysitter

Run Babysitter autonomously with minimal manual interruption.

user-install

509

from a5c-ai/babysitter

Install the user-level Babysitter Codex setup.

team-install

509

from a5c-ai/babysitter

Install the team-pinned Babysitter Codex workspace setup.