haystack-pipeline
Haystack NLP pipeline configuration for document processing and QA
Best use case
haystack-pipeline is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Haystack NLP pipeline configuration for document processing and QA
Teams using haystack-pipeline should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/haystack-pipeline/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How haystack-pipeline Compares
| Feature / Agent | haystack-pipeline | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Haystack NLP pipeline configuration for document processing and QA
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Haystack Pipeline Skill ## Capabilities - Configure Haystack pipeline components - Set up document stores and retrievers - Implement reader/generator models - Design custom pipeline graphs - Configure preprocessing pipelines - Implement evaluation pipelines ## Target Processes - rag-pipeline-implementation - intent-classification-system ## Implementation Details ### Core Components 1. **DocumentStores**: Elasticsearch, Weaviate, FAISS, etc. 2. **Retrievers**: BM25, Dense, Hybrid 3. **Readers/Generators**: Extractive and generative QA 4. **Preprocessors**: Document cleaning and splitting ### Pipeline Types - Retrieval pipelines - RAG pipelines - Evaluation pipelines - Indexing pipelines ### Configuration Options - Component selection - Pipeline graph design - Document store backend - Model selection - Preprocessing settings ### Best Practices - Modular pipeline design - Proper preprocessing - Evaluation integration - Component versioning ### Dependencies - haystack-ai - farm-haystack (legacy)
Related Skills
cicd-pipeline-generator
Generate CI/CD pipelines for SDK build and release
texture-pipeline
Texture skill for compression, atlasing, and streaming.
nextflow-pipeline-executor
Nextflow workflow management skill for reproducible bioinformatics pipelines
cicd-pipelines
Multi-platform CI/CD pipeline expertise. Generate GitHub Actions, GitLab CI, Jenkins, and Azure Pipelines configurations. Analyze failures, optimize execution time, validate syntax, and configure matrix builds and caching strategies.
kubeflow-pipeline-executor
Kubeflow Pipelines skill for ML workflow orchestration, component management, and Kubernetes-native ML.
etl-pipeline-builder
Build and manage ETL pipelines for data migration with transformation, CDC, and monitoring
code-review-pipeline
Multi-dimensional code review across correctness, security, performance, and maintainability with confidence-gated reporting and remediation loops.
process-builder
Scaffold new babysitter process definitions following SDK patterns, proper structure, and best practices. Guides the 3-phase workflow from research to implementation.
babysitter
Orchestrate via @babysitter. Use this skill when asked to babysit a run, orchestrate a process or whenever it is called explicitly. (babysit, babysitter, orchestrate, orchestrate a run, workflow, etc.)
yolo
Run Babysitter autonomously with minimal manual interruption.
user-install
Install the user-level Babysitter Codex setup.
team-install
Install the team-pinned Babysitter Codex workspace setup.