home / skills / a5c-ai / babysitter / haystack-pipeline
This skill helps you configure Haystack pipelines for document processing and QA, including stores, retrievers, readers, and preprocessing with best practices.
npx playbooks add skill a5c-ai/babysitter --skill haystack-pipelineReview the files below or copy the command above to add this skill to your agents.
---
name: haystack-pipeline
description: Haystack NLP pipeline configuration for document processing and QA
allowed-tools:
- Read
- Write
- Edit
- Bash
- Glob
- Grep
---
# Haystack Pipeline Skill
## Capabilities
- Configure Haystack pipeline components
- Set up document stores and retrievers
- Implement reader/generator models
- Design custom pipeline graphs
- Configure preprocessing pipelines
- Implement evaluation pipelines
## Target Processes
- rag-pipeline-implementation
- intent-classification-system
## Implementation Details
### Core Components
1. **DocumentStores**: Elasticsearch, Weaviate, FAISS, etc.
2. **Retrievers**: BM25, Dense, Hybrid
3. **Readers/Generators**: Extractive and generative QA
4. **Preprocessors**: Document cleaning and splitting
### Pipeline Types
- Retrieval pipelines
- RAG pipelines
- Evaluation pipelines
- Indexing pipelines
### Configuration Options
- Component selection
- Pipeline graph design
- Document store backend
- Model selection
- Preprocessing settings
### Best Practices
- Modular pipeline design
- Proper preprocessing
- Evaluation integration
- Component versioning
### Dependencies
- haystack-ai
- farm-haystack (legacy)
This skill configures Haystack NLP pipelines for document processing and question-answering workflows. It helps assemble document stores, retrievers, and reader/generator models into modular, reproducible pipelines tailored for RAG, retrieval, and evaluation tasks. The skill emphasizes practical configuration, preprocessing, and evaluation integration for production-grade NLP stacks.
You define and connect core Haystack components: a document store backend, retriever(s), reader or generator, and optional preprocessors. The skill builds pipeline graphs for retrieval, RAG, indexing, or evaluation, and exposes configuration knobs for component selection, model choice, and preprocessing rules. It supports modular, versioned pipelines that can be evaluated and iterated deterministically.
Which document stores are supported?
Common backends include Elasticsearch, Weaviate, FAISS, and other Haystack-compatible stores; choose based on scale and retrieval latency needs.
Should I use an extractive reader or a generative model?
Use extractive readers for precise span-based answers and generative models when fluency and synthesis across passages are required; evaluation should guide the final choice.