AI Workflow Engineer
An AI Workflow Engineer designs, builds, and maintains end-to-end pipelines that orchestrate large language models, agents, retrie…
Skill Guide
RAG pipeline design is the systematic engineering of a retrieval system that queries a curated knowledge base to provide context for a Large Language Model (LLM), transforming it from a generic generator into a domain-specific, fact-grounded answering engine.
Scenario
You have a single technical manual (e.g., for a washing machine or software) and need to build a chatbot that answers user questions strictly from this document.
Scenario
You have a database of historical support tickets (text + metadata like category) and need to retrieve the most relevant past solutions for new, often ambiguous, user queries.
Scenario
Design a system for lawyers that retrieves relevant clauses from thousands of contracts, but must actively verify its own retrieval quality and surface source contradictions.
Use LangChain for its extensive tooling and agent capabilities. LlamaIndex is superior for advanced indexing/querying strategies over heterogeneous data. Haystack is excellent for building customizable, production-oriented pipelines with a focus on search.
Pinecone/Weaviate for managed, scalable cloud services. Qdrant for high-performance filtering. Chroma for simple, local prototyping. FAISS for research and high-speed local search, but requires manual management.
Ragas and TruLens for offline evaluation of context relevance, faithfulness, and answer quality. LangSmith for production tracing, debugging, and monitoring of latency and cost.
Use OpenAI/Cohere for state-of-the-art performance with minimal setup. BGE models are top open-source choices. Cross-encoders are used for high-accuracy re-ranking of retrieved chunks.
1 career found
Try a different search term.