AI Medical Literature Review Specialist
An AI Medical Literature Review Specialist leverages large language models, retrieval-augmented generation (RAG), and biomedical N…
Skill Guide
The architectural design, implementation, and iterative refinement of a Retrieval-Augmented Generation system tailored to extract precise, contextually relevant information from specialized, non-general knowledge bases.
Scenario
Create a RAG system for a small corpus of technical documentation (e.g., a specific Python library's API docs) to answer user questions.
Scenario
Improve a RAG system's precision for extracting specific clauses from a corpus of 10,000+ legal contracts where accuracy is critical.
Scenario
Architect a production-grade RAG pipeline for PubMed articles that includes confidence scoring, source attribution, and automatic query refinement when answers are uncertain.
Use LangChain for flexible, composable pipelines and rapid prototyping. Choose LlamaIndex for advanced data ingestion and indexing patterns. Use Haystack for production-grade, scalable pipelines with strong support for retrieval modules.
Pinecone for managed, scalable cloud-native vector search. Weaviate for hybrid search with built-in vectorization. Qdrant for high-performance filtering. FAISS for local, high-speed similarity search in research/prototyping.
Use OpenAI embeddings for general quality. Choose BGE or Cohere for domain-specific fine-tuning potential. Use a cross-encoder reranker after initial retrieval to dramatically improve precision on the top-k results.
RAGAS for faithfulness, relevance, and context recall metrics. DeepEval for comprehensive LLM/RAG testing. TruLens for tracing and feedback-driven evaluation. LangSmith for observability, tracing, and debugging production chains.
Answer Strategy
The candidate should demonstrate an understanding of domain-specific document structure and the trade-offs between context preservation and retrieval granularity. Answer should move beyond default splitters. Sample Answer: 'I would first analyze the document structure, identifying common sections (Abstract, Methods, Results). For dense scientific text, I'd use a recursive character splitter with a chunk size of 512 tokens and 50-token overlap, but with separators customized for academic paragraphs and headings. I'd also extract and attach metadata like 'paper_id', 'section_title', and 'year' to each chunk, as this allows for highly precise metadata filtering during retrieval. For the Methods section specifically, which is critical for reproducibility, I might use a smaller chunk size to ensure granular retrieval of specific protocols.'
Answer Strategy
Tests for problem-solving, understanding of retrieval semantics, and user-centric design. Sample Answer: 'This is a classic retrieval relevance issue. First, I'd use tracing tools like LangSmith to inspect the retrieved context for these bad queries. The problem is likely that the retrieval is finding passages that are topically correct but semantically misaligned. My improvement plan would be twofold: 1) Implement a query understanding step, using an LLM to expand the user's question into multiple, more specific sub-queries before retrieval. 2) I'd fine-tune the embedding model on pairs of user questions and their truly relevant document passages from our support ticket history, creating a domain-specific embedding space that better captures nuance.'
1 career found
Try a different search term.