AI Decision Intelligence Engineer
An AI Decision Intelligence Engineer designs, builds, and optimizes AI-powered decision systems that translate raw data into actio…
Skill Guide
LLM orchestration and agent architecture is the engineering discipline of designing, building, and managing complex systems where multiple large language models (LLMs), external tools, and memory systems are coordinated to perform autonomous, multi-step tasks.
Scenario
Create an agent that can ingest a PDF (e.g., a technical manual) and answer follow-up questions about its content, remembering the conversation history.
Scenario
Develop an agent that takes a research topic, generates an outline, searches the web for each section, summarizes findings, and compiles a report, handling failures and retries gracefully.
Scenario
Architect a system where a 'Triage Agent' analyzes incoming support tickets, then delegates to specialized 'Agents' (Billing Agent, Technical Agent, Escalation Agent) based on intent and complexity, ensuring seamless handoff and context preservation.
LangChain/LCEL for linear chains and rapid prototyping. LangGraph for complex, stateful, cyclic agent workflows. CrewAI for role-based multi-agent team simulations. AutoGen for conversational multi-agent patterns. Select based on workflow complexity and need for cyclic reasoning.
LangSmith for tracing and debugging agent runs. W&B/Phoenix for experiment tracking and LLM-specific observability. Redis/DynamoDB for external, persistent agent memory and state management. Critical for moving from prototype to production.
Vector stores enable RAG (Retrieval-Augmented Generation). Embedding models convert text to vectors for semantic search. Tool libraries provide the 'hands' for agents to interact with the world. The quality of these components directly limits agent capability.
Answer Strategy
Structure the answer around defining the state schema, node design, and error handling. 'I would define a Pydantic model for the state holding the query, intermediate data, and citation list. The graph would have nodes for: 1) query_parser, 2) data_retriever (tool), 3) calculator (tool), 4) report_generator. I'd use conditional edges to loop back if the calculator gets an error. For tool failures, I'd implement a retry policy with a fallback node that logs the issue and proceeds with available data, updating the state with an error flag.'
Answer Strategy
Tests systematic debugging and knowledge of observability tools. 'In a customer service agent, it would sometimes hallucinate order numbers. My process: 1) Reproduce with logging set to VERBOSE, using LangSmith traces to visualize the exact prompt/tool call sequence. 2) Isolated the issue to the summarization step feeding ambiguous context. 3) Fixed by adding a post-processing validation step with a stricter, rule-based prompt before the final output. This reduced the error rate by 95%.'
1 career found
Try a different search term.