AI System Prompt Engineer
An AI System Prompt Engineer designs, architects, and optimizes the foundational prompts and instruction sets that define how larg…
Skill Guide
The engineering discipline of enabling AI systems to reliably invoke external code (functions), data sources, and services (plugins) to execute specific tasks and extend their capabilities beyond text generation.
Scenario
Create a simple chatbot that can use at least three different tools (e.g., a calculator, a currency converter using a free API, and a local file reader).
Scenario
Design an agent that takes a research topic, uses a search tool to find sources, a scraping tool to extract text, and a summarization tool to produce a structured report.
Scenario
Architect a secure, observable gateway that allows multiple internal AI agents to discover, authenticate, and invoke a catalog of approved enterprise plugins (e.g., Salesforce CRM, ServiceNow, internal databases).
Use these to implement the foundational request-response cycle, define tool schemas, and manage agent loops. OpenAI's API is the reference standard; LangChain/LlamaIndex provide higher-level abstractions for complex orchestration and memory.
JSON Schema is mandatory for defining function parameters. OpenAPI is critical for designing and documenting plugin endpoints. TypeSpec (formerly Cadl) is a Microsoft language for authoring API descriptions and can generate OpenAPI specs, useful for complex plugin ecosystems.
Frameworks designed for building multi-agent systems where tools are shared resources. AutoGen excels at human-in-the-loop patterns. CrewAI focuses on role-playing agents. Semantic Kernel is Microsoft's enterprise-oriented SDK for integrating AI with native code.
Essential for tracing the exact sequence of LLM calls and tool invocations in production. They provide logs, cost tracking, and latency metrics, which are critical for debugging non-deterministic agent behavior.
Answer Strategy
The candidate must demonstrate mastery of API design and safety. A strong answer outlines: 1) Required vs. optional parameters (e.g., 'customer_id' is required, 'notes' is optional), 2) Data types and formats (string, integer, enum), 3) Constraints (min/max length, regex for emails), 4) A clear 'description' for the LLM, and 5) A 'return' schema describing success/error states. Example: 'I would define customer_id as a required string following our UUID format. Fields like phone and email would use regex patterns for validation. The description would explicitly state this is a privileged operation requiring confirmation. The return schema would include a success boolean and the updated record object or error details.'
Answer Strategy
Tests debugging and system thinking. The core issue is likely the LLM's non-determinism or poor prompt engineering leading to redundant calls. Strategy: 1) **Observability**: Check logs in LangSmith/W&B to trace the exact call sequence. 2) **Root Cause**: It's often the system prompt not discouraging speculation (the agent asks for weather twice, perhaps with slightly different city names). 3) **Fix**: Refine the system prompt to be more specific about data retrieval (e.g., 'Gather all necessary data before finalizing your answer. Do not make redundant calls.'). Example: 'First, I would instrument the agent with LangSmith to log all API calls. If logs show repeated calls for the same location, I'd revise the system prompt to instruct the agent to consolidate its data needs into a single request per source per task step.'
1 career found
Try a different search term.