AI Employee Wellbeing AI Specialist
An AI Employee Wellbeing AI Specialist designs, deploys, and oversees AI systems that monitor, analyze, and proactively improve th…
Skill Guide
The technical discipline of structuring, iterating, and fine-tuning large language model interactions and parameters to reliably produce outputs that promote user psychological safety, resilience, and constructive cognitive reframing.
Scenario
A user inputs: 'I'm so stressed about this project deadline, I feel like I'm going to fail.' The chatbot must guide the user to reframe the stressor and identify one actionable step.
Scenario
A workplace copilot that initiates a daily check-in, analyzes the user's text for emotional state, and offers tailored, low-intensity resources (e.g., a breathing exercise for anxiety, an article on focus for feeling scattered).
Scenario
Create a specialized agent that guides users through a structured Cognitive Behavioral Therapy for Insomnia (CBT-I) sleep restriction protocol, requiring strict adherence to clinical steps and safety monitoring.
Use OpenAI for rapid prompt prototyping and function calling. Hugging Face for accessing and fine-tuning open-source models like Mistral or Llama. LangChain for chaining prompts, memory, and tools into complex copilot architectures.
Build a custom rubric for your specific wellbeing use case. Use Microsoft's tools for fairness and interpretability assessments. Adapt academic benchmarks to stress-test your model's refusal of harmful requests and truthfulness of health information.
Translate these therapeutic frameworks into concrete prompt instructions and dialogue flows. MI informs how to ask open-ended, evocative questions. The CBT schema structures how to guide a user through identifying and challenging automatic negative thoughts.
Answer Strategy
The interviewer is testing systematic risk assessment and technical implementation skills. Start by categorizing risks: harmful advice, user crisis, data privacy, and scope creep. For each, state the technical mitigation: e.g., for crisis, implement a separate, highly accurate classifier for self-harm keywords that triggers a hardcoded response with crisis resources, bypassing the generative LLM entirely. Mention using few-shot examples to teach refusal patterns for out-of-scope advice.
Answer Strategy
This tests your iterative, data-driven approach to improvement. The core competency is system feedback loop design. Respond: 'I would first build a quality evaluation dataset with diverse user scenarios and expert-rated responses. I'd then perform error analysis, categorizing bad outputs (e.g., generic, off-topic, unsafe). For repetition, I'd adjust the system prompt to explicitly instruct varied language. For genericness, I'd enhance the few-shot examples with more specific, context-aware solutions. Finally, I'd implement A/B testing of prompt versions against the rubric.'
1 career found
Try a different search term.