AI Vulnerability Assessment Specialist
An AI Vulnerability Assessment Specialist systematically identifies, tests, and documents security weaknesses in machine learning …
Skill Guide
The systematic process of identifying, analyzing, and prioritizing security threats to AI systems throughout their lifecycle by applying adversarial attack taxonomies (ATLAS) and LLM-specific vulnerability frameworks (OWASP Top 10 for LLMs).
Scenario
You are given a documentation set for a hypothetical LLM search tool that uses RAG (Retrieval-Augmented Generation). Your task is to perform an initial threat model.
Scenario
A public incident report describes a malicious actor causing a recommendation system to promote extremist content via data poisoning. Your team must investigate the attack chain and recommend controls.
Scenario
Your company is launching a new generative AI feature integrated with a high-value customer database. As the security architect, you must design the security review process.
ATLAS provides the adversarial tactics, techniques, and procedures (TTPs) specific to ML. The OWASP LLM Top 10 is a focused checklist for generative AI risks. STRIDE and PASTA are general threat modeling methodologies useful for initial brainstorming and risk-centric analysis, respectively.
Counterfit and ART are command-line tools and libraries for running adversarial attacks against ML models to test robustness. Garak and Promptfoo are specialized tools for probing LLMs for vulnerabilities like prompt injection and data leakage, used during red teaming.
Seldon and MLflow provide model monitoring capabilities to detect drift or anomalous inference patterns post-deployment. Great Expectations is for data validation. Guardrails AI offers tools to enforce constraints on LLM inputs/outputs, implementing runtime mitigations identified in threat models.
Answer Strategy
The candidate should demonstrate a structured methodology. The answer should start by scoping assets and trust boundaries, then directly apply ATLAS and the OWASP LLM Top 10 to the agent's capabilities. Priority would be given to threats like Indirect Prompt Injection (OWASP LLM01), Excessive Agency (OWASP LLM07), and ML Supply Chain Compromise (ATLAS TA0043). A strong answer includes specific mitigations like strict sandboxing for code execution and robust output parsing.
Answer Strategy
This tests the candidate's ability to think beyond common software bugs and consider adversarial ML threats. The core competency is applying the ATLAS framework for covert attack identification. The response should move methodically from benign causes (data drift) to adversarial ones (data poisoning, model evasion).
1 career found
Try a different search term.