AI Attack Surface Analyst
An AI Attack Surface Analyst systematically discovers, classifies, and prioritizes vulnerabilities across an organization's entire…
Skill Guide
Prompt injection and jailbreak technique design and detection is the adversarial cybersecurity discipline focused on crafting, analyzing, and mitigating malicious or unintended inputs that manipulate the behavior of large language models (LLMs) to bypass safety constraints, extract hidden information, or execute unauthorized actions.
Scenario
You have a simple LLM chatbot for customer support. Your task is to create a system that can reliably detect if a user is attempting to make the bot reveal its initial system prompt.
Scenario
An LLM-powered assistant summarizes emails and calendar invites from a connected, potentially untrusted data source (e.g., a public forum feed). Craft an attack where a poisoned external document causes the assistant to perform a malicious action when summarizing it.
Scenario
You are leading a security review for a high-stakes financial LLM that provides investment analysis. The model is under strict safety constraints to avoid giving specific financial advice. A determined adversary is using a combination of techniques to bypass these filters over multiple conversation turns.
Garak probes for known vulnerabilities with automated prompts. Rebuff is an open-source tool for building and testing detection systems. PyRIT provides a structured framework for orchestrating red team operations against LLMs. The HF library allows direct experimentation with model internals and token manipulation.
LangChain and Guardrails provide tools to define and enforce output schemas, preventing hallucinated or malicious responses. Constitutional AI principles can be embedded in prompts for self-reflection. Presidio is crucial for preventing data exfiltration via prompts.
W&B helps visualize and compare the effectiveness of different attack and defense strategies over time. Observability platforms provide traceability of LLM calls, input/output logging, and anomaly detection. Custom logs are essential for real-time monitoring of production systems for injection attempts.
1 career found
Try a different search term.