Question 1

What are the six lawful bases for processing personal data under the GDPR, and which might be most relevant for training an AI model?

Accepted Answer

The answer should list the bases (consent, contract, legal obligation, vital interests, public task, legitimate interests) and discuss legitimate interests as a common but nuanced basis for ML training.

Question 2

Explain what a Data Protection Impact Assessment (DPIA) is and when it is required under GDPR.

Accepted Answer

A great answer defines DPIA as a process to identify and minimize data protection risks, and correctly states it's mandatory for processing likely to result in high risk, including systematic profiling.

Question 3

What is the difference between 'data controller' and 'data processor' in the context of using a third-party AI SaaS tool?

Accepted Answer

The candidate should clearly delineate that the controller determines the 'why' and 'how' of processing, while the processor acts on the controller's behalf, and explain the contractual requirements (Art. 28).

Question 4

How do you interpret the GDPR principle of 'purpose limitation' when applied to the reuse of data for a new AI project?

Accepted Answer

A strong answer discusses that data collected for one purpose cannot be repurposed without compatibility assessment, and may require fresh consent or a new legal basis for AI training.

Question 5

What is pseudonymization, and how does it differ from anonymization under the GDPR?

Accepted Answer

The response should explain pseudonymization (data can be attributed with additional info) vs. true anonymization (no longer personal data, exempt from GDPR), and note pseudonymization is a recommended safeguard.

Question 6

An ML engineer wants to use a public dataset to pretrain a foundation model. What key GDPR compliance checks should you perform?

Accepted Answer

Look for discussion on verifying the original consent/lawful basis of the public dataset, assessing potential bias, checking for sensitive data, and documenting the Data Provenance and Data Protection lineage.

Question 7

Describe a technical approach to fulfill the 'right to erasure' (right to be forgotten) when personal data has been used to train a deep learning model.

Accepted Answer

A good answer acknowledges the technical challenge ('unlearning') and discusses strategies like retraining the model without the data, using machine unlearning techniques, or strong documentation of why erasure may be impossible.

Question 8

How would you assess and document a legitimate interest for using customer interaction data to train a chatbot improvement model?

Accepted Answer

The candidate should walk through the three-part test: (1) identify the legitimate interest (e.g., improving service quality), (2) demonstrate the processing is necessary, and (3) conduct a balancing test against the individuals' interests and rights.

Question 9

What specific questions would you ask a vendor providing an 'AI-powered' analytics tool during a due diligence review for GDPR compliance?

Accepted Answer

Probes should include: Where is data processed/stored? Sub-processors? Data retention/deletion? Exercising data subject rights? Security measures? DPIA availability? Contractual Article 28 clauses?

Question 10

Explain the concept of 'data minimization' in the feature engineering phase of an ML project.

Accepted Answer

The answer should advise on selecting only the features strictly necessary for the model's purpose, avoiding the collection of redundant or highly sensitive data, and potentially using techniques like feature selection.

AI GDPR Compliance Specialist Interview Questions

Beginner

Intermediate

Advanced

Scenario-Based

AI Workflow & Tools

Behavioral

Done Practicing? Here's What's Next

Full Career Guide

Learning Roadmap

Compare This Role