AI A/B Testing Analyst
An AI A/B Testing Analyst designs, executes, and interprets controlled experiments on AI-powered products and features-from LLM pr…
Skill Guide
The specialized use of SQL to query, join, and transform raw event data to isolate user groups (cohorts) defined by specific behaviors or attributes for controlled experimentation.
Scenario
You have an 'events' table with columns: user_id, event_name, event_timestamp, device_type. The product team wants the DAU for the 'photo_upload' feature over the last 30 days.
Scenario
A/B test on a new pricing page. You have an 'experiment_assignments' table (user_id, experiment_id, variation, assigned_at) and a 'purchases' table (user_id, purchase_id, amount, purchased_at). Segment users by the experiment variation and calculate conversion rate and average revenue per user (ARPU).
Scenario
The marketing team needs a reusable system to dynamically define and export user cohorts (e.g., 'high-value users who churned after seeing email campaign X') for targeting, without writing new SQL each time.
Essential for executing queries. BigQuery and Snowflake are dominant for their scalability with massive event datasets. Proficiency in their specific SQL dialects (e.g., BigQuery's STRUCT, UNNEST) is critical.
Understanding the underlying data model (e.g., fact and dimension tables) is key to writing efficient joins. Knowledge of how experimentation platforms log assignment and exposure data is necessary for accurate analysis.
Cohort and funnel analysis provide the mental models for structuring queries. Understanding statistical concepts (p-values, confidence intervals) ensures query outputs are interpreted correctly for business decisions.
Answer Strategy
Tests real-world experience and the ability to handle complexity. The core competency is connecting technical execution to business impact. Sample Answer: 'I analyzed a cohort of users who completed onboarding but delayed their first key action. The SQL required a self-join on the events table to find the time delta between 'onboarding_complete' and 'first_purchase' events, filtered for deltas > 24 hours. This revealed a 15% higher LTV for delayed engagers, suggesting they were more deliberate. The complex part was ensuring the time delta calculation correctly handled timezone conversions across our global user base.'
1 career found
Try a different search term.