AI A/B Testing Analyst
An AI A/B Testing Analyst designs, executes, and interprets controlled experiments on AI-powered products and features-from LLM pr…
Skill Guide
The process of determining the required sample size for a study to detect a statistically significant effect of a given magnitude, if that effect exists, while controlling for the risk of false positives (Type I errors) and false negatives (Type II errors).
Scenario
Your product team wants to test two different versions of a 'Sign Up' button (control vs. variant) to see which has a higher click-through rate. The current baseline rate is 5%. They want to detect a relative increase of at least 20% (to 6%) with 80% power and 95% confidence.
Scenario
The marketing team plans to run a randomized controlled trial to measure the impact of a new email campaign on customer lifetime value (LTV), a continuous, skewed metric. They believe a $10 increase in LTV is meaningful. Historical LTV mean is $100, with a standard deviation of $50.
Scenario
A/B testing a new onboarding flow where the unit of randomization is the 'account' (a company) but the outcome is measured at the individual user level within that account. Users within an account are correlated (intraclass correlation, ICC). This violates assumptions of simple random sampling.
R and Python are essential for programmatic, reproducible, and simulation-based analyses. G*Power is excellent for rapid, GUI-driven calculations for standard tests. Platform calculators are useful for quick checks in A/B testing contexts but should not be trusted for complex designs.
Pre-registration enforces discipline by requiring the power analysis plan *before* data collection. Understanding effect size benchmarks prevents designing studies with unrealistic expectations. Prospective (a priori) analysis guides planning; retrospective (post-hoc) analysis is controversial for interpreting null results but useful for planning future studies.
1 career found
Try a different search term.