AI Causal Inference Analyst
An AI Causal Inference Analyst determines not just what happened, but why it happened - using causal reasoning frameworks, statist…
Skill Guide
Regression discontinuity design (RDD) is a quasi-experimental causal inference method that identifies the effect of a treatment by exploiting a known cutoff rule for assignment, where units just above and just below the threshold are comparable except for treatment status.
Scenario
A university provides a scholarship to students with a high school GPA above 3.5. Analyze the causal effect of the scholarship on first-year college GPA.
Scenario
A bank uses a credit score threshold (e.g., 700) to *automatically approve* loans, but loan officers have discretion to approve some applicants just below the threshold. Estimate the causal effect of loan approval on default rates.
Scenario
A new environmental regulation mandates pollution controls for factories exceeding a size threshold (e.g., 500 employees). Estimate the regulation's causal impact on firm productivity and emissions.
These are industry-standard tools for implementing modern RDD methods, including optimal bandwidth selection, bias-corrected estimation, and robust inference. Use them for all serious empirical work.
The IK and CCT frameworks provide the econometric foundation for optimal bandwidth selection and bias correction. The McCrary test is a mandatory diagnostic to check for sorting manipulation around the cutoff.
Answer Strategy
Structure the answer around the 4 core RDD steps: 1) Design (sharp RDD, running variable is rating, cutoff is 90), 2) Visualization (plot next-year rating vs. current rating with bins), 3) Estimation (local linear regression on each side), 4) Validation (covariate balance, McCrary test). Emphasize the key assumptions: continuity of potential outcomes at the cutoff and no precise manipulation.
Answer Strategy
Test for methodological rigor and practical judgment. The answer should question the bandwidth choice (too wide may introduce bias from functional form misspecification), suggest using data-driven optimal bandwidth selection (e.g., IK or CCT), and recommend presenting results across a range of bandwidths as a robustness check. Frame it as ensuring the finding is not an artifact of modeling choices.
1 career found
Try a different search term.