AI Exam Generation Specialist
An AI Exam Generation Specialist designs, generates, and validates assessment items-including multiple-choice, constructed-respons…
Skill Guide
A structured process for evaluating and documenting the reliability, validity, and fairness of psychological assessments using the foundational framework of Classical Test Theory (CTT).
Scenario
You are given a 20-item multiple-choice test designed to screen for basic technical knowledge for a junior role. You need to determine if it's a reliable and fair initial filter.
Scenario
A company uses a 360-degree feedback survey to measure 'Strategic Influence.' You must gather evidence that it actually measures this competency and predicts promotion success.
Scenario
Your organization's entire technical hiring battery is challenged in an adverse impact analysis by an internal audit. You must present a cohesive validation dossier proving job-relatedness and business necessity.
Used for computing reliability (alpha, omega), conducting factor analysis, item analysis, and significance testing. R/Python are preferred for advanced modeling; JASP/SPSS for accessible initial analyses.
The Validity Hierarchy guides the type of evidence to collect. The *Standards* provide the ethical and technical framework for good practice. The *Uniform Guidelines* are the legal framework for defensibility in the U.S. context.
Answer Strategy
The interviewer is testing if you understand reliability is multidimensional. Do not just agree. Respond: 'While .82 indicates acceptable internal consistency, I'd first verify this is from a representative sample. I'd also need to present evidence of its test-retest stability over time, as internal consistency alone doesn't confirm consistency across situations. For high-stakes decisions, we'd also examine the Standard Error of Measurement to define a band of scores around a cutoff.'
Answer Strategy
This tests diagnostic and problem-solving skills. The core issue is likely a deficient criterion (the supervisor rating). A strong answer: 'This points to a criterion problem. Supervisor ratings may be biased, infrequent, or based on different constructs. My next step would be to investigate the criterion measure: conduct a job analysis to define performance accurately, then develop or select a more rigorous criterion like structured interview performance, customer satisfaction scores, or a behaviorally anchored rating scale, and re-validate.'
1 career found
Try a different search term.