AI Genomics Data Analyst
An AI Genomics Data Analyst leverages machine learning, large language models, and bioinformatics pipelines to extract clinically …
Skill Guide
The core body of knowledge encompassing the flow of genetic information from DNA to RNA to protein (central dogma), the mechanisms controlling gene expression, and the classification and functional impact of DNA sequence variations.
Scenario
You are provided with a list of 5 raw variant calls (e.g., chr17:7674220 C>T) from a gene panel sequencing run. Your task is to produce a standardized annotation and classification report for each.
Scenario
A patient has a variant in the *intronic* region of the *LMNA* gene associated with dilated cardiomyopathy. The variant does not alter the protein sequence. Your task is to hypothesize its regulatory mechanism.
Scenario
You are analyzing whole-genome sequencing data from a cohort of patients with an idiopathic neurological disorder. Several non-coding variants of unknown significance are identified near a candidate gene. Your goal is to prioritize one for functional studies.
VEP and ANNOVAR are primary tools for annotating the genomic and functional consequences of variants. ClinVar is the definitive database for clinical significance assertions. gnomAD provides critical population allele frequency data to assess rarity. The UCSC Genome Browser is the essential platform for visualizing variants in their genomic context.
ACMG/AMP provides the standardized, evidence-based framework for classifying variants from Benign to Pathogenic, mandatory for clinical reporting. HGVS nomenclature is the required language for unambiguously describing variants in reports and publications.
Answer Strategy
Use a structured, evidence-based approach following the ACMG framework. The candidate should outline evaluating: 1) Computational/Predictive data (REVEL, CADD scores), 2) Functional data from literature, 3) Segregation data in the family, 4) The variant's location in a known functional domain (e.g., DNA-binding domain). A strong answer avoids definitive claims without data and emphasizes accumulating evidence.
Answer Strategy
The interviewer is testing the candidate's depth beyond the basic central dogma. The answer must demonstrate knowledge of mRNA processing and regulation. A top response would detail: 1) Disruption of an exonic splicing enhancer (ESE) or silencer (ESS) leading to exon skipping, and 2) Alteration of codon usage affecting translational speed and protein folding, citing a known example like the synonymous CFTR variant c.1679G>A (p.Arg560=) which causes aberrant splicing.
1 career found
Try a different search term.