AI Analytics Engineering Specialist
An AI Analytics Engineering Specialist bridges data engineering, analytics, and AI/ML to build intelligent data pipelines and auto…
Skill Guide
The practice of designing scalable, cost-effective, and secure data systems by composing and integrating the native data processing, analytics, and AI/ML services of a specific major cloud provider (AWS, GCP, or Azure).
Scenario
A retail company needs to analyze daily sales data from CSV files uploaded to an S3 bucket. They want to query aggregated results (total sales by product category) without managing servers.
Scenario
A fintech startup ingests real-time transaction data via Pub/Sub and batch customer profile updates via Cloud Storage. The unified data must be queryable for a marketing segmentation ML model, and the schema of transaction fields may change over time.
Scenario
A large manufacturing enterprise wants to decentralize data ownership. Each business unit (Supply Chain, Manufacturing, Sales) must own its domain data products (e.g., 'Supplier Quality Score', 'Factory OEE') on Azure, while ensuring global discoverability, governance, and standardized access via APIs.
Use these as the foundational checklists and design principles for any architecture. They provide cloud-vendor-specific best practices for security, reliability, cost optimization, and operational excellence.
Mandatory for creating repeatable, version-controlled, and auditable cloud infrastructure. Terraform is the multi-cloud standard, while vendor-specific tools offer deeper integration. Use to provision S3 buckets, Glue jobs, BigQuery datasets, and Synapse pools.
dbt is the industry standard for version-controlled, SQL-based transformation logic. Iceberg/Delta Lake add ACID transactions and time travel to cloud data lakes. Governance tools (Lake Formation, Purview) are used to define fine-grained access policies and catalog data assets enterprise-wide.
1 career found
Try a different search term.