AI Backtesting Automation Specialist
An AI Backtesting Automation Specialist designs, builds, and maintains intelligent systems that automate the testing of trading st…
Skill Guide
The design, implementation, and maintenance of automated, scalable systems that capture, transform, validate, and distribute financial market data from raw source feeds into a normalized, analysis-ready format.
Scenario
Ingest raw tick-level trade data from a public API (e.g., Alpha Vantage) for a single stock, compute daily Open/High/Low/Close/Volume, and store it in a PostgreSQL database.
Scenario
Ingest real-time trade and quote data for the same equity from two different exchange feeds (e.g., NYSE and NASDAQ simulations). Normalize timestamps to UTC, adjust prices for corporate actions using a static calendar, and merge into a single canonical event stream.
Scenario
Ingest the full US options chain (millions of quotes) in real-time from a direct feed, calculate implied volatility and Greeks on-the-fly, and serve this normalized + derived data to internal trading systems with guaranteed latency < 10ms.
Used to schedule, monitor, and manage complex, multi-step data pipeline DAGs (Directed Acyclic Graphs). Essential for batch-oriented or micro-batch processing workflows.
The backbone for real-time, event-driven ingestion. Kafka provides durable, high-throughput messaging; Flink enables stateful stream processing for complex event processing (CEP) on market data.
Python for rapid prototyping and batch processing. Java/Kotlin for robust, scalable JVM-based services. C++/Rust for ultra-low latency, performance-critical ingestion and transformation. q/kdb+ is the domain-specific standard for time-series analysis in finance.
TimescaleDB/InfluxDB for high-speed time-series writes and queries. ClickHouse for fast analytical queries over large historical datasets. Parquet for cost-effective, columnar storage of normalized data lakes.
Answer Strategy
The candidate must demonstrate production experience and resilience thinking. They should outline the architecture (e.g., Kafka -> Flink -> Database), identify the bottleneck (e.g., database write lock contention, consumer lag), and detail specific actions (e.g., implementing backpressure, temporarily increasing consumer instances, switching to a more partitioned topic structure).
Answer Strategy
Tests depth of domain knowledge and system design for consistency. The candidate must explain using a reference data service, applying the adjustment factor to all historical and real-time data, and ensuring atomicity so downstream consumers see a consistent view.
1 career found
Try a different search term.