AI Blockchain Data Analyst
An AI Blockchain Data Analyst extracts, models, and interprets on-chain and off-chain data using machine learning pipelines and AI…
Skill Guide
The technical capability to query, transform, and analyze on-chain and decoded blockchain data using SQL and Python on specialized data warehouse platforms like Dune Analytics, Flipside Crypto, and The Graph.
Scenario
Create a public Dune dashboard that tracks and visualizes the daily trading volume (in USD) for Uniswap V3 on Ethereum over the last 30 days.
Scenario
Identify the top 100 wallets by initial deposit amount into Aave V3 on Ethereum, then track their subsequent health factor and liquidation risk over the following 90 days using Python and Flipside.
Scenario
Build a system to detect, analyze, and rank MEV bots operating across Ethereum and Arbitrum by net profitability, accounting for gas costs, token swaps, and bridge activity.
Dune/Flipside are primary SQL-based warehouses for raw/decoded blockchain data. The Graph provides indexed, queryable data via GraphQL for specific protocols. Python is used for advanced transformation, analysis, and automation outside the SQL editor.
`dune-client` and `flipsidecrypto` are the official SDKs for interacting with their APIs. `web3.py` can be used for direct node interaction or verifying data. `requests` is for interacting with The Graph's hosted service endpoints.
Structuring queries with fact/dimension tables (star schema) improves performance and clarity. Time-series and cohort analyses are core blockchain analytics patterns. Heuristic labeling (e.g., 'likely bot') is critical for entity-level analysis.
Answer Strategy
Test the candidate's ability to reason about complex on-chain data structures (UTXO) and translate a financial concept into a multi-step SQL query. The answer should outline: 1) Joining `bitcoin.transactions` and `bitcoin.inputs`/`bitcoin.outputs` to track UTXO movement. 2) Identifying the 'last spent' timestamp for each output. 3) Joining with a historical price table (`prices.usd`) on that specific timestamp. 4) Aggregating the (value * price) sum. Mentioning performance considerations like using `DENSE_RANK()` or `WINDOW` functions is a strong signal.
Answer Strategy
Test the candidate's debugging methodology and understanding of platform constraints. The answer should demonstrate a systematic approach: checking API status/limits, analyzing query performance (data growth, lack of time filters), and implementing solutions like query optimization, pagination, or switching to an incremental model.
1 career found
Try a different search term.