AI On-Device AI Engineer
An AI On-Device AI Engineer specializes in deploying, optimizing, and running machine learning models on edge hardware-smartphones…
Skill Guide
The practice of quantifying a software system's memory footprint (peak heap, resident set size) and energy consumption per operation to ensure it operates within the fixed resource constraints of a target hardware platform.
Scenario
You are given a Python script that processes a large CSV file into a pandas DataFrame. The target deployment platform is a micro-computer with only 512MB of RAM.
Scenario
Deploy a pre-trained MobileNetV2 model for image classification on an NVIDIA Jetson Nano. The power budget is 10W total, and you must ensure a single inference does not cause thermal throttling.
Scenario
Create a C++ inference engine for a keyword spotting model that must run on an ARM Cortex-M4 with 256KB of SRAM. The model's peak tensor memory must fit within 100KB.
Used for deep-diving into heap allocation, cache performance, and GPU kernel bottlenecks. Select based on the target hardware (e.g., VTune for Intel CPUs, Nsight for NVIDIA GPUs).
Essential for obtaining ground-truth power consumption data at the component or system level. Software-based tools (RAPL, tegrastats) offer convenience; hardware monitors provide absolute accuracy.
Used to reduce model memory footprint (e.g., quantization, pruning) and optimize for target hardware (e.g., operator fusion, kernel auto-tuning), directly impacting both memory and power budgets.
1 career found
Try a different search term.