AI AR/VR AI Engineer
An AI AR/VR Engineer designs and deploys intelligent systems that power spatial computing experiences - from AI-driven scene under…
Skill Guide
Computer vision for AR encompasses the real-time computational pipeline that detects visual features, maps and localizes the device within an environment (SLAM), estimates depth from camera data, and identifies geometric planes to anchor virtual content.
Scenario
You are developing the initial visual tracking component for an AR navigation app. You need to identify and track distinctive points in a camera feed to estimate device motion.
Scenario
Create a mobile AR application that allows a user to place a virtual 3D object (e.g., a chair) on a real-world flat surface (floor or table) detected by the device's camera.
Scenario
You are tasked with creating a high-precision spatial anchor system for an industrial AR maintenance guide that must work in feature-sparse, large-scale environments (e.g., a factory floor).
OpenCV provides the low-level computer vision primitives. ARKit and ARCore are the production-grade, platform-specific SDKs that provide high-level SLAM, plane detection, and depth APIs. MediaPipe offers cross-platform, ML-powered solutions for hand/face tracking and segmentation.
Open3D and Pangolin are used for 3D data processing and visualization during prototyping. GTSAM is the industry standard for implementing graph-based SLAM back-ends. COLMAP is a benchmark tool for Structure from Motion and can be used to validate SLAM pipelines.
PyTorch is used for research and training of custom models for depth estimation, feature extraction (SuperPoint), or semantic segmentation. The other frameworks are essential for deploying these models with high performance on mobile/AR headset hardware.
1 career found
Try a different search term.