Senior AI Data Pipeline Engineer
42dot
WE ARE LOOKING FOR THE BEST
ABOUT US
42dot is a mobility AI company committed to solving mobility challenges with software and AI. As the Global Software Center of Hyundai Motor Group, 42dot pioneers the future of mobility by advancing the development of software-defined vehicles.
We develop safety-first, user-centric software-defined vehicle technologies that, like smartphones, deliver the latest performance through continuous updates. By advancing software and AI technology, 42dot envisions a world where everything is connected and moves autonomously through a self-managing urban transportation operating system.
Our AI Data Pipeline Engineers build the core data processing pipelines and prepare the datasets that cutting-edge autonomous driving algorithms depend on. We develop distributed systems for a scalable data pipeline that handles large-scale datasets (millions of scenes), as well as high-performance data serving SDKs for ML model training and evaluation. The data pipelines we deliver can greatly improve the efficiency of the ML model development lifecycle, including training, evaluation, deployment, and monitoring in the cloud environment.
Responsibilities
- Develop high-scale, reliable data extraction pipelines that extract millions of raw data items from the data collection fleet and convert them into high-value scene data
- Develop data labeling pipelines that run auto-labeling inference for autonomous driving algorithms
- Develop an advanced autonomous driving data SDK, including scene data search, dataset preparation, and dataset loading
- Build the data lakehouse for autonomous driving scene datasets, including sensor data, calibration data, and annotation data
- Dig into performance bottlenecks all along the data processing pipelines, from data processing latency and data search latency to Test Procedure (TP) coverage
- Bootstrap and maintain infrastructure for data platform components—data processing pipeline, database, data lakehouse and data serv...