Senior Data Engineer (Autonomous Vehicles Data) at RSB Automotive Consulting
, , Poland -
Full Time


Start Date

Immediate

Expiry Date

07 Jun, 26

Salary

0.0

Posted On

09 Mar, 26

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Python, SQL, Spark, PySpark, Databricks, Time-series Data, Advanced Analytics, Data Preparation, Software Engineering, ETL Pipelines, Data Mining, Simulation, Machine Learning

Industry

Motor Vehicle Manufacturing

Description
Senior Data Engineer (Autonomous Vehicles Data) At RSB Automotive Consulting, we work with engineers and technology teams developing advanced mobility solutions. We are currently looking for a Senior Data Engineer to join a long-term project focused on data processing and analytics for Autonomous Vehicle (AV) development. In this role, you will work with large-scale sensor datasets collected from test vehicles and support teams building simulation environments and machine learning pipelines used to validate safety-critical AV systems. Project context: data pipelines and analytics supporting Autonomous Vehicle (AV) development Tech stack: Python, SQL, Spark / PySpark, Databricks Data scale: up to ~1 TB of sensor data per hour (cameras, LiDAR, radar) Focus: time-series data, advanced analytics, data preparation for simulation and ML training Work mode: remote (any location), with daily overlap with the US team Start: ASAP Project duration: long-term collaboration with a global automotive OEM Your responsibilities Analyze large volumes of sensor and time-series data from autonomous vehicle test fleets Develop advanced SQL, Python, and PySpark queries to filter, transform, and aggregate datasets Design and maintain ETL pipelines processing large-scale structured and semi-structured data Identify and extract data suitable for AV simulation scenarios and ML training pipelines Support the discovery of rare or complex driving situations (e.g. unusual traffic scenarios, hard braking events, edge cases) Develop scripts and internal tools supporting data mining and analytics workflows Collaborate with engineers working across the autonomous driving technology stack What we’re looking for Strong software engineering background Advanced SQL skills with experience writing complex queries Advanced Python programming Hands-on experience with Spark / PySpark Experience working with Databricks Experience in advanced data analytics and time-series analysis Understanding of data preparation for machine learning workflows Project context You will work with sensor data generated by autonomous vehicle test fleets equipped with multiple cameras, LiDAR, and radar. These systems generate extremely large datasets used to build simulation environments that help validate autonomous driving algorithms. The role focuses on transforming raw sensor data into structured, simulation-ready datasets used by engineering and research teams. If you are interested in working with large-scale data systems and real-world autonomous driving datasets, we would be happy to connect.
Responsibilities
The role involves analyzing large volumes of sensor and time-series data from autonomous vehicle test fleets, developing advanced SQL, Python, and PySpark queries to transform this data, and designing/maintaining ETL pipelines. Responsibilities also include supporting the discovery of rare driving situations and developing internal tools for data mining and analytics workflows.
Loading...