Data Engineer

at Saab Inc

Tampere, Länsi-Suomi, Finland

Start Date: Immediate
Expiry Date: 19 Jan, 2025
Salary: Not Specified
Posted On: 20 Oct, 2024
Experience: 3 year(s) or above
Skills: Network Architecture, Data Security, HIPAA, Computer Science, Python, HDF5, Databases, Data Collection, Machine Learning, Neural Networks, Hadoop, Data Engineering, dbt, Amazon S3, SQL, Cloud Storage, Apache Spark
Telecommute: No
Sponsor Visa: No

Description:

Saab Sensor Technology Center in Tampere is part of Business Area Surveillance. Our R&D teams are developing extremely complex and innovative electronic warfare systems, motivated by a clear target: to keep people and society safe. As an example of a brilliant product that we are developing in Tampere, check out Sirius Compact!
Your role
As a Data Engineer focused on AI development, you will be at the heart of our data operations. You’ll be responsible for building the data infrastructure that powers our machine learning models, ensuring high-quality data collection, preprocessing, and availability. You’ll work closely with Data Scientists and Machine Learning Engineers to ensure that our AI models are trained on clean, structured, and relevant data at scale.

Key Responsibilities:

  • Design and implement data pipelines to support large-scale data collection and processing for AI model development
  • Collaborate with Data Scientists to identify data requirements for training and validating machine learning models and neural networks
  • Work with other experts to define and implement scalable on-premises storage and compute infrastructure for large-scale sensor data collection and for machine learning on sensitive data
  • Build and maintain data lakes, warehouses, and real-time data streaming platforms that serve AI development needs
  • Ensure data quality by implementing data validation, anomaly detection, and integrity checks
  • Automate data extraction, transformation, and loading processes to streamline data availability for AI models
  • Monitor and optimize the performance of AI data pipelines, ensuring efficiency and low latency
  • Implement data security, privacy, and governance best practices for compliance with AI development standards
  • Collaborate with machine learning experts to deploy training data at scale, ensuring readiness for neural network experimentation

Since Saab is a global company, this role requires fluent communication in both English and Finnish, with customers and colleagues both in Finland and internationally.
This role may require some travelling, mainly occasional short visits to Sweden.
Your skills and experience

The following skills and experience will be highly beneficial in this role:

  • Master’s degree in Computer Science, Data Engineering, or a related field (or equivalent experience)
  • 3+ years of experience working as a Data Engineer with a focus on AI/ML pipelines and neural networks
  • Familiarity with machine learning models utilizing time series data
  • Proficiency in Python, SQL, and experience with data engineering tools (e.g., Apache Airflow, dbt, or Luigi)
  • Experience with big data technologies like Apache Spark, Hadoop, or distributed databases
  • Strong understanding of data collection and preprocessing techniques for training AI models
  • Hands-on experience with data storage and databases optimized for AI workloads (e.g., NoSQL, data lakes like Amazon S3 or Google Cloud Storage)
  • Experience in building real-time data pipelines (e.g., Kafka, Kinesis) for continuous model training
  • Knowledge of data formats relevant to AI (e.g., JSON, Parquet, HDF5, TFRecord)

Additional skills and experience that will be beneficial:

  • Experience with AI/ML frameworks like TensorFlow, PyTorch, or scikit-learn
  • Understanding of neural network architecture and how to support model training with optimized data pipelines
  • Experience with data annotation tools and managing labeled datasets for supervised learning
  • Knowledge of data security, privacy (e.g., GDPR, HIPAA), and ethical considerations in AI data collection
  • Familiarity with cloud-based AI infrastructure (AWS SageMaker, GCP AI Platform, Azure Machine Learning)

If the description above sounds like you, we encourage you to apply!
In this job, some of your tasks are connected to defense secrecy. Therefore, a drug test and an approved security clearance in accordance with the Finnish Security Clearance Act 726/2014 are required for the selected applicant.
What you will be part of
Our unit offers a wide range of interesting opportunities to grow and challenge yourself within the exciting field of Electronic Warfare. In addition to interesting projects and learning by doing, Saab offers a variety of training and learning paths to help you build your competence and career.
Your workplace will be the Saab Sensor Technology Center, in newly renovated premises in downtown Tampere.
On top of joining a team of highly talented professionals and fun people, we offer you a competitive benefits package, including a full lunch benefit, an exercise and culture benefit, gym facilities, transport-related options, and more. We also value a good work-life balance and offer our employees flexible working hours.
Would you like to know more about the Saab-life and our culture?
Read about Saab Careers.
Interested?
Take your chance to build your future career at Saab by submitting your CV and application via Saab Careers now. Please note that this is an ongoing recruitment process and that the position might be filled before the closing date of the advertisement.
If you aspire to help create and innovate whilst developing yourself in a challenging team setting, Saab may well have the perfect conditions for you to grow. We pride ourselves on a nurturing environment, where everyone is different yet we share the same goal – to help protect people.

REQUIREMENT SUMMARY

Min: 3.0, Max: 8.0 year(s)

Information Technology/IT

IT Software - DBA / Datawarehousing

Software Engineering

Graduate

Computer Science, Engineering

Proficient

1

Tampere, Finland