Senior Data Engineer at EPAM Systems Inc
Praha, Praha, Czech -
Full Time


Start Date

Immediate

Expiry Date

22 Apr, 25

Salary

0.0

Posted On

23 Jan, 25

Experience

0 year(s) or above

Remote Job

No

Telecommute

No

Sponsor Visa

No

Skills

Github, Apache Spark, Working Experience, Azure, Landscape, Aws, Scala

Industry

Information Technology/IT

Description

We are looking for a Senior Data Engineer to make a team even stronger.

REQUIREMENTS

  • Expertise in Apache Spark along with Spark streaming & Spark SQL
  • Good hands on experience with Databricks and delta-lake
  • Fluency in Scala programming language
  • Good understanding & hands-on experience with CI/CD
  • Rich working experience with Github
  • Fluency working with any cloud (AWS, Azure, GCP) landscape
  • Ability to build Apache Airflow pipelines
Responsibilities
  • Develop, monitor, and operate the most used and most critical curated data pipeline - Sales Order Data (incl. Post-order information, e.g. shipment, return, payment). This pipeline is processing hundreds of millions of records to provide high-quality datasets for analytical and machine learning use-cases
  • Consulting with analysts, data scientists, and product managers to build and continuously improve “Single Source of Truth” KPI for business steering such as the central Profit Contribution measurement (PC II)
  • Leverage and improve a cloud-based tech stack that includes AWS(Azure), Databricks, Kubernetes, Spark, Airflow, Python, and Scala
Loading...