AIML - Sr Software Data Infrastructure Engineer - Data and ML Innovation

at  Apple

Cupertino, California, USA -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate02 Dec, 2024USD 312200 Annual05 Sep, 2024N/AComputer Science,Spark,Data Processing,Data Science,User Scenarios,Data Mining,Data Quality,Scala,Python,Server Side,Long Term Vision,Version Control,Data Engineering,AwsNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

SUMMARY

Posted: Aug 23, 2024
Weekly Hours: 40
Role Number:200563976
Do you get excited by driving impact via measurement and evaluation, for products and services used by hundreds of millions of people globally? The vision of AIML Data and ML Innovation organization is to improve products by using data as the voice of our customers. We are seeking a passionate and experienced Data Infrastructure Engineer to play a pivotal role in revolutionizing how we process and use substantial datasets as the heart of Siri, Search and Machine Learning. You will be instrumental in building a unified, groundbreaking data insights framework and data processing framework, powered by technologies such as Spark or Iceberg. You will collaborate closely with teams with varied strengths (i.e. Data Scientists and Analysts, other Engineering teams) to transform massive data into valuable, actionable datasets. You will also build metrics platform that fuel our innovative features and future machine learning area.

DESCRIPTION

As a Data Infrastructure Engineer, you will be at the forefront of designing and implementing a robust data processing framework to streamline log data pipelines, and a flexible data insights infrastructure, applying your expertise in Spark and Python. In this collaborative role, you’ll partner closely with the Siri, Search, and other teams to design solutions that process data and build metrics platform, which drive innovation. Your work will focus on optimizing performance, ensuring data quality, and contributing to a long-term vision that extends the existing framework’s capabilities to new user scenarios and innovative machine learning applications. We’re looking for someone who thrives on tackling data challenges at scale and stays abreast of the latest advancements in big data processing on both device and server side.

  • Demonstrated expertise in large-scale data processing, with a strong background of working with Spark and Python or Scala.
  • Understanding of distributed computing principles, data engineering and DevOps standard processes.
  • Proven programming skills in Python and Scala.
  • A genuine passion for working with data and solving complex problems at scale, in cloud platforms (AWS, GCP).
  • Experience with machine learning data mining.
  • B.S.degree in Computer Science or Data Science.

PREFERRED QUALIFICATIONS

  • Metrics infrastructure experience, including metrics sharing, management, version control.
  • PhD or MS in Computer Science.

Responsibilities:

  • Demonstrated expertise in large-scale data processing, with a strong background of working with Spark and Python or Scala.
  • Understanding of distributed computing principles, data engineering and DevOps standard processes.
  • Proven programming skills in Python and Scala.
  • A genuine passion for working with data and solving complex problems at scale, in cloud platforms (AWS, GCP).
  • Experience with machine learning data mining.
  • B.S.degree in Computer Science or Data Science


REQUIREMENT SUMMARY

Min:N/AMax:5.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

MSc

Computer Science

Proficient

1

Cupertino, CA, USA