Software Engineer, Reliability Engineering, AiDP at Apple
Texas, United States
Full Time


Start Date

Immediate

Expiry Date

09 Mar, 26

Salary

0.0

Posted On

09 Dec, 25

Experience

2 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Python, Java, Go, Kubernetes, Docker, CI/CD, Troubleshooting, Performance Analysis, Distributed Systems, Big Data, Spark, Flink, Iceberg, GenAI, ML, Linux

Industry

Computers and Electronics Manufacturing

Description
The Applied Machine Learning team in the AI and Data Platform (AiDP) org has been at the forefront of accelerating digital transformation through machine learning across Apple's enterprise ecosystem. We build and operate ML, GenAI, Inference, and Data Platforms and Services that provide a comprehensive suite of capabilities, serving business-critical needs across Apple's enterprise. We work on interesting and hard challenges related to scale and performance across a diverse set of open-source and cutting-edge technologies.
We are looking for a talented engineer to join our team and bring a passion for building and operating large-scale platforms and distributed systems, leveraging cutting-edge open-source technologies across hybrid cloud environments. As a software engineer in AiDP reliability engineering, you will work on one or more projects related to GenAI, ML, Inference, and Big Data platforms.
Minimum Qualifications
BS/MS in computer science or equivalent experience.
2+ years of programming experience in at least one of the following languages: Python, Java, or Go.
2+ years of experience with Kubernetes, Docker, or another container orchestration framework.
Preferred Qualifications
Ability to read and explain open-source codebases.
Experience deploying and managing CI/CD pipelines.
Strong expertise in troubleshooting complex production issues.
Ability to understand complex architectures and comfort working with multiple teams.
Ability to conduct performance analysis and troubleshoot large-scale distributed systems.
Highly proactive, with a keen focus on improving the uptime and availability of our mission-critical services.
Experience with big data technologies (Spark, Flink, Iceberg) or emerging GenAI/ML technologies (such as Ray, MLflow, or model serving).
Experience with Linux, database, and security concepts.
Responsibilities
As a software engineer in AiDP reliability engineering, you will work on projects related to GenAI, ML, Inference, and Big Data platforms. You will be involved in building and operating large-scale platforms and distributed systems.