Site Reliability Engineer (SRE) - Apple Maps at Apple

Cupertino, CA 95014, USA -

Full Time

Start Date

Immediate

Expiry Date

11 Aug, 25

Salary

175800.0

Posted On

11 May, 25

Experience

8 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Code, Kubernetes, Splunk, Google Cloud, Ansible, Java, Reliability Engineering, Docker, Devops, Computer Science, Python

Industry

Information Technology/IT

Description

Are you passionate about building the infrastructure backbone that supports Apple Maps-used by millions of people around the world? If you’re looking for a fast-paced, high-impact environment where Apple Maps is designed, tested, and deployed, we’d love to hear from you! The Apple Maps Application Services SRE team is looking for a self-starter, problem solver, and team player to help scale our next-generation infrastructure. You’ll take ownership of crafting and optimizing critical systems, while ensuring our stack stays modern, resilient, and performant. You will apply SRE best practices to ensure the availability, reliability, and performance of our systems and services.

DESCRIPTION

In this role, you will be a key contributor to the reliability and scalability of the backend services that power Apple Maps experiences worldwide. You’ll partner with product and engineering teams to translate requirements into resilient infrastructure designs, and bring those systems to life using modern SRE practices. Whether it’s building automation to manage large-scale service deployments, implementing observability frameworks to proactively detect and diagnose issues, or driving performance improvements across distributed systems-you will be at the forefront of operational excellence. You’ll take ownership of end-to-end service health, participating in on-call rotations and leading technical incident response when needed. As a technical leader, you’ll mentor junior engineers, contribute to design and code reviews, and help shape how Apple Maps Application Services scales into the future. This is a highly collaborative, fast-paced environment where you’ll apply a software engineering approach to tackle infrastructure challenges-and where your work will directly impact the stability and experience of millions of users every day.

MINIMUM QUALIFICATIONS

8+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Operations roles
Bachelor’s degree in Computer Science or related technical field (or equivalent proven experience)
Deep experience with container orchestration (Docker, Kubernetes); Helm knowledge is a plus
Proficiency in infrastructure-as-code tools such as Terraform, Ansible, or CloudFormation
Demonstrated ability to deploy, operate, and maintain services in Kubernetes environments
Strong understanding of SRE principles: monitoring, alerting, error budgets, and fault-tolerant design
Hands-on experience with observability tools like Prometheus, Grafana, OpenTelemetry, and Splunk
Proficiency in Linux systems, networking concepts, and systems management fundamentals
Proficient in at least one modern language: Java, Python, or Go
Skilled at identifying architectural gaps and driving solutions across partner teams

PREFERRED QUALIFICATIONS

Hands-on experience with multi-cloud environments, particularly AWS and Google Cloud
Proven contributions to cloud migration or modernization projects
Strong ability to communicate complex infrastructure concepts to both technical and business stakeholders
Experience leading large-scale infrastructure projects or multi-functional engineering initiatives

Responsibilities

Please refer the Job description for details