Site Reliability Engineer, iCloud
at Apple
London, England, United Kingdom -
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 13 Aug, 2024 | Not Specified | 14 May, 2024 | N/A | Encryption,Fault Analysis,Python,Rust,Systems Management,Distributed Systems,Computer Science,Infrastructure Services,Ownership,Swift,Java,Automation | No | No |
Required Visa Status:
Citizen | GC |
US Citizen | Student Visa |
H1B | CPT |
OPT | H4 Spouse of H1B |
GC Green Card |
Employment Type:
Full Time | Part Time |
Permanent | Independent - 1099 |
Contract – W2 | C2H Independent |
C2H W2 | Contract – Corp 2 Corp |
Contract to Hire – Corp 2 Corp |
Description:
SUMMARY
Posted: 11 Apr 2024
Role Number:200538240
People at Apple don’t just build products - they craft experiences our customers love and depend on. Apple Services Engineering (ASE) builds and supports the systems that make many of these daily experiences possible. If you’ve used Apple products, you’ve likely interacted with us. iCloud Services SRE teams are responsible for the systems and services that directly support those customers and their experiences. We focus on availability and automation of key services that run iCloud every minute of every day all around the world.
KEY QUALIFICATIONS
- Experience with large scale distributed systems. Experience with ML infrastructure services, including LLMs, Generative AI, and transformers desired.
- In-depth knowledge of one or more of core operating system principles, networking fundamentals, and systems management.
- Demonstrable advanced experience in at least one of Java, Python, Swift, Rust or GoLang and building distributed services/applications.
- Awareness of key security principles including encryption and keys (types and exchange protocols).
- Thorough understanding of SRE principals including monitoring, alerting, error budgets, fault analysis, and automation.
- Strong sense of ownership with a desire to communicate and collaborate with other engineers and teamsExperience in mentoring and developing more junior engineers.
DESCRIPTION
We are looking for an SRE with experience building and supporting machine learning (ML) infrastructure. You will apply SRE best practices to ensure the availability, reliability, and performance of our ML systems and services. You will actively engage with our development partners and product teams regularly so the ML services are well aligned with business needs. If you love designing and running systems and infrastructure that will delight millions of customers this team is for you. Responsibilities will include: Support and maintain ML services by measuring and monitoring availability, latency, and overall system health Deploy and support existing and new ML models and infrastructure Provide insights to partner stakeholders through log and telemetry analysis Maintaining documentation and automating manual processes where possible Be part of an oncall rotation providing hands-on technical expertise during service impacting events Collaborate with other engineers on code, infrastructure, and design reviews, and process enhancements
EDUCATION & EXPERIENCE
BS degree in Computer Science or equivalent field.
ADDITIONAL REQUIREMENTS
Additional Requirement
Responsibilities:
Please refer the Job description for details
REQUIREMENT SUMMARY
Min:N/AMax:5.0 year(s)
Information Technology/IT
IT Software - Other
Software Engineering
BSc
Computer Science
Proficient
1
London, United Kingdom