AIS - Senior Site Reliability Engineer

at Apple

London, England, United Kingdom

Start Date: Immediate
Expiry Date: 19 Feb, 2025
Salary: Not Specified
Posted On: 19 Nov, 2024
Experience: N/A
Skills: Splunk, Accountability, Collaboration, Performance Tuning, Shell Scripting, System Administration, Docker, Documentation, Kubernetes, Difficult Situations, Computer Science, Communication Skills, Telemetry
Telecommute: No
Sponsor Visa: No

Description:

SUMMARY

Posted: 18 Oct 2024
Role Number: 200571905
Imagine what you could do here. At Apple, new ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there’s no telling what you can accomplish. We are seeking an extraordinary individual who is passionate about reliability engineering, software development, privacy, and information security with a desire to work in hyper-scale environments. The ideal candidate will have a strong background in production monitoring, a deep understanding of software development and operations, and a proven track record in managing large-scale production environments. This is a technical hands-on role that is focused on the delivery and support of technologies used to protect Apple.

DESCRIPTION

Our team is highly collaborative and works closely with partner teams to deliver the best results for Apple. For each engineering challenge we face, we strive to find the best solution while also considering the need to get things done efficiently. As an SRE in Apple Information Security, you will:

  • Operate, monitor, and triage all aspects of our production and non-production environments
  • Pioneer and implement the next-generation telemetry system for AIS services
  • Establish alert handling procedures and runbooks, and collaborate with our global security team
  • Automate deployment and orchestration of services into the cloud environment, as well as other routine processes
  • Actively participate in capacity planning, business continuity, and disaster recovery planning
  • Support partner teams across the enterprise
  • Cultivate and maintain relationships with internal teams and external third-party vendors

MINIMUM QUALIFICATIONS

  • Professional experience in Site Reliability Engineering, DevOps, or a related field
  • Experience working with cloud compute environments like OpenStack, AWS, GCP or Azure
  • Experience with infrastructure as code (IaC), configuration management, CI/CD, and automation, e.g., Terraform, Pulumi, CloudFormation, CDK, Ansible, Chef, Puppet, Jenkins
  • Strong proficiency in software development using Python, Rust, and/or Go programming languages

PREFERRED QUALIFICATIONS

  • Bachelor’s degree in Computer Science, or a related field, or equivalent practical experience
  • Extensive experience administering, performance tuning and troubleshooting Linux systems
  • Excellent troubleshooting, problem solving, and debugging skills
  • Ability to cultivate an environment that emphasizes collaboration, accountability, and excellence
  • Excellent written and verbal communication skills
  • Ability to work under pressure and manage difficult situations in a dynamic work environment
  • Thrives in a fast-paced environment and adopts a learning mindset; loves learning new technologies
  • Proficiency in implementing and correlating telemetry using monitoring and observability tools: Splunk, Grafana, Prometheus, ELK, SumoLogic or the like
  • Experience in shell scripting (e.g., bash/tcsh/zsh)
  • Experience with system administration in large environments
  • Experience with measuring, analyzing, and optimizing performance
  • Experience operating with Scrum/Agile development methodologies
  • Strong understanding of concurrency, parallelism, and distributed system concepts
  • Passion for high-quality code, unit-tests, documentation, and production services
  • Previous experience working on a global team with a 24/7 support model
  • Building and operating container orchestration systems (Docker, Kubernetes, Vagrant, and microservices)


REQUIREMENT SUMMARY

Min: N/A, Max: 5.0 year(s)

Information Technology/IT

IT Software - Application Programming / Maintenance

Software Engineering

Graduate

Computer science or a related field or equivalent practical experience

Proficient

1

London, United Kingdom