Site Reliability Engineer Senior Lead

at Mars

Slough SL1, England, United Kingdom

Start Date: Immediate
Expiry Date: 26 Nov, 2024
Salary: Not Specified
Posted On: 29 Aug, 2024
Experience: 3 year(s) or above
Skills: Good communication skills
Telecommute: No
Sponsor Visa: No

Description:

The Systems Reliability Engineering (SRE) Senior Lead is a pivotal leader within our organization, responsible for ensuring the reliability, performance, and scalability of our critical systems. This role is instrumental in strategizing and overseeing reliability with an end-to-end service delivery perspective, aligning technical infrastructure with business objectives to meet evolving customer needs. As an influential figure in our company, the Systems Reliability Engineering Senior Lead will spearhead initiatives to automate infrastructure, enhance system observability, and drive the transformation of our IT operations.

Responsibilities:

The Systems Reliability Engineering Senior Lead ensures that the deployed technology stack can be supported in line with business requirements, with a focus on the infrastructure tech stack and the IT Operations support model:

System Reliability, Performance, and Best Practices:

  • Design, implement, and maintain highly available and scalable systems.
  • Monitor system performance, reliability, and security using advanced monitoring and logging tools.
  • Proactively identify and resolve issues that could impact service availability.
  • Conduct assessments to ensure systems comply with market standards and best practices.

Automation and Infrastructure as Code (IaC):

  • Develop and maintain automated CI/CD pipelines to streamline deployments.
  • Implement Infrastructure as Code (IaC) using tools such as Terraform, Ansible, or others.
  • Automate repetitive tasks to increase system efficiency and reliability.

Collaboration and DevOps Culture:

  • Collaborate with software development teams to ensure new features are built with reliability in mind.
  • Advocate for best practices in software engineering, deployment, and operations, and foster a culture of collaboration and continuous improvement across teams.

Capacity Planning and Scaling:

  • Conduct capacity planning to anticipate future growth and scaling needs.
  • Implement strategies to efficiently scale systems based on demand.


REQUIREMENT SUMMARY

Min: 3.0, Max: 7.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

Graduate

Information technology, computer science, business management, or a related field

Proficient

1

Slough SL1, United Kingdom