DevOps and SRE Lead | Telecom Domain | London at Infosys

London, England, United Kingdom

Full Time

Start Date

Immediate

Expiry Date

18 Jan, 25

Salary

0.0

Posted On

18 Oct, 24

Experience

12 year(s) or above

Remote Job

Telecommute

Sponsor Visa

Skills

High Analytical Skills, Communication Skills, Infrastructure, Python, Docker, Code, Scripting Languages, New Relic, Global Delivery, Addition, Personality Profile

Industry

Information Technology/IT

Description

Role: DevOps and SRE Lead
Skills: Azure Cloud Platform, Terraform, Ansible, Python/Bash scripting, GitLab, Kubernetes, Docker
Domain: Telecom OSS/BSS (B2B)
Location: London

Required:

Must have a proven track record of leading and implementing DevOps and SRE practices in complex environments using COTS products
Must have experience in Telco Domain
Must have experience in B2B
Must be able to articulate ideas to different audiences, both technical and non-technical
Must be able to oversee the design, development, and implementation of DevOps pipelines and infrastructure, ensuring efficient and reliable delivery of software applications

Key Responsibilities

DevOps Strategy: Develop and implement a comprehensive DevOps strategy aligned with the organization’s goals, focusing on automation, collaboration, and continuous improvement.
Infrastructure Management: Oversee the design, implementation, and maintenance of cloud-based infrastructure (e.g., AWS, GCP, Azure).
Automation: Lead the development and implementation of automation tools and scripts to streamline processes, reduce manual effort, and improve efficiency.
CI/CD Pipelines: Build and maintain robust continuous integration and continuous delivery (CI/CD) pipelines to automate the software delivery process.
SRE Practices: Implement Site Reliability Engineering (SRE) practices to ensure high system availability, reliability, and performance.
Monitoring and Alerting: Establish effective monitoring and alerting systems to proactively identify and address issues.
Incident Management: Lead incident response efforts, coordinating with relevant teams to resolve issues promptly and effectively.
Team Management: Mentor and develop a team of DevOps and SRE engineers, fostering a culture of innovation and continuous learning.
Collaboration: Collaborate with development, testing, and operations teams to ensure smooth software delivery and support.
Best Practices: Stay up-to-date with industry trends and best practices in DevOps and SRE.

Required Skills

12+ years of experience in DevOps and/or SRE roles.
Strong understanding of cloud platforms, preferably Azure (AWS and GCP are good to have), and infrastructure as code (IaC) tools (Terraform, Ansible).
Proficiency in scripting languages (Python, Bash).
Experience with CI/CD pipelines, preferably GitLab (ADO and CircleCI are good to have).
Knowledge of containerization technologies (Docker, Kubernetes).
Experience with monitoring and alerting tools, preferably ELK (Prometheus, Grafana, and New Relic are good to have).
Excellent problem-solving and troubleshooting skills.
Strong leadership and communication skills.
Ability to work independently and as part of a team.

Preferred

Certification in DevOps or cloud technologies.
Experience with microservices architecture.
Knowledge of security best practices.
