DevOps and SRE Lead | Telecom Domain | London at Infosys
London, England, United Kingdom
Full Time


Start Date

Immediate

Expiry Date

18 Jan, 25

Salary

0.0

Posted On

18 Oct, 24

Experience

12 year(s) or above

Remote Job

No

Telecommute

No

Sponsor Visa

No

Skills

High Analytical Skills, Communication Skills, Infrastructure, Python, Docker, Code, Scripting Languages, New Relic, Global Delivery, Addition, Personality Profile

Industry

Information Technology/IT

Description

Role: DevOps and SRE Lead
Skills: Azure Cloud Platform, Terraform, Ansible, Python/Bash scripting, GitLab, Kubernetes, Docker
Domain: Telecom OSS/BSS (B2B)
Location: London

Required:

  • Must have a proven track record of leading and implementing DevOps and SRE practices in complex environments using COTS products
  • Must have experience in Telco Domain
  • Must have experience in B2B
  • Must be able to articulate ideas clearly to different audiences, both technical and non-technical
  • Must be able to oversee the design, development, and implementation of DevOps pipelines and infrastructure, ensuring efficient and reliable delivery of software applications

Key Responsibilities

  • DevOps Strategy: Develop and implement a comprehensive DevOps strategy aligned with the organization’s goals, focusing on automation, collaboration, and continuous improvement.
  • Infrastructure Management: Oversee the design, implementation, and maintenance of cloud-based infrastructure (e.g., AWS, GCP, Azure).
  • Automation: Lead the development and implementation of automation tools and scripts to streamline processes, reduce manual effort, and improve efficiency.
  • CI/CD Pipelines: Build and maintain robust continuous integration and continuous delivery (CI/CD) pipelines to automate the software delivery process.
  • SRE Practices: Implement Site Reliability Engineering (SRE) practices to ensure high system availability, reliability, and performance.
  • Monitoring and Alerting: Establish effective monitoring and alerting systems to proactively identify and address issues.
  • Incident Management: Lead incident response efforts, coordinating with relevant teams to resolve issues promptly and effectively.
  • Team Management: Mentor and develop a team of DevOps and SRE engineers, fostering a culture of innovation and continuous learning.
  • Collaboration: Collaborate with development, testing, and operations teams to ensure smooth software delivery and support.
  • Best Practices: Stay up-to-date with industry trends and best practices in DevOps and SRE.

Required Skills

  • 12+ years of experience in DevOps and/or SRE roles.
  • Strong understanding of cloud platforms, preferably Azure (good to have: AWS, GCP), and infrastructure as code (IaC) tools (Terraform, Ansible).
  • Proficiency in scripting languages (Python, Bash).
  • Experience with CI/CD pipelines, preferably GitLab (good to have: ADO, CircleCI).
  • Knowledge of containerization technologies (Docker, Kubernetes).
  • Experience with monitoring and alerting tools, preferably ELK (good to have: Prometheus, Grafana, New Relic).
  • Excellent problem-solving and troubleshooting skills.
  • Strong leadership and communication skills.
  • Ability to work independently and as part of a team.

Preferred

  • Certification in DevOps or cloud technologies.
  • Experience with microservices architecture.
  • Knowledge of security best practices.