Site Reliability Engineering Manager
at LexisNexis
Farringdon, England, United Kingdom
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 29 Nov, 2024 | Not Specified | 30 Aug, 2024 | N/A | Orchestration, Access, Happiness, Reliability Engineering, Paternity, Splunk, Containerization, Maternity, Dental Insurance, Automation Tools, Kubernetes, Children, Azure | No | No |
Description:
Site Reliability Engineering Manager
Are you an experienced manager with extensive software engineering experience?
Would you like to join our great reliability engineering team?
About our Team
At LexisNexis Intellectual Property (LNIP), our mission is to bring clarity to innovation by delivering better outcomes to the innovation community. We help innovators make more informed decisions, be more productive, and ultimately achieve superior results. By helping our customers achieve their goals, we support the development of new technologies and processes that ultimately advance humanity.
About the Role
As the Site Reliability Engineering Manager, you will oversee the production support of live systems, manage customer bugs and requests, and ensure continuous monitoring and proactive support of those systems. The ideal candidate will have a strong background in both software engineering and system administration, with a focus on maintaining the reliability, availability, and performance of our critical systems.
Responsibilities
- Leading and mentoring a team of site reliability engineers, providing guidance, training, and performance evaluations.
- Overseeing the smooth operation and availability of live systems, ensuring minimal downtime and prompt resolution of incidents.
- Leading the incident management process, including identification, troubleshooting, resolution, and post-incident analysis.
- Managing the intake, prioritization, and resolution of customer-reported bugs and requests.
- Working closely with the product, development, and customer success teams to ensure timely and effective responses to customer issues.
- Designing, implementing, and maintaining monitoring tools and processes to ensure continuous tracking of system performance, availability, and security.
- Providing regular reports on system performance, incident metrics, and customer issue resolution to senior management.
Requirements
- Demonstrate experience in site reliability engineering, production support, or a related role.
- Demonstrate a decade of software engineering experience.
- Proven experience managing and supporting live systems in a production environment.
- Strong understanding of incident management, monitoring tools, and automation processes.
- Demonstrate experience working with multiple cloud platforms (AWS, Azure), DevOps practices including CI/CD pipelines, and containerization and orchestration (Kubernetes, Docker).
- Have experience working with multiple monitoring tools such as Datadog and Splunk.
- Familiarity with automation tools and scripting languages such as Python or JavaScript.
Work in a way that works for you
We promote a healthy work/life balance across the organisation. We offer an appealing working prospect for our people. With numerous wellbeing initiatives, shared parental leave, study assistance and sabbaticals, we will help you meet your immediate responsibilities and your long-term goals.
- Working flexible hours - flexing the times when you work in the day to help you fit everything in and work when you are the most productive
Working for you
We know that your wellbeing and happiness are key to a long and successful career. These are some of the benefits we are delighted to offer:
- Generous holiday allowance with the option to buy additional days
- Health screening, eye care vouchers and private medical benefits
- Wellbeing programs
- Life assurance
- Access to a competitive contributory pension scheme
- Save As You Earn share option scheme
- Travel Season ticket loan
- Electric Vehicle Scheme
- Optional Dental Insurance
- Maternity, paternity and shared parental leave
- Employee Assistance Programme
- Access to emergency care for both the elderly and children
- RECARES days, giving you time to support the charities and causes that matter to you
- Access to employee resource groups with dedicated time to volunteer
- Access to extensive learning and development resources
- Access to employee discounts scheme via Perks at Work
About the Business
LexisNexis Legal & Professional® provides legal, regulatory, and business information and analytics that help customers increase their productivity, improve decision-making, achieve better outcomes, and advance the rule of law around the world. As a digital pioneer, the company was the first to bring legal and business information online with its Lexis® and Nexis® services.
LexisNexis, a division of RELX, is an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law. We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form: https://forms.office.com/r/eVgFxjLmAK, or please contact 1-855-833-5120.
Please read our Candidate Privacy Policy.
REQUIREMENT SUMMARY
Min: N/A, Max: 5.0 year(s)
Information Technology/IT
IT Software - Other
Other
Graduate
Proficient
1
Farringdon, United Kingdom