Senior Site Reliability Engineer

at  Department for Work and Pensions

NUT, England, United Kingdom -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate29 Oct, 2024GBP 78517 Annual30 Jul, 2024N/ACode,Security,Reliability Engineering,Scripting,Logging,Infrastructure,Norway,Management Skills,Performance ManagementNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

JOB DESCRIPTION

The SRE team will put you in the position to work with application teams across the department on developing reliable and secure solutions to provide to citizens across the UK.
You will work with development teams from the design phase to help them use good practice and department standards when building their application infrastructure.

Additionally, responsibilities of the role will include:

  • Contributing authoritative advice and guidance to others in the organisation and externally
  • Design and develop the techniques for improving application reliability, run books, knowledge transfer to DWP Digital’s User Experience Command Centre (UXCC), and ongoing SRE strategy within your Functional and Professional Communities
  • Manage the error budget agreed with the product owner for the application and ensure that work is balanced in alignment with it
  • Act as the focal point for the investigation and resolution of major or complex incidents for the service, ensuring people with the right skills and expertise are proactively available to respond effectively
  • Assess the impact of change requests in consultation with stakeholders, providing technical expertise and authorising the implementation of subsequent changes
  • Manage on-call rotations such that all applications have out-of-hours SRE coverage
  • Coach and mentor application development and operations engineers in the practice and techniques of SRE
  • Conduct retrospectives for all high priority and major incidents ensuring they are done quickly and published
  • Routinely seek views and capture ideas from stakeholders and team members for improvements and encourage collaboration and innovation
  • Interdepartmental discussions and meetings with a wide variety of external bodies and organisations on a local, regional, national or international basis, leading community discussions about SRE best practice within Engineering

Check out these blogs about Working in DWP’s hybrid cloud services group and Sam’s life in the clouds

NATIONALITY REQUIREMENTS

This job is broadly open to the following groups:

  • UK nationals
  • nationals of the Republic of Ireland
  • nationals of Commonwealth countries who have the right to work in the UK
  • nationals of the EU, Switzerland, Norway, Iceland or Liechtenstein and family members of those nationalities with settled or pre-settled status under the European Union Settlement Scheme (EUSS) (opens in a new window)
  • nationals of the EU, Switzerland, Norway, Iceland or Liechtenstein and family members of those nationalities who have made a valid application for settled or pre-settled status under the European Union Settlement Scheme (EUSS)
  • individuals with limited leave to remain or indefinite leave to remain who were eligible to apply for EUSS on or before 31 December 2020
  • Turkish nationals, and certain family members of Turkish nationals, who have accrued the right to work in the Civil Service

Further information on nationality requirements (opens in a new window)

Responsibilities:

Additionally, responsibilities of the role will include:

  • Contributing authoritative advice and guidance to others in the organisation and externally
  • Design and develop the techniques for improving application reliability, run books, knowledge transfer to DWP Digital’s User Experience Command Centre (UXCC), and ongoing SRE strategy within your Functional and Professional Communities
  • Manage the error budget agreed with the product owner for the application and ensure that work is balanced in alignment with it
  • Act as the focal point for the investigation and resolution of major or complex incidents for the service, ensuring people with the right skills and expertise are proactively available to respond effectively
  • Assess the impact of change requests in consultation with stakeholders, providing technical expertise and authorising the implementation of subsequent changes
  • Manage on-call rotations such that all applications have out-of-hours SRE coverage
  • Coach and mentor application development and operations engineers in the practice and techniques of SRE
  • Conduct retrospectives for all high priority and major incidents ensuring they are done quickly and published
  • Routinely seek views and capture ideas from stakeholders and team members for improvements and encourage collaboration and innovation
  • Interdepartmental discussions and meetings with a wide variety of external bodies and organisations on a local, regional, national or international basis, leading community discussions about SRE best practice within Engineerin


REQUIREMENT SUMMARY

Min:N/AMax:5.0 year(s)

Other Industry

IT Software - Other

Other

Graduate

Proficient

1

Newcastle upon Tyne, United Kingdom