SRE MDPL Engineer
at ING
40-121 Katowice, Silesian Voivodeship, Poland
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 26 Dec, 2024 | Not Specified | 28 Sep, 2024 | N/A | Good communication skills | No | No |
Description:
We are looking for you if you:
- have experience in operating system administration (Linux or Windows),
- know key cloud concepts and can describe what cloud-native means,
- understand and have knowledge of other stack layers – Network, Virtualization, Middleware, Databases,
- have a good understanding of programming (preferred languages: Python, PowerShell, Golang),
- know how to use IaC/orchestration/automation tooling like Azure Pipelines, Ansible, Terraform,
- can identify and automate infrastructure management tasks using infrastructure-as-code best practices,
- know key reliability engineering practices and the idea of consumer engineering, and acronyms like SLI, MTTR and BCM are more than a few random letters glued together to you.
English level: B2.
You’ll get extra points for:
- valuing your time and not logging in to a host to run commands; Infrastructure as Code is your creed,
- preventing incidents from happening rather than solving them,
- always being a step ahead and using new technologies,
- energy and efficiency,
- being a problem solver, not a spotter,
- being a team player,
- working with minimal supervision.
Your responsibilities:
As the Site Reliability Engineering Department, we focus on four key topics:
- Run & Change,
- Enablement,
- Rapid Response,
- Education.
In your role you will mainly focus on:
- Implementation of reliability across global platforms & services, global supporting tooling and entities:
  - Operating in close cooperation with the Enterprise Architects, other SREs & DevOps engineers involved,
  - Implementing observability for our critical business services via the respective tooling,
  - Identifying service level objectives with associated indicators,
  - Finding and eliminating manual and repetitive tasks (commonly known as toil),
  - Planning and evaluating new feature releases within the infrastructure environment (release trains).
- Later on, the focus will also be on other practices, e.g.:
  - Maturing the major incident management process (major incident management, problem management, post-mortem & root-cause analysis),
  - Maturing the capacity planning & forecasting practice,
  - Maturing reliability reporting,
  - Introducing error budgeting,
  - Knowledge management: spreading the “reliability by design” concept and executing all required reliability practices.
Information about the squad:
We are a team of infrastructure admins who got tired of manual work and decided to move to an Infrastructure as Code approach. We want to prevent, not repair, and to make our systems reliable. Taking the best approaches from Google and Microsoft, we want to create a culture of SRE engineering with a focus on Design, Run, Enable, Rapid Response, Educate and Review. Are you up for the challenge?
The role naming convention in the global ING job architecture will be “Engineer IV”.
REQUIREMENT SUMMARY
Min: N/A, Max: 5.0 year(s)
Information Technology/IT
IT Software - Other
Software Engineering
Graduate
Proficient
1
40-121 Katowice, Poland