Observability Platform Operations Lead
at ING
Sydney, New South Wales, Australia -
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 21 Sep, 2024 | Not Specified | 23 Jun, 2024 | N/A | Good communication skills | No | No |
Required Visa Status:
Citizen | GC |
US Citizen | Student Visa |
H1B | CPT |
OPT | H4 Spouse of H1B |
GC Green Card |
Employment Type:
Full Time | Part Time |
Permanent | Independent - 1099 |
Contract – W2 | C2H Independent |
C2H W2 | Contract – Corp 2 Corp |
Contract to Hire – Corp 2 Corp |
Description:
The Centre of Expertise (CoE)for Site Reliability Engineering (SRE)supports the organization’s strategy by enabling SRE capabilities towards continuous focus on system health, reliability, availability, capacity, performance, continuity, and management of IT services.
Excellent opportunity for a Platform Operations Lead, to join our SRE COE team for this newly created position reporting to Site Reliability Engineering Lead. You’ll play a critical role in managing and maintaining our organisations observability and incident response technology infrastructure and services. You’ll be overseeing the operation and performance of our observability platforms (Splunk, Grafana stack, PagerDuty) to ensure reliability, scalability, and security.
This role combines technical expertise, leadership, and strategic thinking to drive operational excellence. It requires collaboration with large set of stakeholders across infrastructure, security, platform engineering and SRE to ensure applications run smoothly and are scalable. It is a hands-on, multi-skilled role that touches application lifecycle management, technical design, technical testing and infrastructure.
What you’ll do…
- Lead the operation, maintenance, and optimization of our current and future observability platforms, to ensure 24/7 availability and reliability.
- Lead incident response efforts, including root cause analysis, resolution tracking and post-incident reviews.
- Develop, track and report on SLI/SLO and key performance indicators for the observability platforms.
- Mentor and lead a team of platform operations Devops/Engineers.
- Collaborate with security teams to enforce best practices, maintain platform security, and address vulnerabilities.
- Plan and manage a backlog of support work includes but not limited to incident response, defects, vulnerabilities, and security/risk related documentation.
ABOUT US
At ING, we want to make life simpler and more worthwhile – for everyone who banks with us, for the people who work with us, and the community at large, too.
When you come to work at ING, you’re joining a team where individuality isn’t just accepted, it’s encouraged. We’ve built a culture that’s fun, friendly and supportive – it’s the kind of place where you can be yourself and make the most of whatever you have to offer.
We give people the freedom to think differently, take ownership of their work, and make great things happen. We’re here to help you get ahead. And with our global network, there’s plenty of scope to take your career in new directions, perhaps even ones you’ve never considered.
We are all about celebrating success and as a result we are proud to be a WGEA Employer of Choice for Gender Equality and a certified Family Inclusive workplace.
Sound like the kind of place you’d feel at home. We’d love to hear from you.
(One last thing, ING operates a direct talent sourcing model. So, no agency introductions, please.)
Need more? Please Contact Mia Annamalai at mia.annamlai@ing.com. Application close date 8/07/24.
Responsibilities:
- Lead the operation, maintenance, and optimization of our current and future observability platforms, to ensure 24/7 availability and reliability.
- Lead incident response efforts, including root cause analysis, resolution tracking and post-incident reviews.
- Develop, track and report on SLI/SLO and key performance indicators for the observability platforms.
- Mentor and lead a team of platform operations Devops/Engineers.
- Collaborate with security teams to enforce best practices, maintain platform security, and address vulnerabilities.
- Plan and manage a backlog of support work includes but not limited to incident response, defects, vulnerabilities, and security/risk related documentation
REQUIREMENT SUMMARY
Min:N/AMax:5.0 year(s)
Information Technology/IT
IT Software - Network Administration / Security
Software Engineering
Graduate
Proficient
1
Sydney NSW, Australia