Observability Platform Operations Lead

at  ING

Sydney, New South Wales, Australia -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate21 Sep, 2024Not Specified23 Jun, 2024N/AGood communication skillsNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

The Centre of Expertise (CoE)for Site Reliability Engineering (SRE)supports the organization’s strategy by enabling SRE capabilities towards continuous focus on system health, reliability, availability, capacity, performance, continuity, and management of IT services.
Excellent opportunity for a Platform Operations Lead, to join our SRE COE team for this newly created position reporting to Site Reliability Engineering Lead. You’ll play a critical role in managing and maintaining our organisations observability and incident response technology infrastructure and services. You’ll be overseeing the operation and performance of our observability platforms (Splunk, Grafana stack, PagerDuty) to ensure reliability, scalability, and security.
This role combines technical expertise, leadership, and strategic thinking to drive operational excellence. It requires collaboration with large set of stakeholders across infrastructure, security, platform engineering and SRE to ensure applications run smoothly and are scalable. It is a hands-on, multi-skilled role that touches application lifecycle management, technical design, technical testing and infrastructure.

What you’ll do…

  • Lead the operation, maintenance, and optimization of our current and future observability platforms, to ensure 24/7 availability and reliability.
  • Lead incident response efforts, including root cause analysis, resolution tracking and post-incident reviews.
  • Develop, track and report on SLI/SLO and key performance indicators for the observability platforms.
  • Mentor and lead a team of platform operations Devops/Engineers.
  • Collaborate with security teams to enforce best practices, maintain platform security, and address vulnerabilities.
  • Plan and manage a backlog of support work includes but not limited to incident response, defects, vulnerabilities, and security/risk related documentation.

ABOUT US

At ING, we want to make life simpler and more worthwhile – for everyone who banks with us, for the people who work with us, and the community at large, too.
When you come to work at ING, you’re joining a team where individuality isn’t just accepted, it’s encouraged. We’ve built a culture that’s fun, friendly and supportive – it’s the kind of place where you can be yourself and make the most of whatever you have to offer.
We give people the freedom to think differently, take ownership of their work, and make great things happen. We’re here to help you get ahead. And with our global network, there’s plenty of scope to take your career in new directions, perhaps even ones you’ve never considered.
We are all about celebrating success and as a result we are proud to be a WGEA Employer of Choice for Gender Equality and a certified Family Inclusive workplace.
Sound like the kind of place you’d feel at home. We’d love to hear from you.
(One last thing, ING operates a direct talent sourcing model. So, no agency introductions, please.)
Need more? Please Contact Mia Annamalai at mia.annamlai@ing.com. Application close date 8/07/24.

Responsibilities:

  • Lead the operation, maintenance, and optimization of our current and future observability platforms, to ensure 24/7 availability and reliability.
  • Lead incident response efforts, including root cause analysis, resolution tracking and post-incident reviews.
  • Develop, track and report on SLI/SLO and key performance indicators for the observability platforms.
  • Mentor and lead a team of platform operations Devops/Engineers.
  • Collaborate with security teams to enforce best practices, maintain platform security, and address vulnerabilities.
  • Plan and manage a backlog of support work includes but not limited to incident response, defects, vulnerabilities, and security/risk related documentation


REQUIREMENT SUMMARY

Min:N/AMax:5.0 year(s)

Information Technology/IT

IT Software - Network Administration / Security

Software Engineering

Graduate

Proficient

1

Sydney NSW, Australia