Observability Lead at Rest
Sydney, New South Wales, Australia
Full Time


Start Date

Immediate

Expiry Date

29 Jun, 26

Salary

0.0

Posted On

31 Mar, 26

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Observability Strategy, Operational Excellence, Monitoring, Alerting, Incident Response, Datadog, System Reliability, Compliance, Scalability, Root Cause Analysis, Community of Practice, CPS230 Reporting, Incident Management, IaC, DevOps, CI/CD

Industry

Financial Services

Description
Company Description

Supporting millions of Aussies since 1988 with low fees and competitive long-term performance. Profits back to members, not shareholders.

Closing date: 14th April 2026

Please note:
Rest does not accept speculative resumes from recruitment agencies.
Rest will review applications prior to the closing date and may close the role earlier.

Job Description

Established in 1988, Rest is one of Australia’s largest profit-to-member superannuation funds. We support more than two million members, with around $100 billion of funds under management, and are recognised as a responsible investment leader*. We believe when members understand and engage with their super, they’re more likely to get a better retirement outcome.

Everything we do at Rest is underpinned by our values and behaviours: we want to Be Daring, Keep it Simple, Take Action and Have Grit. To put it simply, we want our people to thrive and love the work they do.

The Observability Lead is responsible for leading Rest’s observability strategy, ensuring operational excellence, and driving continuous improvement in monitoring, alerting, and incident response. This role combines hands-on operational support with strategic governance and tooling ownership, and initially leads one observability engineer. The Observability Lead will triage and investigate alerts, define observability standards, and manage enterprise observability platforms (primarily Datadog) to ensure system reliability, compliance, and scalability, with a focus on the end-to-end user journey.

Key Responsibilities

Own and maintain the Enterprise Observability Strategy, roadmap, and maturity model.
Oversee incident detection and alerting processes to ensure timely response.
Lead and contribute to Post-Incident Reviews (PIRs), offering observability insights and identifying monitoring/alerting gaps.
Support incident triage and investigation using logs, metrics, and traces to accelerate root cause analysis.
Define and maintain observability standards, frameworks, and maturity models across the organisation.
Establish and run an Observability Community of Practice, fostering collaboration and knowledge sharing.
Ensure compliance with internal audit, security, and data privacy policies, embedding observability into governance practices.
Deliver reporting in compliance with CPS230, ensuring regulatory obligations are met.

Qualifications

Required Qualifications

Bachelor’s or master’s degree in Computer Science, Information Technology, Engineering, or a related field.
Proven experience in observability, monitoring, or site reliability engineering (SRE).
Strong knowledge of incident management processes, including triage, escalation, and PIRs.
Hands-on expertise with Datadog or similar observability platforms (e.g., Prometheus, Grafana, Splunk).

Preferred

Background in cloud-native environments (AWS, Azure, GCP).
Experience with automation and Infrastructure-as-Code (IaC) for observability tooling.
Knowledge of DevOps practices and CI/CD pipelines.
Strong analytical skills with the ability to interpret complex logs, metrics, and traces.
Experience defining and implementing observability standards, frameworks, and maturity models.
Familiarity with compliance frameworks (audit, security, data privacy) and regulatory reporting (e.g., CPS230).

Desirable certifications

AWS-specific certifications are highly valuable (AWS Solutions Architect, Developer, SysOps, etc.)
Observability Foundation – DevOps Institute
Cribl Certified Observability Engineer (CCOE) – Cribl
Site Reliability Engineering (SRE) Foundation / Practitioner – DevOps Institute
Datadog Fundamentals – Datadog

Additional Information

Our benefits have been designed so you can tailor your experience with us and include:

Personal and professional development opportunities
Hybrid working
Purchase leave scheme and gender neutral 16 weeks paid parental leave
Super Contribution Continuation for 12 Months of parental leave
LinkedIn Learning
Income Protection Insurance
Rest Excellence awards (peer recognition awards based on Rest’s values and behaviours)
Rest Stops - meeting free breaks

If you share our values, believe you can help make a difference for our members and want to be part of a leading superannuation fund with a Super culture, please click Apply Now.

Rest is committed to creating a flexible work environment and culture that embraces diversity, equity, and inclusion - where people feel welcome, safe to be themselves and inspired to do their best. We value the different backgrounds, lived experiences and abilities our diverse team brings. We welcome and encourage applications from candidates of all ages, cultural backgrounds, faiths, gender identities, sexual orientations and thinking styles. This includes people with disability, neurodiverse individuals, Aboriginal & Torres Strait Islander peoples and those with disrupted work history due to career or other breaks.

Please note only people with the right to work in Australia will be considered.

*Funds under management as at 31 July 2025. Rest is recognised as a Responsible Investment Leader by the Responsible Investment Association Australia (RIAA) in its Responsible Investment Benchmark Report 2022.
Responsibilities
The Observability Lead will own and maintain the Enterprise Observability Strategy, roadmap, and maturity model, while overseeing incident detection and alerting processes to ensure timely response. This role involves leading an engineer, defining standards, managing enterprise platforms like Datadog, and supporting incident triage to accelerate root cause analysis.