Sr Specialist Site Reliability Engineer at Waystar
Louisville, Kentucky, USA -
Full Time


Start Date

Immediate

Expiry Date

28 Nov, 25

Salary

0.0

Posted On

28 Aug, 25

Experience

7 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Performance Tuning, Financial Services, Software Development, Capacity Planning

Industry

Information Technology/IT

Description

ABOUT THIS POSITION

We are seeking a highly skilled and proactive Senior Specialist, Site Reliability Engineering (SRE) to help drive reliability, scalability, and performance across our critical platforms. This role is ideal for a senior-level engineer who combines deep technical expertise with a passion for automation, observability, and operational excellence.
As a Senior Specialist, you’ll work on complex reliability challenges, lead technical initiatives, and collaborate across engineering, product, and infrastructure teams to ensure our systems are resilient and efficient.

How To Apply:

Incase you would like to apply to this job directly from the source, please click here

Responsibilities
  • Reliability Engineering
  • Architect and implement solutions to improve system reliability, scalability, and performance.
  • Define and manage SLIs/SLOs and error budgets across services.
  • Lead efforts to automate operational tasks and improve system observability.
  • Incident Management & Root Cause Analysis
  • Serve as a technical lead during major incidents and drive resolution.
  • Conduct deep root cause analyses and implement long-term fixes.
  • Champion blameless postmortems and continuous improvement.
  • Technical Leadership
  • Lead cross-functional reliability initiatives and mentor junior engineers.
  • Influence system design and architecture to embed reliability from the ground up.
  • Collaborate with software engineers to optimize deployment pipelines and infrastructure.
  • Monitoring & Tooling
  • Enhance observability through metrics, logging, and tracing.
  • Develop and maintain dashboards, alerts, and automated recovery systems.
Loading...