Staff Site Reliability Engineer at Pura
Pleasant Grove, UT 84062, USA
Full Time


Start Date

Immediate

Expiry Date

21 Nov, 25

Salary

0.0

Posted On

21 Aug, 25

Experience

15 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Python, Architecture, Distributed Systems, Node.js, Infrastructure, Programming Languages, Code

Industry

Information Technology/IT

Description

QUALIFICATIONS:

  • 15+ years of experience as a Site Reliability Engineer or in a similar role, with a proven track record of architecting solutions for large-scale distributed systems.
  • Expert-level proficiency in multiple programming languages or runtimes, such as Python, Go, or Node.js, with demonstrated experience building complex automation frameworks and infrastructure tools.
  • Comprehensive mastery of cloud technologies, particularly AWS and GCP, including experience architecting multi-region, highly available systems.
  • Deep expertise in Kubernetes administration and architecture, including experience operating large-scale clusters, implementing custom controllers, and optimizing cluster performance.
  • Extensive experience with advanced observability platforms and practices, including implementing custom monitoring solutions and developing sophisticated alerting strategies.
  • Proven track record of designing and implementing complex IAM architectures for enterprise-scale organizations.
  • Distinguished expertise in Infrastructure as Code, particularly with Terraform, including experience developing custom providers and managing multi-cloud deployments.
  • Exceptional problem-solving abilities with demonstrated experience resolving critical production issues in complex, high-stakes environments.

JOIN THE PURA TEAM!

We’re looking for individuals who believe in the power of fragrance and technology to transform lives. If you’re ready to be part of a dynamic, fast-growing company at the forefront of an exciting industry, we’d love to hear from you.

Pura provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

  • All candidates are subject to a background check.

Responsibilities

In this high-impact staff-level role, you will:

  • Architect, design, and implement enterprise-scale infrastructure solutions supporting Web, Mobile, Backend, and Data engineering teams, while providing technical leadership across cross-functional groups.
  • Define and drive adoption of reliability standards, architectural patterns, and engineering best practices across the organization, working closely with engineering and security leadership.
  • Lead performance optimization initiatives, implementing sophisticated monitoring strategies and leveraging advanced analytics to ensure exceptional system reliability and performance at scale.
  • Design and implement comprehensive automation frameworks for infrastructure provisioning, configuration management, and deployment processes, focusing on efficiency and scalability.
  • Serve as the technical authority for incident management, establishing robust incident response frameworks, leading cross-functional response efforts, and driving systematic improvements through detailed post-incident analysis.
  • Architect and implement enterprise-wide incident response strategies, including sophisticated playbooks and multi-tier escalation procedures aligned with business continuity requirements.
  • Partner with engineering leadership to drive reliability improvements through advanced automated testing frameworks, fault-tolerant architectures, and comprehensive disaster recovery strategies.
  • Provide technical mentorship and leadership to the broader engineering organization while contributing to the strategic direction of the SRE practice.