Senior Site Reliability Engineer at ZipLiens

Franklin, Tennessee, United States -

Full Time

Start Date

Immediate

Expiry Date

01 Jun, 26

Salary

153000.0

Posted On

03 Mar, 26

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Site Reliability Engineering, Scalability, Security, Infrastructure Reliability, Automation, Deployment Practices, Observability, Incident Investigations, Root Cause Analysis, CI/CD Pipelines, Infrastructure Provisioning, Access Controls, Secrets Management, Disaster Readiness, Python, Go

Industry

Consumer Services

Description

Zipliens is looking for a Senior Site Reliability Engineer to join our engineering team and support leading the reliability, scalability, and security of the systems that power our legal technology platform. Today, the Zipliens engineering team supports internal operational workflows, automated communications, and client-facing tools that provide greater transparency throughout the lien resolution process. As we continue to scale and invest in the next generation of our platform, this role will play a critical part in strengthening infrastructure reliability, advancing automation and deployment practices, and shaping our observability and operational standards. You will partner closely with software engineers and engineering leadership to drive stable, high-quality releases, reduce operational risk, and ensure our systems remain resilient, performant, and secure as the business grows. Responsibilities: Infrastructure Reliability & Availability Maintain and improve the availability, performance, and reliability of production and non-production environments. Proactively identify scalability and capacity risks and recommend mitigation strategies as platform demands grow. Enhance system observability through monitoring, logging, and alerting, and help define reliability metrics as systems scale. Lead incident investigations and drive root cause analysis, ensuring systemic improvements are implemented. Shape and evolve reliability standards and practices while remaining directly engaged in hands-on system improvements. Automation, CI/CD & Operational Tooling Build, own, and continuously improve CI/CD pipelines to support reliable, repeatable deployments. Drive automation of infrastructure provisioning, configuration, and operational workflows to reduce manual effort and operational risk. Develop and implement tooling that improves system performance, observability, and deployment confidence. Partner with software engineers to standardize and improve deployment practices, release processes, and operational readiness across services. Security, Access & Compliance Establish and enforce best practices for access controls, secrets management, and system hardening. Ensure backup, recovery, and disaster-readiness strategies are tested and reliable. Partner with engineering leadership on security reviews and compliance-related initiatives. Proactively identify and mitigate infrastructure and operational risks. Qualifications: 7+ years of experience in Site Reliability Engineering, DevOps, Infrastructure Engineering, or a related role. Strong troubleshooting skills with experience leading incident response efforts and driving systemic remediation improvements in production environments. Strong experience scaling and operating cloud-based production systems (AWS, GCP, or Azure). Experience designing and maintaining CI/CD pipelines and deployment automation. Experience with monitoring, logging, and alerting systems for reliability and performance. Strong understanding of cloud security fundamentals, including access controls, secrets management, and backup strategies. Proficiency in at least one scripting or programming language (e.g., Python, Go, Bash). Working knowledge of infrastructure-as-code tools (e.g., Terraform, CloudFormation) and containerization/orchestration technologies (Docker, Kubernetes). Strong written and verbal communication skills and experience collaborating with cross-functional teams. Ability to work on-site at least three days per week (approximately 60%) in our Franklin, TN office. Private Health Care Plan (Medical, Dental & Vision) Company HSA contributions for HDHP participants Flexible Spending Accounts (Health & Dependent Care) Company-Paid Short-Term Disability Coverage Voluntary Long-Term Disability, Life, AD&D, and Supplemental Coverage Options 401(k) Plan with Company Match Paid Time Off (Vacation, Sick Time & Select Holidays) Paid Parental Leave Pay Disclosure: The total base salary range for this role is $113,000 -$153,000 annually, with opportunity for a quarterly discretionary bonus. Final compensation will be determined based on skills and experience. Work Authorization: Applicants must be authorized to work in the United States without current or future visa sponsorship. We are unable to provide or assume visa sponsorship at this time.

Responsibilities

The role involves maintaining and improving the availability, performance, and reliability of production systems while proactively identifying scalability risks and enhancing observability through monitoring and alerting. Responsibilities also include building and improving CI/CD pipelines, automating infrastructure provisioning, and establishing best practices for security, access controls, and disaster readiness.