Start Date
Immediate
Expiry Date
04 Aug, 25
Salary
0.0
Posted On
05 May, 25
Experience
0 year(s) or above
Remote Job
Yes
Telecommute
Yes
Sponsor Visa
No
Skills
Circuit Breakers, Drive Change, Switches
Industry
Information Technology/IT
We’re on the lookout for a passionate and exceptional reliability engineer to join our dynamic team and help us transform the homecare industry. Rally with us in creating meaningful experiences for our hyper-growth healthcare startup.
KEY REQUIREMENTS
ABOUT THIS ROLE:
Enjoy ownership and responsibility, with a bias towards identifying problems and proposing and implementing solutions.
Strong experience with Ruby on Rails, especially in production SaaS systems.
Deep knowledge of background job processing (Sidekiq or similar), caching, and distributed systems.
Proven experience improving CI/CD pipelines, we currently use CircleCI but don’t discard a migration.
Comfortable designing and improving observability stacks (New Relic, Datadog, Honeycomb, etc.).
Experience building resilient systems — retries, back-offs, queueing, circuit breakers, graceful degradation, kill switches, isolation of workloads, etc.
Strong focus on developer ergonomics and reliability culture.
Bias toward action and delivering tools that improve system behavior and developer happiness.
WHAT YOU’LL DO
Own and improve our CI/CD pipelines (CircleCI), reducing deploy times and failure rates.
Build reliable retry/back-off mechanisms for critical user workflows.
Design and implement observability tooling, including synthetic checks, smoke tests, etc.
Help architect and implement failover and fallback mechanisms for critical vendors and workflows.
Work with Support to build debug tooling and dashboards that empower non-engineers.
Collaborate with engineering to define and template runbooks, kill switches, and disaster mitigation patterns.
Champion performance tuning.