Staff Site Reliability Engineer at Honeycombio
Remote, British Columbia, Canada -
Full Time


Start Date

Immediate

Expiry Date

29 Nov, 25

Salary

289200.0

Posted On

30 Aug, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Instrumentation, Aws, Kubernetes, Communication Skills, Reliability Engineering, Data Driven Decision Making

Industry

Information Technology/IT

Description

WHAT WE’RE BUILDING

Honeycomb is a service for the near and present future, defining observability and raising expectations of what developer tools can do! We’re working with well known companies like HelloFresh, Slack, LaunchDarkly, and Vanguard and more across a range of industries. This is an exciting time in our trajectory, we’ve closed Series D funding, scaled past the 200-person mark, and were named to Forbes’ America’s Best Startups of 2022 and 2023!
If you want to see what we’ve been up to, please check out these blog posts and Honeycomb.io press releases.

WHO WE ARE

We come for the impact, and stay for the culture! We’re a talented, opinionated, passionate, fiercely inclusive, and responsible group of bees. We have conviction and we strive to live our values every day. We want our people to do what they truly love amongst a team of highly talented (but humble) peers.

How To Apply:

Incase you would like to apply to this job directly from the source, please click here

Responsibilities
  • Lead technically complex, cross-functional projects to help Honeycomb scale.
  • Help manage and grow our vendor relationships (like with AWS) - including strategic negotiations - and help others do the same.
  • Build organizational trust through transparent communication with engineering leadership and stakeholders.
  • Shape how the SRE team engages with the rest of Honeycomb (program management, embedding, education).
  • Drive technical improvements in AWS, Kubernetes, Helm, and Terraform usage.
  • Contribute to platform strategy and vision.
  • Improve and refine processes to ensure smooth operations and reduce friction for engineering teams.
  • Act and train others as an Incident Commander, and participate in the Platform on-call rotation.
  • Improve the internal observability of our systems through technical projects and enablement.
  • Help the organization navigate tradeoffs between reliability and its other goals and priorities.
  • Optional: act as an external ambassador through blog posts, conference talks, and presentations with support from our DevRel team.
Loading...