Senior Site Reliability Engineer

at Supermetrics

Edmonton, AB, Canada

  • Start Date: Immediate
  • Expiry Date: 17 Dec, 2024
  • Salary: Not Specified
  • Posted On: 18 Sep, 2024
  • Experience: 4 year(s) or above
  • Skills: Redis, PostgreSQL, MySQL, Writing, Nginx, PHP, Contour, Reliability Engineering, Queues, Kubernetes, Scripting Languages, Communication Skills, AWS, Documentation, Python, Bash
  • Telecommute: No
  • Sponsor Visa: No

Description:

We’re looking for a Senior Site Reliability Engineer to join our Infrastructure team in Canada!
This is an individual contributor role, and your contributions will ensure our platform is scalable, reliable, and easy to use.

REQUIREMENTS:

  • 4+ years of experience in Site Reliability Engineering, Platform Engineering, or related roles
  • Strong understanding of containers and experience operating Kubernetes clusters at scale.
  • Proficient in database concepts with hands-on experience in both relational and NoSQL databases.
  • In-depth knowledge of Linux systems and Terraform.
  • In-depth experience and understanding of AWS and/or GCP
  • Solid understanding of modern observability practices and tools
  • Team player spirit
  • Willing to take on-call rotations during non-business hours
  • Good communication skills, particularly in writing (documentation as well as clear, well-described PRs)
  • Strong problem-solving skills with a passion for the tools, technologies and problems in this space
  • Automation mindset with the ability to reduce toil by codifying repetitive tasks using scripting languages such as Python or Bash

Responsibilities:

  • Raise the team’s bar in Kubernetes expertise by mentoring, guiding, and supporting your direct colleagues and other members of our Engineering organization as they work with managed Kubernetes clusters across providers
  • Operate the platform that enables our SaaS products to be used by thousands of businesses from around the world, defining SLAs and SLOs and driving the automation that will ensure we meet them
  • In a nutshell, you will use your expertise in containers, Kubernetes, databases, and automation to streamline our operations and improve our infrastructure.

Your day-to-day work and responsibilities will include:

  • Write Terraform configuration and modules that bootstrap a Kubernetes cluster, or review PRs with contributions from other members, making sure that our modules are truly reusable and well-defined, improving how we test and release them.
  • Write (in Go, for example), maintain, and improve our tooling, ensuring it makes the platform easier for engineering teams to use.
  • Develop and maintain Helm charts for internal deployments and third-party software.
  • Respond to an incident in our production environment.
  • Support our pre-sales team and help them answer potential customers’ questions about our architecture and how we guarantee data security, consistency, and uptime.
  • Review an architecture change involving a new database and take part in the meetings discussing the pros and cons of such an approach.
  • Rewrite a GitHub Action to improve how we deploy to Kubernetes using GitOps.
  • Troubleshoot and resolve technical issues as they arise.
  • Participate in our on-call rotations to provide support, respond to incidents, or handle internal users’ questions.


REQUIREMENT SUMMARY

Min: 4.0, Max: 9.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

Graduate

Proficient

1

Edmonton, AB, Canada