Site Reliability Engineer - k8s ( SRE ) - Remote

at  WebstaurantStore

Lititz, PA 17543, USA -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate27 May, 2024USD 90000 Annual01 Mar, 2024N/AChat,Access,WebcamNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

WebstaurantStore is looking for Site Reliability Engineers. We are the internet’s largest restaurant supplier, and we are growing.

WHO WE ARE LOOKING FOR

We are looking for driven and motivated candidates with a variety of skills and experience. We require that SRE candidates possess an aptitude for solving technical problems, a willingness to learn, a desire to grow, and a desire to work with a team. This position requires prior experience managing on-premises Kubernetes.
Our work is broad enough that you will never master every tool. What we hope you will master instead is the debugging skills required to support tools for which you are not an expert. If you are familiar with some of our tools and want to learn the rest, please apply!

Our SREs typically start their careers as developers or as systems engineers. Developers that want a wider variety of work and enjoy working with infrastructure make a good fit. Likewise, systems engineers that have a desire to improve infrastructure and to reduce repetitive tasks also make a good fit.

  • Experience deploying and managing on-premises Kubernetes clusters.
  • Experience deploying Kubernetes resources with a CI/CD platform such as Argo-CD, Gitlab-CD, Flux, etc.
  • Experience using helm and kustomize to manage and template Kubernetes deployments.
  • Experience using Kubernetes secrets management platforms such as sealed-secrets and Hashicorp Vault.
  • Experience troubleshooting Kubernetes resources such as pods, nodes, deployments, etc.
  • Experience managing Kubernetes persistent storage using Rook / Ceph, iSCSI, NFS, etc.
  • Experience configuring Kubernetes ingress controllers such as HAProxy, NGinx, Traefik, etc.
  • Experience with a Service Mesh such as Istio or Consul is a plus, but not required.
  • Comfortable working with a team to accomplish technical feats.
  • Demonstrated ability to learn new technologies.
  • Attention to detail.
  • Experience with one or more programming/scripting languages. We primarily use Python and Golang, but knowledge of these languages is not required.
  • Experience with Linux in a work environment.
  • Experience troubleshooting across the entire stack: network, server, operating system, and application.
  • Some understanding of configuration management. We use Ansible and Terraform. Experience with any configuration management tool is a plus but not required.
  • Development and/or IT Operations skills and experience relevant to transitioning to/or continuing in an SRE role.
  • Experience with version control. We use git, if you have never used it, we can train you.

WHAT WE DO

SREs work to implement, support, and improve the systems that WebstaurantStore relies on to service our customers and grow our company.
We use automation and observability to ensure service uptime, performance, and growth. SREs build out new infrastructure and capabilities, maintain existing infrastructure and help departments to leverage the shared services we build and maintain.
We value experimenting with novel approaches and new technologies as we are always looking to improve our capabilities.
We value sound design principles and encourage review and discussion among the team to ensure that problems and projects are examined from all angles.
Reliable systems are key to keeping our customers satisfied. Reliable systems enable our fellow employees to do their work. Reliable systems allow our SREs to enjoy their nights and weekends. We focus a lot of effort on keeping our systems reliable.
SREs participate in an on-call rotation. The effort we put into reliability keeps the on-call volume low.

Responsibilities:

  • Experience deploying and managing on-premises Kubernetes clusters.
  • Experience deploying Kubernetes resources with a CI/CD platform such as Argo-CD, Gitlab-CD, Flux, etc.
  • Experience using helm and kustomize to manage and template Kubernetes deployments.
  • Experience using Kubernetes secrets management platforms such as sealed-secrets and Hashicorp Vault.
  • Experience troubleshooting Kubernetes resources such as pods, nodes, deployments, etc.
  • Experience managing Kubernetes persistent storage using Rook / Ceph, iSCSI, NFS, etc.
  • Experience configuring Kubernetes ingress controllers such as HAProxy, NGinx, Traefik, etc.
  • Experience with a Service Mesh such as Istio or Consul is a plus, but not required.
  • Comfortable working with a team to accomplish technical feats.
  • Demonstrated ability to learn new technologies.
  • Attention to detail.
  • Experience with one or more programming/scripting languages. We primarily use Python and Golang, but knowledge of these languages is not required.
  • Experience with Linux in a work environment.
  • Experience troubleshooting across the entire stack: network, server, operating system, and application.
  • Some understanding of configuration management. We use Ansible and Terraform. Experience with any configuration management tool is a plus but not required.
  • Development and/or IT Operations skills and experience relevant to transitioning to/or continuing in an SRE role.
  • Experience with version control. We use git, if you have never used it, we can train you


REQUIREMENT SUMMARY

Min:N/AMax:5.0 year(s)

Information Technology/IT

IT Software - Application Programming / Maintenance

Software Engineering

Graduate

Proficient

1

Lititz, PA 17543, USA