Staff Site Reliability Engineer, Infrastructure

at  Kentik

Remote, Oregon, USA -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate05 May, 2025USD 250000 Annual05 Feb, 20255 year(s) or aboveTuning,Kafka,Platforms,ScratchNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

WHO WE ARE

Kentik is the network observability company. Our platform is a must-have for the network front line, whether digital business, corporate IT, or service provider. Network professionals turn to the Kentik Network Observability Cloud to plan, run, and fix any network, relying on our infinite granularity, AI-driven insights, and insanely fast search.
Kentik makes sense of network, cloud, host, and container flow, Internet routing, performance tests, and network metrics. We show network pros what they need to know about their network performance, health, and security to make their business-critical services shine. Networks power the world’s most valuable companies, and those companies trust Kentik. Market leaders like IBM, Box, and Zoom rely on Kentik for network observability. Visit us at kentik.com and follow us at @kentikinc.

WHAT WE DO

Kentik is looking for an experienced software engineer with an operational mindset to join our Infrastructure team as a Staff SRE.
This infra team is in charge of the software stack that powers Kentik - from configuration management and orchestration, API and service fabric, to datastores and data pipelines, developer experience and internal observability. In partnership with our hardware and network operations team, we provide a reliable platform for other engineering teams to build on.
We are an international group of collaborative, experienced developers and operations practitioners, with broad and deep knowledge of networks, systems and applications.

Responsibilities:

WHAT YOU’LL DO

The role is a mix of development and operations. You will be writing code for internal API services and tools, as well as operating these services in production, along with third-party components like envoy or postgres.

  • Own, scale and maintain our core infrastructure, both on bare metal deployments and major public clouds; keep everything healthy and up to date
  • Bring our overall reliability to new levels, streamline and simplify our stack, and contribute to our efforts automating all the things
  • Define and refine our platform offering - reliable, easy to use “paved path” solutions we provide to the rest of the engineering organization
  • Identify needs, spec plans, and deliver value to our internal customer teams on an ongoing basis
  • Participate in our low-noise on-call rotation, and help other teams with their internal monitoring needs and practices
  • Contribute to our nascent efforts to make customer On-Prems a repeatable, scalable product
  • Work with the hardware and network operations team to keep our datacenter humming and our systems provisioned
  • Collaborate with other engineers in a dynamic, fast-paced and very collaborative remote environment

Studies have shown that some candidates tend to apply to jobs only if they meet 100% of the qualifications. We encourage you to apply if you meet most of the criteria - even if you don’t match all of the qualifications, your skills and experience could be valuable in this role!

  • 5+ years of relevant experience
  • An SRE mindset and and the drive to build reliable, easy to operate systems
  • Running, scaling, tuning postgres, kafka or other datastores and third-party applications
  • Experience building apps and services (i.e. Go or nodejs, GRPC, postgres or mysql)
  • Shipped projects from scratch and maintained them over a period of time
  • Clear communication, both synchronous and via technical plans
  • A practice of instrumenting your services and setting up the right monitoring and alerts
  • Passion for building and providing amazing tools and platforms to other engineer

The compensation range for this position is: $185,000 - $250,000. This range reflects the low and high end of the U.S. compensation range Kentik reasonably and generally expects to pay the hired candidate in this role. The actual compensation offered may be lower or higher than the stated range depending on various factors, including but not limited to:

  • Experience with the skill sets required for success
  • Demonstrated competencies and potential
  • A geographic market-based approac


REQUIREMENT SUMMARY

Min:5.0Max:10.0 year(s)

Information Technology/IT

IT Software - Network Administration / Security

Software Engineering

Graduate

Proficient

1

Remote, USA