Staff Site Reliability Engineer

at SentinelOne

Praha, Czech Republic

Start Date: Immediate
Expiry Date: 06 Aug, 2024
Salary: Not Specified
Posted On: 08 May, 2024
Experience: N/A
Skills: Operations, Open Source, Architecture, Deployment Strategies, Redis, Soft Skills, Air, Nomad, Infrastructure, Kubernetes, Legacy Systems, Programming Languages, Production Experience, Kafka, Cloud
Telecommute: No
Sponsor Visa: No

Description:

ABOUT US:

SentinelOne is defining the future of cybersecurity through our XDR platform that automatically prevents, detects, and responds to threats in real-time. Singularity XDR ingests data and leverages our patented AI models to deliver autonomous protection. With SentinelOne, organizations gain full transparency into everything happening across the network at machine speed – to defeat every attack, at every stage of the threat lifecycle.
We are a values-driven team where names are known, results are rewarded, and friendships are formed. Trust, accountability, relentlessness, ingenuity, and OneSentinel define the pillars of our collaborative and unified global culture. We’re looking for people who will drive team success and collaboration across SentinelOne. If you’re enthusiastic about innovative approaches to problem-solving, we would love to speak with you about joining our team!

WHAT ARE WE LOOKING FOR?

We are looking for a Staff SRE with extensive prior operations experience on a SaaS product who can drive deployment re-architecture with a focus on self-service and automation: someone who has delivered SaaS products on multi-cloud, on-prem and air-gapped environments, driven continuous delivery of software, run incident post-mortems, provided feedback on engineering architecture decisions and automated repetitive operational tasks.
You will join a like-minded team of SREs who help run our operations smoothly at scale by building a platform on which S1’s services can run. If the thought of running a large-scale cybersecurity platform on various cloud providers, on-prem and air-gapped environments excites you, you’ve found the right place!
As a team we value good written communication skills, data-driven decisions and a keen eye for continuous improvement. You’ll help simplify, have a passion for new ideas and know how to execute iteratively towards the final goal. We value candor and collaboration.

WHAT EXPERIENCE & SKILLS SHOULD YOU BRING?

  • Multiple years of experience running site reliability for SaaS products and operations at large scale, plus extensive experience leading the design and architecture of infrastructure (cloud and on-prem combined)
  • Multi-cloud experience, with deep expertise in at least one of AWS, GCP or Azure
  • Extensive production experience with orchestration systems like Kubernetes, Nomad or Mesos (we are a Kubernetes shop)
  • Experience with Rancher, Platform9 or other managed k8s providers is desired
  • Familiarity with on-prem and air-gapped deployments on top of k8s
  • Demonstrated experience with Kafka and Redis
  • Familiarity with IaC and its tools (Terraform or Pulumi)
  • Familiarity with CI and practical delivery using any of the major tools; familiarity with deployment strategies like blue-green, rolling and canary deploys, and with best practices around deployment automation (with tools like Shipit or Spinnaker), is desired
  • Demonstrated proficiency in at least one mainstream language (Python, Go, Ruby, etc.)
  • Familiarity with SecOps & Compliance processes and their touch points with SRE is desired
  • Polyglot experience with other SRE tools – we integrate with more tools every day
  • Keeping a pulse on the latest SRE trends and open source
  • Prior product building experience

APART FROM THE ABOVE TECHNICAL SKILLS, THE FOLLOWING SOFT SKILLS ARE REQUIRED:

  • Curiosity, fast learning, pursuit of improvement, great communication
  • Ability to work in a diverse and distributed team
  • A self-starter who is passionate about and motivated by new technologies and has empathy for legacy systems
  • A quick learner who can navigate unfamiliar programming languages, systems and processes

Responsibilities:

The SRE organization’s mission at SentinelOne (S1) is to keep our uptime promise to our customers by ensuring we meet our SLOs/SLAs, to help our engineering teams ship software to our customers fast and with quality, and to ensure our customers are successful.

  • In this job as Staff SRE / Tech Lead, you will join the ‘Core SRE’ team at S1 and have the opportunity to drive outcomes that improve the reliability, stability and cost efficiency of S1’s ‘Singularity Platform’, our largest customer-facing service, which has more than 11,000 B2B/B2G customers deployed across more than 5 regions and 2 cloud service providers.
  • Upcoming big projects you could work on include Monitoring and Observability Uplift, Logging Pipeline modernization and more!

Your tools: Git, ArgoCD, Jenkins, Ansible, Kubernetes, Docker, Kafka, AWS, GCP, Terraform


REQUIREMENT SUMMARY

Min: N/A, Max: 5.0 year(s)

Information Technology/IT

IT Software - System Programming

Software Engineering

Graduate

Proficient

1

Praha, Czech