Staff AI Infrastructure Engineer at SentinelOne
Remote, Oregon, USA -
Full Time


Start Date

Immediate

Expiry Date

01 Nov, 25

Salary

234600.0

Posted On

03 Aug, 25

Experience

7 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Python, Creativity, Kubernetes, Bash, Computer Science, Azure, Code, Jenkins, Aws, Security, Information Technology

Industry

Information Technology/IT

Description

ABOUT US

At SentinelOne, we’re redefining cybersecurity by pushing the limits of what’s possible—leveraging AI-powered, data-driven innovation to stay ahead of tomorrow’s threats.
From building industry-leading products to cultivating an exceptional company culture, our core values guide everything we do. We’re looking for passionate individuals who thrive in collaborative environments and are eager to drive impact. If you’re excited about solving complex challenges in bold, innovative ways, we’d love to connect with you.

WHAT ARE WE LOOKING FOR?

We’re seeking a Staff AI Infrastructure Engineer with deep expertise in building, automating, and managing AI infrastructure at scale. You will be instrumental in designing and maintaining the systems essential for serving and deploying AI models efficiently and securely across diverse cloud environments.

WHAT SKILLS AND EXPERIENCE SHOULD YOU BRING?

We are looking for an experienced infrastructure engineer who has:

  • A degree in Computer Science, Information Technology, or related field, or equivalent practical experience.
  • 7+ years of experience managing scalable, secure, and resilient infrastructure for AI and machine learning applications.
  • Deep proficiency with infrastructure-as-code tools like Helm, Terraform and ArgoCD.
  • Extensive hands-on experience with Kubernetes for deploying containerized workloads.
  • Demonstrated experience with major cloud platforms (AWS, GCP, Azure), specifically with services related to AI model hosting (e.g., Azure OpenAI).
  • Experience implementing and managing CI/CD pipelines (GitHub Actions, Jenkins).
  • Familiarity with compliance frameworks, particularly FedRAMP, and security best practices.
  • Strong scripting and automation skills using Python, Bash, or similar languages.
  • Excellent problem-solving skills, creativity, and self-driven motivation.

Exceptional candidates will also bring expertise in:

  • Previous experience as a Site Reliability Engineer (SRE), particularly in AI or ML contexts.
  • Monitoring and logging tools (Prometheus, Grafana, Datadog, Jaeger).
  • Networking concepts and security best practices within cloud infrastructure.
  • Professional certifications in Kubernetes or cloud platforms (AWS, Azure, GCP).
Responsibilities

As a Staff AI Infrastructure Engineer, you’ll join our globally distributed team to:

  • Architect, build, and maintain scalable infrastructure to host and serve AI products and models reliably.
  • Automate infrastructure deployment and management using Helm, ArgoCD and Terraform.
  • Manage and optimize Kubernetes clusters to support high-performance AI workloads.
  • Implement and manage CI/CD pipelines utilizing GitHub Actions and Jenkins.
  • Ensure infrastructure compliance with security standards including FedRAMP and related guidelines.
  • Collaborate closely with AI engineering, product teams, and DevOps to meet infrastructure requirements.
  • Monitor infrastructure health and performance, implementing optimizations proactively.
  • Drive infrastructure best practices and mentor team members to foster technical excellence.
Loading...