DevOps Engineer at INTUITIVE MACHINES LLC
Houston, TX 77059, USA -
Full Time


Start Date

Immediate

Expiry Date

03 Dec, 25

Salary

0.0

Posted On

03 Sep, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Storage, Firewalls, Ecr, Virtualization, Pipelines, Jira, C++, Network Services, Nfs, Documentation, Diagrams, Ssh, Smb, Consideration, Infiniband, Storage Systems, Color, Incident Handling, Aws, Iscsi, C, Switching, Ecs, Ansible, Rdma, Platforms, Devops, Confluence, Routing

Industry

Information Technology/IT

Description

REQUIRED QUALIFICATIONS

  • Five or more years in DevOps, SRE, or infrastructure engineering with significant Layer 2 and Layer 3 networking ownership.
  • Deep experience with switching, routing, firewalls, and VPNs, and next-generation firewall platforms.
  • Strong Linux administration and core network services, secure SSH, systemd, and storage using NFS, iSCSI, or SMB.
  • Proficiency with Ansible and Terraform for system and network automation.
  • Hands-on GitLab CI/CD across runners, pipelines, and artifact or container registries.
  • Python scripting with working knowledge of C and C++ for build and integration workflows.
  • Familiarity with AWS networking including VPCs, routing, and security groups and with hybrid architectures.
  • Clear communication, strong collaboration, and steady incident handling.
  • Must be willing to work fully on-premise.

PREFERRED EXPERIENCE

  • Operating HPC or simulation environments using Slurm and high-bandwidth low-latency networking such as 10, 25, 40, or 100 GbE and RDMA or InfiniBand.
  • Network observability at scale using Prometheus, Grafana, flow logs, syslog pipelines, and topology mapping.
  • AWS for AI or LLM workloads including ECR, ECS, EKS, or batch patterns with model orchestration and inference pipelines.
  • Virtualization and on-prem platforms such as Hyper-V and enterprise storage systems.
  • Documentation and change management using Jira and Confluence with clear runbooks and diagrams.
    US EEO Statement
    Intuitive Machines is an Equal Opportunity employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected veteran status, age, or any other characteristic protected by law

How To Apply:

Incase you would like to apply to this job directly from the source, please click here

Responsibilities

ROLE OVERVIEW

We’re hiring a hands-on DevOps Engineer with strong on-prem networking expertise. You’ll own infrastructure automation and CI/CD, operate and harden our on-site networks and Linux systems, manage an existing Monte Carlo HPC cluster, and support a small LLM workload in AWS.

WHAT YOU’LL DO

  • Design, implement, and secure on-prem networks: VLANs, routing with OSPF and BGP, VPNs with IPsec and SSL, firewall policies, NAT, high availability, QoS, segmentation, and zero trust.
  • Automate network and systems management using Ansible, Terraform, and Python for configuration templating, drift detection, inventory, and compliance.
  • Operate Linux servers and core services including DNS, DHCP, NTP, PKI, LDAP, and SSO with patching, hardening, and capacity planning.
  • Manage and optimize the existing Monte Carlo HPC cluster based on Slurm and Docker, including container networking, storage such as NFS and iSCSI, and high-throughput data paths.
  • Own GitLab CI/CD including runners, pipelines, registries, and release automation.
  • Build observability and incident response with Prometheus, Grafana, alerting, syslog or SIEM, NetFlow and SNMP; define SLOs and runbooks.
  • Integrate on-prem with AWS through site-to-site VPN or Direct Connect and support AI and LLM workloads.
  • Collaborate with developers to advance infrastructure as code and reliable release practices.
  • Work fully on-site with occasional maintenance windows and light datacenter tasks such as rack and stack, cabling, and labeling.
Loading...