Site Reliability Engineer at Motive
San Francisco, California, USA -
Full Time


Start Date

Immediate

Expiry Date

13 Dec, 25

Salary

226000.0

Posted On

16 Sep, 25

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Good communication skills

Industry

Information Technology/IT

Description

WHO WE ARE:

Motive empowers the people who run physical operations with tools to make their work safer, more productive, and more profitable. For the first time ever, safety, operations and finance teams can manage their drivers, vehicles, equipment, and fleet related spend in a single system. Combined with industry leading AI, the Motive platform gives you complete visibility and control, and significantly reduces manual workloads by automating and simplifying tasks.
Motive serves more than 100,000 customers – from Fortune 500 enterprises to small businesses – across a wide range of industries, including transportation and logistics, construction, energy, field service, manufacturing, agriculture, food and beverage, retail, and the public sector.
Visit gomotive.com to learn more.

Responsibilities

ABOUT THE ROLE:

As a Staff Site Reliability Engineer on the Platform team, your role will be crucial in helping us design, scale, and manage our growing AWS-backed services for millions of connected IoT devices, mobile, and SaaS users. Your expertise in cloud-native and highly elastic service design and scaling practices is going to ensure our growing services, as well as new products and features operate smoothly and without manual intervention to achieve Motive’s strong 99.99% availability SLOs. Leveraging and advancing our robust and fully-codified infrastructure and Kubernetes environment, paired with AWS components that require thoughtful implementations, and of course advanced troubleshooting with teams, you can be a large part of Motive’s growth to the next million devices and beyond.

WHAT YOU’LL DO:

  • Collaborate with other engineering and product teams to design and build the infrastructure and services required to deliver new features to customers in a cloud-native and event-driven fashion.
  • Leverage and progress our IaC (Terraform) and CM (Helm) code and strategies for advanced scaling and self-service usage by engineering teams.
  • Identify and remove bottlenecks from systems in production throughout AWS services and with our Kubernetes platform.
  • Ensure 99.99% customer-facing uptime.
  • Continuously improve the monitoring and alerting capabilities of our platform, enabling us to be proactive instead of reactive.
Loading...