Site Reliability Engineering Manager - General Motors Insurance at GM Financial
Arlington, Texas, USA - Full Time


Start Date: Immediate
Expiry Date: 04 Dec, 25
Salary: USD 213,000
Posted On: 04 Sep, 25
Experience: 5 year(s) or above
Remote Job: Yes
Telecommute: Yes
Sponsor Visa: No
Skills: OpenShift, Python, Perl, Microsoft Azure, Ruby, Pipelines, Training, Bash
Industry: Information Technology/IT

Description

WHY GENERAL MOTORS INSURANCE?

At General Motors Insurance, we are building an Insurtech business that will reinvent auto insurance. We are fully owned and backed by auto industry leaders, General Motors and GM Financial. This is a truly unique opportunity to join at the foundational stage of a start-up leading the transformation of the auto insurance experience.
GM has the largest connected vehicle fleet worldwide. In the US alone, there are currently 9M+ connected GM vehicles on the road and that number is projected to triple in the next 10 years. More than that, the OnStar system currently has access to over 900 data points from the vehicle. This surge in information about vehicles and how they are driven will revolutionize auto insurance. This disruption is backed by the bold GM vision of zero crashes, zero emissions and zero congestion. We are serious about the safety and financial security of our customers. If you are passionate about driving innovation and delivering results in a fast-paced, value-focused environment, General Motors Insurance is looking for you.
Position open until filled.

EXPERIENCE AND EDUCATION

  • 5 – 7 years of hands-on experience with C# and JavaScript/TypeScript in production environments required
  • 5 – 7 years of hands-on experience in cloud technologies with Microsoft Azure required
  • 3 – 5 years of hands-on scripting experience with Bash, Perl, Ruby or Python
  • 3 – 5 years of experience implementing IaC (Terraform, ARM, Bicep) preferred
  • 3 – 5 years of experience with Docker Datacenter
  • 3 – 5 years of hands-on administration experience on Machine Learning platforms
  • 3 – 5 years of experience with Mesos, AKS/Kubernetes, OpenShift, Deis or a similar container or platform-as-a-service orchestrator
  • 3 – 5 years of experience with CI/CD tools and pipelines (Azure DevOps, GitHub Actions)
  • Bachelor’s Degree in related field or equivalent experience required
  • Master’s Degree in related field preferred

What We Offer: Generous benefits package available on day one, including 401(k) matching, bonding leave for new parents (12 weeks, 100% paid), tuition assistance, training, GM employee auto discount, community service pay and nine company holidays.

Our Culture: Our team members define and shape our culture: an environment that welcomes innovative ideas, fosters integrity, and creates a sense of community and belonging. Here we do more than work; we thrive.

Compensation: Competitive salary and bonus eligibility.

Work Life Balance: Remote work environment.

The base salary range for this role is: USD $115,000 to $213,000

At GM Financial, we strive for equity in all aspects of our business, including pay equity. This is the GM Financial pay range for this job and role level. The exact salary and compensation will vary based on factors like knowledge, skills, experience and education.

This role is eligible to participate in performance-based incentive plans. Full-time employees are eligible to participate in health benefits on day one of employment.

Responsibilities

ABOUT THE ROLE:

The Site Reliability Engineering Manager provides both technical leadership and strategic direction for building and maintaining highly reliable, scalable, and performant software systems that support critical business products. This role blends hands-on technical capability with people leadership—driving SLO adoption, leading incident management, and partnering closely with product, architecture, and development teams to ensure business continuity and availability.

JOB DUTIES

  • Lead and mentor a team of Site Reliability Engineers, fostering a culture of reliability, learning, and continuous improvement.
  • Stay hands-on when needed for coding, automating, debugging, and contributing to reliability tools and pipelines.
  • Define and manage Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets, ensuring alignment with business goals.
  • Own and maintain the PagerDuty implementation, including supporting teams with on-call scheduling, escalation flows, and incident response processes.
  • Empower and guide engineers to serve as Incident Commanders during critical events, fostering a ‘you build it, you run it’ culture; support them through post-incident reviews and Root Cause Analyses (RCAs) to drive preventive measures and shared learning.
  • Collaborate with product and development teams early in the lifecycle to design for operability, scalability, and maintainability.
  • Implement observability solutions using Azure Monitor, Application Insights, and Log Analytics, and integrate CDN/WAF telemetry (Akamai, Azure Front Door; AWS CloudFront as applicable).
  • Reduce toil by automating operational tasks using scripting languages (PowerShell, Python, JavaScript/TypeScript).
  • Participate in an on-call rotation to support troubleshooting and communication efforts outside of normal business hours.