Senior Data Engineer, Commercial Software

at  General Motors

Mountain View, California, USA -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate02 Dec, 2024USD 23200 Annual06 Sep, 20243 year(s) or aboveComputer Science,Data Science,Accident Benefits,Data Transformation,Kubernetes,Life Insurance,It,State Laws,Flexible Spending Accounts,Big Data,Software Craftsmanship,HealthNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

JOB DESCRIPTION

Hybrid: This role is categorized as hybrid. This means the successful candidate is expected to report to [ insert specific location where employee is expected to report] three times per week, at minimum [or other frequency dictated by the business if more than 3 days ]

OVERVIEW:

The Commercial Software division at GM is seeking talented individuals across various software engineering roles to join us. Our mission? Driving forward all digital solutions for commercial fleet owners and drivers, catering to businesses of all sizes, from small and medium businesses to large enterprises.
In today’s dynamic landscape, the demand for intelligent, safe, and eco-friendly transportation solutions is skyrocketing. GM is at the forefront of this transformative journey, championing connected software-defined vehicles that redefine the driving experience. At the heart of this paradigm shift lies software – enabling seamless communication, bolstering security, facilitating updates, processing data, and ultimately, delivering an unparalleled user experience. These capabilities extend beyond consumer benefits, holding even greater significance for business owners.
Recognizing this pivotal moment, GM has recruited a significant number of Silicon Valley engineering, product, and leaders to lead the GM Software & Services organization. As a leading OEM with a vast fleet of vehicles spanning the globe, we’re uniquely positioned to leverage our comprehensive control over in-vehicle and cloud software. Our aim? Seamlessly integrating solutions such as fleet management, energy optimization, transportation logistics, safety systems, and more, all geared towards enhancing efficiency and sustainability.
To realize our vision, we’re actively expanding our software teams in Silicon Valley. We seek individuals who can blend a passion for technology and sustainability with curiosity, rigorous thinking and a strong customer centric approach. This is an exceptional opportunity where you can leverage the scale of GM and make a significant impact and find fulfillment in crafting innovative software solutions. We invite you to join us on this exciting journey toward a better future.

QUALIFICATIONS

Required:

  • Bachelor’s degree (Master’s preferred) or equivalent experience in computer science, data science, engineering, or related quantitative field
  • 5+ years of industry experience developing, implementing, and maintaining solutions for Big Data or data warehousing systems
  • 3+ years of industry experience working in a cloud environment (Azure preferred)
  • 3+ years experience working with SQL query authoring for automated data transformation (familiarity with dbt preferred, but not required).
  • 2+ years of experience developing streaming data processing pipelines (use of Spark/pySpark preferred)
  • Basic understanding of machine learning/statistical learning principles
  • Experience implementing and maintaining data workflow orchestration and integration tools (e.g. Airflow/Astro, Prefect, dbt cloud, etc.)
  • Understanding of and experience with application of data quality tools integrated with CI/CD automation frameworks in functional deployment environments (e.g., Github Actions/Azure DevOps pipelines).
  • Familiarity with data quality testing frameworks (Great Expectations, Deequ)
  • Self-driven with an interest in on-the-job learning.

Preferred:

  • Familiarity with enterprise warehouse data modeling techniques (e.g., Kimball)
  • Experience integrating simulation systems with distributed, data-intensive processing or analytics applications

Desired:

  • Working familiarity with terraform
  • Domain knowledge in automotive systems
  • Engagement with modern data stack community (open source and commercial)
  • High degree of attention to software craftsmanship and professionalism
  • Experience working with containerization technologies and orchestration platforms (specifically Docker and Kubernetes)

Compensation: The compensation information is a good faith estimate only. It is based on what a successful applicant might be paid in accordance with applicable state laws. The compensation may not be representative for positions located outside of New York, Colorado, California, or Washington.

  • The salary range for this role is $ 152,100.00 - 232,00.00 . The actual base salary a successful candidate will be offered within this range will vary based on factors relevant to the position.Bonus Potential: An incentive pay program offers payouts based on company performance, job level, and individual performance.

Benefits:

  • Benefits: GM offers a variety of health and wellbeing benefit programs. Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more.

Responsibilities:

ROLE SUMMARY:

As a Senior Data Engineer, you will be responsible for designing, developing, and maintaining efficient data workflows and associated cloud infrastructure that support GM Commercial Software’s core analytic products and services. A key focus of this role will be on developing and optimizing streaming data pipelines, ideally utilizing Spark or similar technologies. You will work closely with our data scientists, DevOps, and software engineers to automate pipelines that process streaming vehicle telemetry data, ensuring real-time data processing and transformation. Additionally, you will transform data within our data lakehouse into deployable data models that power our automated fleet insights, visualizations, and emerging machine learning, optimization, and AI applications. The ideal candidate will have extensive experience in setting up ELT pipelines to handle massive volumes of spatiotemporal data, designing enterprise data warehouse models, implementing robust data quality tests, and leveraging data pipeline-as-code integration systems. Your expertise in streaming data technologies will be crucial for ensuring the smooth and efficient orchestration of the workflows that power our advanced analytics platform.

RESPONSIBILITIES:

  • Design, implement, and maintain streaming data pipelines using spark or analogous technologies.
  • Design, implement, and maintain ELT pipelines in our cloud-native data lakehouse platform.
  • Work with data scientists to turn exploratory analyses into production data transformation workflows within our multihop data lakehouse architecture.
  • Support enterprise-wide use of BI tools and assist in developing internal and external analytics products (e.g., dashboards).
  • Generate and deploy feature engineering pipelines to power internal and customer-facing machine learning-based products.
  • Maintain data quality testing and monitoring tools.
  • Work with information security, DevOps, and DataOps to maintain data classification, auditing, and access and cost control policies.


REQUIREMENT SUMMARY

Min:3.0Max:5.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

Graduate

Computer science data science engineering or related quantitative field

Proficient

1

Mountain View, CA, USA