Staff Software Engineer, ML Data Platform

at Cruise

Remote, Oregon, USA

Start Date: Immediate
Expiry Date: 01 Sep, 2024
Salary: USD 257500 Annual
Posted On: 01 Jun, 2024
Experience: N/A
Skills: Good communication skills
Telecommute: No
Sponsor Visa: No
Required Visa Status:
Citizen
GC
US Citizen
Student Visa
H1B
CPT
OPT
H4 Spouse of H1B
GC Green Card
Employment Type:
Full Time
Part Time
Permanent
Independent - 1099
Contract – W2
C2H Independent
C2H W2
Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

We’re Cruise, a self-driving service designed for the cities we love.
We’re building the world’s most advanced self-driving vehicles to safely connect people to the places, things, and experiences they care about. We believe self-driving vehicles will help save lives, reshape cities, give back time in transit, and restore freedom of movement for many.
In our cars, you’re free to be yourself. It’s the same here at Cruise. We’re creating a culture that values the experiences and contributions of all of the unique individuals who collectively make up Cruise, so that every employee can do their best work.
Cruise is committed to building a diverse, equitable, and inclusive environment, both in our workplace and in our products. If you are looking to play a part in making a positive impact in the world by advancing the revolutionary work of self-driving cars, come join us. Even if you might not meet every requirement, we strongly encourage you to apply. You might just be the right candidate for us.
(Additional locations: San Francisco CA, Sunnyvale CA)
As an engineer on this team, you will be responsible for building and supporting a petabyte-scale data platform in the cloud and providing powerful foundations for Cruise’s ML Data Platform tools, frameworks, and services. Responsibilities include ensuring scalable, transparent, and reliable data ingestion and management; developing fast, robust, and spike-resistant data consumption, data mining, and processing tools for the entire company; and developing orchestration for large-scale post-processing and computational pipelines.

What you’ll be doing:

  • Lead us in the development, optimization and productionization of the next generation data processing platform using Beam and Spark in the cloud.
  • Build self-serve capabilities to help customers adopt the next generation data processing platform.
  • Use the latest cloud technologies to own, design, implement, and test scalable distributed data systems in the cloud. Champion engineering excellence by continuously improving systems and processes.
  • Own technical projects from start to finish, contribute to the team’s product roadmap, and be responsible for major technical decisions and tradeoffs. Effectively participate in the team’s planning, code reviews and design discussions.
  • Consider the effects of projects across multiple teams and proactively manage conflicts. Work together with partner teams and orgs to achieve cross-organizational goals and satisfy broad requirements.
  • Conduct technical interviews with well-calibrated standards and play an essential role in recruiting activities. Effectively onboard and mentor junior engineers and/or interns.

What you must have:

  • Experience building a data processing system using Beam / Spark and their ecosystems from the ground up.
  • Experience optimizing those data processing clusters for cost efficiency and performance
  • Experience building serving systems capable of delivering data at high throughput, low latency and high QPS in a cost-efficient and spike-resilient manner.
  • Experience building full ML model lifecycle solutions - from feature engineering to training, validation, deployment and monitoring.
  • Experience building scalable infrastructure on the cloud with Python or Java/Scala (or similar)
  • 10+ years working with big data
  • BS, MS or Ph.D. in Computer Science, Electrical Engineering, Mathematics, Physics, or another relevant field; or equivalent real-world experience
  • Passionate about self-driving technology and its potential impact on the world
  • Attention to detail and a passion for seeking truth
  • A track record of efficiently solving complex problems
  • Startup mentality - openness to dealing with unknown unknowns and wearing many hats

Bonus points!

  • Demonstrable expertise in building end-to-end data ingestion, processing and serving systems at petabyte scale from the ground up
  • Proficiency in writing SQL queries for analytic purposes
  • Relevant publications

The salary range for this position is $175,100 - $257,500. Compensation will vary depending on location, job-related knowledge, skills, and experience. You may also be offered a bonus and benefits. These ranges are subject to change.


REQUIREMENT SUMMARY

Min: N/A, Max: 5.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

Graduate

Proficient

1

Remote, USA