Senior Software Engineer, AI Infra at ROBOFORCE INC
Milpitas, California, United States -
Full Time


Start Date

Immediate

Expiry Date

11 Mar, 26

Salary

0.0

Posted On

11 Dec, 25

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Yes

Skills

C++, Python, ML Frameworks, PyTorch, JAX, Cloud Provider, Kubernetes, Containerization, GPU Provisioning, SQL, NoSQL, Postgres, MySQL, BigQuery, ElasticSearch, Redis

Industry

Robotics Engineering

Description
Why RoboForce RoboForce is an AI robotics company building Physical AI and Robo-Labor system for dull, dirty, and dangerous work. Our flagship robot, TITAN, is a super humanoid robot designed for industrial environments. We are based in Milpitas, CA and require 5 days/week in-office collaboration. We are looking for a Senior Software Engineer to build scalable AI infrastructure. As a Senior Software Engineer, you will architect and implement the core training infrastructure that enables large-scale model training, validation workflows, and production deployment for physical robots. You will work across cloud systems, GPU clusters, data pipelines, and robotics runtime environments to create a high-performance platform for Physical AI. Responsibilities: Build and maintain distributed training pipelines leveraging PyTorch, JAX, or equivalent frameworks across multi-GPU and multi-node clusters. Build tools for data collection, training, and deployment of neural networks on RoboForce robots. Architect robust cloud-native and on-prem GPU infrastructure across multi-cloud environments. Build high-throughput data workflows to support large-scale dataset ingestion, versioning, and distributed storage. Optimize end-to-end training performance: CPU–GPU transfers, NVMe caching, I/O pipelines, containerized runtime environments, and CUDA-level optimizations. Integrate training artifacts into on-robot inference stacks. Requirements Bachelor’s or Master’s degree in Computer Science or related field with 5+ years of experience. Strong proficiency with C++, Python, and ML frameworks (e.g., PyTorch, JAX). Deep experience with at least one major cloud provider (GCP, AWS, Azure) and familiarity with Kubernetes, containerization, and GPU machine provisioning. Strong understanding of SQL and NoSQL data stores (Postgres, MySQL, BigQuery, ElasticSearch, Redis). Requires 5 days/week in-office collaboration with the teams. Bonus Qualifications Expertise in profiling and optimizing CPU-GPU interactions. Experience scaling neural network training jobs and GPU programming with CUDA. Proven ability to develop annotation and dataset management tools. Benefits Competitive stock options/equity programs. Health, dental, and vision insurance, 401(k) plan. Visa sponsorship and green card support for qualified candidates. Lunches and dinners, a fully stocked kitchen, and regular team-building events.
Responsibilities
The Senior Software Engineer will architect and implement core training infrastructure for large-scale model training and deployment for physical robots. Responsibilities include building distributed training pipelines and optimizing end-to-end training performance.
Loading...