AI Infrastructure Engineer at StackAI
San Francisco, CA 94105, USA -
Full Time


Start Date

Immediate

Expiry Date

03 Dec, 25

Salary

0.0

Posted On

03 Sep, 25

Experience

4 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Good communication skills

Industry

Information Technology/IT

Description

WHAT WE’RE LOOKING FOR

  • 4+ years of backend engineering experience (Python preferred).
  • Deep expertise in task queues, job orchestration, and distributed systems.
  • Hands-on experience with Redis, Celery, RabbitMQ, Celery Beat, and ideally Temporal.
  • Experience scaling systems at a startup or in fast-paced environments.
  • Strong understanding of deploying, monitoring, and optimizing AI/ML systems in production (infrastructure toolsets and CI/CD practices)
  • Familiarity with containerization (Docker, Kubernetes), IaC (Terraform), or MLOps pipeline.

How To Apply:

Incase you would like to apply to this job directly from the source, please click here

Responsibilities

ABOUT THE ROLE

We’re hiring an AI Infrastructure Engineer to shape and scale the backend systems that power our AI platform. As a Series A company, your work will be foundational, enabling safe, efficient, and reliable AI workflows from end to end.

WHAT YOU’LL DO

  • Design and implement scalable backend architectures tailored for AI workloads (inference, orchestration, monitoring).
  • Manage and optimize distributed job orchestration using Celery, Temporal, RabbitMQ, and Celery Beat.
  • Enhance data pipelines and caching strategies using Redis.
  • Collaborate closely with ML engineers to integrate models into production, ensuring scalability and reliability.
  • Build robust monitoring, observability, retry, and fault tolerance systems around job execution.
  • Support infrastructure management and incident response to ensure uptime and performance.
  • Contribute to platform infrastructure and tooling to support Stack AI’s rapid growth trajectory.
Loading...