Software Engineer, Inference Scalability and Capability tags.new at Anthropic
London, England, United Kingdom -
Full Time


Start Date

Immediate

Expiry Date

10 Sep, 25

Salary

325000.0

Posted On

12 Jun, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Distributed Systems, Python, Kubernetes

Industry

Information Technology/IT

Description

ABOUT ANTHROPIC

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

YOU MAY BE A GOOD FIT IF YOU:

  • Have significant software engineering experience
  • Are results-oriented, with a bias towards flexibility and impact
  • Pick up slack, even if it goes outside your job description
  • Enjoy pair programming (we love to pair!)
  • Want to learn more about machine learning research
  • Care about the societal impacts of your work

STRONG CANDIDATES MAY ALSO HAVE EXPERIENCE WITH:

  • High performance, large-scale distributed systems
  • Implementing and deploying machine learning systems at scale
  • LLM optimization batching and caching strategies
  • Kubernetes
  • Python
Responsibilities

Our Scalability and Capability Inference team is responsible for building and maintaining the critical systems that serve our LLMs to a diverse set of consumers. As the cornerstone of our service delivery, the team focuses on scaling inference systems, ensuring reliability, optimizing compute resource efficiency, and developing new inference capabilities. The team tackles complex distributed systems challenges across our entire inference stack, from optimal request routing to efficient prompt caching.

Loading...