Machine Learning Systems Engineer - Data & Evaluation, Horizons at Anthropic
San Francisco, California, USA -
Full Time


Start Date

Immediate

Expiry Date

07 Sep, 25

Salary

405000.0

Posted On

07 Jun, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Good communication skills

Industry

Information Technology/IT

Description

ABOUT ANTHROPIC

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

Responsibilities

ABOUT THE ROLE

As a Data & Evaluation Engineer on the Horizons team, you will build the software infrastructure that enables our AI models to use tools effectively and measure their performance. You’ll develop and extend our agent framework, create and implement evaluations, manage training data pipelines, and apply data science techniques to improve model capabilities. This engineering-focused role combines software development with empirical analysis to drive advances in model performance and capabilities.

The Horizons team leads Anthropic’s reinforcement learning research and development, playing a critical role in advancing our AI systems. We’ve contributed to all Claude models, with significant impacts on the autonomy and coding capabilities of Claude 3.5 and 3.7 Sonnet. Our work spans several key areas:

  • Developing systems that enable models to use computers effectively
  • Advancing code generation through reinforcement learning
  • Pioneering fundamental RL research for large language models
  • Building scalable RL infrastructure and training methodologies
  • Enhancing model reasoning capabilitie
Loading...