Start Date
Immediate
Expiry Date
09 Oct, 25
Salary
0.0
Posted On
09 Jul, 25
Experience
0 year(s) or above
Remote Job
Yes
Telecommute
Yes
Sponsor Visa
No
Skills
Good communication skills
Industry
Information Technology/IT
Location
Toronto, New York, Seattle, San Francisco, United States, Canada
Employment Type
Full time
Location Type
Hybrid
Department
Modelling
Evaluation is critical to making progress in scaling intelligence. As models become superhuman in many real-world use cases, we must keep developing new techniques to accurately measure their performance on frontier capabilities. In this role, you will be responsible for creating next-generation evaluation methods and scalable infrastructure to measure LLM progress.