Sr. AI/ML QA Automation Engineer at Apple

Cupertino, California, United States

Full Time

Start Date

Immediate

Expiry Date

10 Mar, 26

Salary

0.0

Posted On

10 Dec, 25

Experience

2 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

AI Modeling, Software QA, Automated Testing, Data Validation, Ethical AI Practices, ML Models Evaluation, Python, iOS, macOS, watchOS, tvOS, LLM Usage, System Design, Debugging, Release Management, Machine Learning, NLP Libraries

Industry

Computers and Electronics Manufacturing

Description

The Apple Photos application is a comprehensive photo and video management solution that seamlessly integrates across the entire Apple ecosystem, enabling users to capture, organize, edit, and share their visual memories with unprecedented ease and intelligence. Working on the Photos team means contributing to one of Apple's most personal and widely used applications, combining cutting-edge AI, elegant design, and robust engineering to help billions of users around the world preserve and relive their most important life moments. This is a high-impact role where you'll work at the intersection of AI modeling, agentic workflows, information retrieval, software engineering, and evaluation and metrics, and help us push the boundaries of how AI can transform Apple's products.

DESCRIPTION

This role blends traditional software QA skills with advanced evaluation methodologies for modern AI models, including LLMs, multimodal systems, and ML-driven product features. As a member of the team, you will work closely with experienced engineers and machine learning experts to qualify and refine features powered by vision, language, and cross-modal intelligence. You will be responsible for designing rigorous evaluation strategies for both objective and subjective ML behaviors, creating reliable automated testing pipelines, and developing LLM-driven evaluators that complement human judgement.

The ideal candidate is self-directed, creative, and comfortable with ambiguity, with strong technical and interpersonal skills. They have hands-on experience testing ML models directly, defining qualitative scoring rubrics, building reproducible evaluation frameworks, and ensuring that AI behavior is safe, consistent, and aligned with product and on-device constraints.

MINIMUM QUALIFICATIONS

BS/MS or equivalent experience in Computer Science or a related field
3+ years of experience working in Software Quality Assurance
Strong software engineering skills, including system design, development, testing, debugging, release, and maintenance
Expertise and hands-on experience in automated software testing, data validation, and ethical AI practices
Familiarity with using LLMs to improve the efficiency of daily work
Ability to evaluate ML models directly (vision or multimodal) and diagnose model failures using quantitative and qualitative methods
Expertise in Python
Proficiency with iOS, macOS, watchOS, tvOS, or similar operating systems

PREFERRED QUALIFICATIONS

Experience testing AI models for accuracy, robustness, fairness, and performance
Experience developing LLM-based automated evaluation frameworks
Expertise in Swift and/or Obj-C
Strong programming skills in Python and experience with ML/NLP libraries
3+ years of proven ability in machine learning, including hands-on work with LLMs
Understanding of prompt engineering and retrieval-augmented generation (RAG)
Knowledge of statistics-based evaluation approaches, ML training pipelines, and accuracy improvements of ML systems

Responsibilities

The role involves designing rigorous evaluation strategies for ML behaviors and creating automated testing pipelines. You will work closely with engineers and machine learning experts to ensure AI features are safe and consistent.