Applied Machine Learning Research Engineer - Perception Algorithms at Apple

Sunnyvale, California, USA

Full Time

Start Date

Immediate

Expiry Date

10 Jun, 25

Salary

264200.0

Posted On

10 Mar, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Machine Learning, Computer Vision, Creativity, Data Curation, Learning Techniques, Pipelines, Optimization Techniques, Training, Computer Science, Network Performance, Assessment, Python, Robotics

Industry

Information Technology/IT

Description

SUMMARY

Posted: Mar 5, 2025
Role Number: 200593756
Do you have a passion for deep learning and computer vision problems? We are looking for someone who thrives on collaboration and wants to push the boundaries of what is possible today! Join our team of committed deep learning engineers in the Video Computer Vision group! We are a centralized applied research and engineering organization responsible for developing real-time on-device Computer Vision, Machine Perception, and Generative technologies across Apple products. Our shipped technologies power features in ARKit, MeasureApp, RoomScan, Accessibility, and multiple VisionPro features. As a member of the Video Computer Vision group, you will develop new technologies in the area of scene understanding for Apple’s next-generation products.

DESCRIPTION

We are looking for a skilled deep learning and computer vision engineer for our team. In this role, you will perform research and development work to design algorithms for challenging real world problems in the domain of scene understanding. Your key responsibilities in this role are:
- Research, design, train, and evaluate machine learning/deep learning algorithms to address product goals.
- Benchmark and analyze machine learning/deep learning algorithms to understand limitations.
- Optimize algorithms for real time and low power constraints.
- Support algorithm integration into Apple products.
- Collaborate with teams across Apple with multidisciplinary skills.

MINIMUM QUALIFICATIONS

MS in Computer Science or related field with focus on machine learning, computer vision, robotics or similar.
Experience in designing and training efficient network architectures for a diverse set of tasks, e.g. image-to-image translation tasks (like Semantic Segmentation, Panoptic Segmentation), language-guided visual grounding, etc.
Experience with data curation for training vision language models, assessment of vision language model capabilities, and introduction of new capabilities into backbones trained with vision-language supervision / self-supervised learning.
Consistent track record of researching, inventing and/or shipping advanced machine learning algorithms.
Solid mathematical foundation of machine learning and deep learning techniques.
Strong coding skills in Python (with PyTorch) and C/C++.

PREFERRED QUALIFICATIONS

Creativity and curiosity for solving highly complex problems.
Experience designing and training with pipelines that consume large (billion-scale) datasets to train efficient vision language models for edge devices. This includes writing efficient data loading pipelines and using distributed GPU training frameworks.
Experience with advanced task-specific quality optimization techniques (few-shot learning, meta-learning, domain adaptation, knowledge-distillation) for improving network performance and handling specific failure cases (long-tailed distributions/under-represented classes) for downstream tasks.
Strong coding skills in Objective-C.
Excellent communication and collaboration skills.

Responsibilities

Please refer to the job description for details.