Senior Machine Learning Engineer - SIML, ISE

at  Apple

Cupertino, California, USA -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate27 Nov, 2024USD 264200 Annual29 Aug, 2024N/AData Science,Statistics,Python,Computer Science,Learning Techniques,Git,Analytics,Statistical Data Analysis,Presentation Skills,User Experience,Conducting,Statistical Packages,Mathematics,EngineersNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

SUMMARY

Posted: Jun 26, 2024
Weekly Hours: 40
Role Number:200551872
The System Intelligence and Machine Learning team is in charge of creating datasets that power many of Apple’s intelligent software. Our datasets range from very small targeted sets to Petabyte scale datasets. We are looking for an expert Machine Learning engineer, or Data Scientist who can help create and improve the datasets used in Generative AI through proven understanding and usage of ML and stats. As a senior member of the System Intelligence and Machine Learning Data team, you will be using Apple technologies to refine our datasets, perform ML-based QA, remove toxicity and select the right images, videos or texts through active selection and model-in-the-loop methodologies. Focus areas range from text processing across many languages (toxic language detection and removal, identification of colloquial vs formal language) to image and video understanding, deduplication and processing. As part of this role you will also own our data synthesis efforts in various modalities including image, text, videos and audio.

DESCRIPTION

In this role, you will be working to deepen our understanding of how various datasets can improve the quality of Apple’s ML models on a range of products. You will particularly help shape Apple’s Datasets that are used for generative AI by removing irrelevant or toxic assets, selecting the right assets by employing various asset selection algorithms, and synthesizing new datasets by utilizing Apple proprietary ML models. For this, you will also use your stats and ML background to build models and algorithms that can select the right assets for ML experiences from a large pool of available assets. And you will work with our data engineers to put your models in data pipelines to run on large scale datasets. In our team, you are encouraged to collaborate with other AIML product stakeholders and partners to understand needs, design Machine Learning models that help us better understand our data and automatically pick the right assets for ML training. Our Data Scientists actively evaluate and present the progress of their work. Your creative decision making will be applied daily.

KEY QUALIFICATIONS

  • Proven track record in a Machine Learning Engineering or Applied Scientist role, preferably in a technology company.
  • Familiarity with a broad range of Machine Learning techniques and relevant statistical packages to engineer ML solutions end-to-end.
  • Experience in contributing to production codes; ability to rapidly prototype algorithmic ideas in notebook environments and translate them into production code.
  • Proficient in state-of-the-art ML techniques, particularly in the field of Generative AI and Large Language Models (Transformer architecture, diffusion models, CLIP and various visual and text embedding models, GPT and BERT style language models).
  • Strong proficiency with Python (Scikit learn, Jupyter), PyTorch, SQL-based languages. Working proficiency with Git.
  • Proven experience in data science and analytics, including statistical data analysis. Experience crafting, conducting, analyzing, and interpreting experiments and deep-dive investigations.
  • Outstanding communication and presentation skills with the ability to explain difficult technical topics to everyone from data scientists, engineers, and business partners.

EDUCATION & EXPERIENCE

Bachelors, Masters or PhD degree in Computer Science, Statistics, Mathematics, Engineering; or equivalent experience.

ADDITIONAL REQUIREMENTS

  • Strong analytical product intuition: able to understand the user experience and use data to guide the development of products.
  • Experience in synthetic data generation for videos, images, text and audio is desired.
  • Ability to understand a technically complex product, and work with engineering leads and data engineers.
  • Ability to build relationships across multiple functions and establish strong partnerships.

Responsibilities:

Please refer the Job description for details


REQUIREMENT SUMMARY

Min:N/AMax:5.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

Graduate

Computer Science, Mathematics, Statistics

Proficient

1

Cupertino, CA, USA