Senior Machine Learning Engineer - SIML, ISE
at Apple
Cupertino, California, USA -
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 27 Nov, 2024 | USD 264200 Annual | 29 Aug, 2024 | N/A | Data Science,Statistics,Python,Computer Science,Learning Techniques,Git,Analytics,Statistical Data Analysis,Presentation Skills,User Experience,Conducting,Statistical Packages,Mathematics,Engineers | No | No |
Required Visa Status:
Citizen | GC |
US Citizen | Student Visa |
H1B | CPT |
OPT | H4 Spouse of H1B |
GC Green Card |
Employment Type:
Full Time | Part Time |
Permanent | Independent - 1099 |
Contract – W2 | C2H Independent |
C2H W2 | Contract – Corp 2 Corp |
Contract to Hire – Corp 2 Corp |
Description:
SUMMARY
Posted: Jun 26, 2024
Weekly Hours: 40
Role Number:200551872
The System Intelligence and Machine Learning team is in charge of creating datasets that power many of Apple’s intelligent software. Our datasets range from very small targeted sets to Petabyte scale datasets. We are looking for an expert Machine Learning engineer, or Data Scientist who can help create and improve the datasets used in Generative AI through proven understanding and usage of ML and stats. As a senior member of the System Intelligence and Machine Learning Data team, you will be using Apple technologies to refine our datasets, perform ML-based QA, remove toxicity and select the right images, videos or texts through active selection and model-in-the-loop methodologies. Focus areas range from text processing across many languages (toxic language detection and removal, identification of colloquial vs formal language) to image and video understanding, deduplication and processing. As part of this role you will also own our data synthesis efforts in various modalities including image, text, videos and audio.
DESCRIPTION
In this role, you will be working to deepen our understanding of how various datasets can improve the quality of Apple’s ML models on a range of products. You will particularly help shape Apple’s Datasets that are used for generative AI by removing irrelevant or toxic assets, selecting the right assets by employing various asset selection algorithms, and synthesizing new datasets by utilizing Apple proprietary ML models. For this, you will also use your stats and ML background to build models and algorithms that can select the right assets for ML experiences from a large pool of available assets. And you will work with our data engineers to put your models in data pipelines to run on large scale datasets. In our team, you are encouraged to collaborate with other AIML product stakeholders and partners to understand needs, design Machine Learning models that help us better understand our data and automatically pick the right assets for ML training. Our Data Scientists actively evaluate and present the progress of their work. Your creative decision making will be applied daily.
KEY QUALIFICATIONS
- Proven track record in a Machine Learning Engineering or Applied Scientist role, preferably in a technology company.
- Familiarity with a broad range of Machine Learning techniques and relevant statistical packages to engineer ML solutions end-to-end.
- Experience in contributing to production codes; ability to rapidly prototype algorithmic ideas in notebook environments and translate them into production code.
- Proficient in state-of-the-art ML techniques, particularly in the field of Generative AI and Large Language Models (Transformer architecture, diffusion models, CLIP and various visual and text embedding models, GPT and BERT style language models).
- Strong proficiency with Python (Scikit learn, Jupyter), PyTorch, SQL-based languages. Working proficiency with Git.
- Proven experience in data science and analytics, including statistical data analysis. Experience crafting, conducting, analyzing, and interpreting experiments and deep-dive investigations.
- Outstanding communication and presentation skills with the ability to explain difficult technical topics to everyone from data scientists, engineers, and business partners.
EDUCATION & EXPERIENCE
Bachelors, Masters or PhD degree in Computer Science, Statistics, Mathematics, Engineering; or equivalent experience.
ADDITIONAL REQUIREMENTS
- Strong analytical product intuition: able to understand the user experience and use data to guide the development of products.
- Experience in synthetic data generation for videos, images, text and audio is desired.
- Ability to understand a technically complex product, and work with engineering leads and data engineers.
- Ability to build relationships across multiple functions and establish strong partnerships.
Responsibilities:
Please refer the Job description for details
REQUIREMENT SUMMARY
Min:N/AMax:5.0 year(s)
Information Technology/IT
IT Software - Other
Software Engineering
Graduate
Computer Science, Mathematics, Statistics
Proficient
1
Cupertino, CA, USA