ML Framework (MetalLM) Engineer

at Apple

Cupertino, California, USA

Start Date: Immediate
Expiry Date: 06 Oct, 2024
Salary: USD 208300 Annual
Posted On: 07 Jul, 2024
Experience: N/A
Skills: Optimization Techniques, Triton, Computer Architecture, Design, Machine Learning
Telecommute: No
Sponsor Visa: No
Required Visa Status:
Citizen, GC, US Citizen, Student Visa, H1B, CPT, OPT, H4 Spouse of H1B, GC Green Card
Employment Type:
Full Time, Part Time, Permanent, Independent - 1099, Contract – W2, C2H Independent, C2H W2, Contract – Corp 2 Corp, Contract to Hire – Corp 2 Corp

Description:

SUMMARY

Posted: Jul 3, 2024
Weekly Hours: 40
Role Number: 200558203
Apple’s ML Frameworks team in the GPU, Graphics and Displays org provides GPU acceleration for popular machine learning libraries such as TensorFlow, PyTorch and JAX using the Metal runtime and device backend. The team optimizes compute performance with kernels and computational graphs that are fine-tuned for the unique characteristics of each Metal GPU family. We are always looking for exceptionally dedicated individuals to grow our outstanding team.

DESCRIPTION

Our team is seeking extraordinary machine learning and GPU programming engineers who are passionate about providing robust compute solutions for accelerating machine learning libraries on Apple Silicon. The role offers the opportunity to influence the design of compute and programming models in next-generation GPU architectures.

Responsibilities:

  • Work on a cutting-edge ML inference framework project and optimize code for efficient and scalable ML inference using distributed techniques such as data parallelism
  • Design and develop compiler-based optimizations for the Metal backend in ML frameworks, such as torch.compile for PyTorch
  • Implement features of the Metal device backend for ML training acceleration technologies
  • Work with the core teams of PyTorch, JAX or TensorFlow to provide Metal runtime and device backend support
  • Tune GPU-accelerated training across products
  • Perform in-depth analysis and compiler- and kernel-level optimizations to ensure the best possible performance across hardware families

QUALIFICATIONS

  • 3+ years of programming and problem-solving experience with C/C++/ObjC
  • Contributions to an AI framework such as PyTorch, JAX or TensorFlow
  • Experience with graph compilers such as Triton, OpenXLA or LLVM/MLIR is a plus

PREFERRED QUALIFICATIONS

  • Experience with distributed training or inference techniques is a plus
  • GPU compute programming models and optimization techniques
  • Good understanding of machine learning fundamentals
  • Experience with system-level programming and computer architecture

REQUIREMENT SUMMARY

Min: N/A, Max: 5.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

Graduate

Proficient

1

Cupertino, CA, USA