Senior Machine Learning Engineer

at  Microsoft

Redmond, WA 98052, USA -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate25 Aug, 2024USD 229200 Annual26 May, 20241 year(s) or aboveLanguages,Ordinances,Citizenship,Communication Skills,Aws,Microsoft,Color,Computer Science,Ethnicity,Regulations,Software Projects,Azure,Consideration,Base Pay,Deep LearningNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

The Business Applications Group is a rapidly growing organization that is responsible for the Microsoft Dynamics 365 suite of products, Microsoft Flow, PowerApps, AI Builder, Power BI and more. Microsoft is a leader in Software as a Service, and this organization is at the heart of how business applications are designed and delivered.
Our group is making massive investments in AI to lead the industry to create smart, personalized, business applications leveraging a variety of models. We are looking for a Senior Machine Learning Engineer to join our team and help build the model training and inference tools to meet our ambitions. As part of the BAP Copilot AI Team, you will work directly with our product teams and our Data Scientists to design, develop, and deploy models that provide personalized, low-latency inference. You will contribute to our deployment of models ranging up to 70B parameter range including personalized, fine-tuned scenarios.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond

REQUIRED/MINIMUM QUALIFICATIONS:

  • Bachelor’s Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience.
  • At least 3 years experience building and deploying Machine Learning models in a production environment.
  • At least 4 years building and deploying production software projects
  • At least 1 year of experience with machine learning frameworks like PyTorch, Tensorflow, ONNX, or TensorRT in production environments.

ADDITIONAL OR PREFERRED QUALIFICATIONS:

  • Bachelor’s Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR Master’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience.
  • Experience profiling and optimizing runtime performance, particularly on GPUs.
  • Experience optimizing deep models for performance: quantizing, compressing, distillation, pruning, or related techniques
  • Demonstrated engagement with the ML engineering community
  • Excellent written and spoken communication skills and ability to motivate a technical team to work together on an ambitious project.
  • Have at least 1 year of hands on experience with PyTorch, Tensorflow, ONNX, TensorRT, or other Deep Learning libraries
  • Have at least 1 year of experience building and deploying systems in a cloud environment like Azure, AWS, or GCP
    Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $117,200 - $229,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $153,600 - $250,200 per year.
    Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
    Microsoft will accept applications for the role until June 14, 2024.

    BETJobs #BAPJobs

Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations

Responsibilities:

We are looking for an individual with demonstrated experience optimzing deep learning models for scale.

Responsibilities include:

  • Collaborate with data scientists and software engineers to develop and deploy machine learning models.
  • Design and implement model optimizations given the hardware, latency, and model constraints of our features.
  • Stay up to date on emerging tools and methods for more efficient, lower latency model inference (e.g. Triton, advanced attention mechanisms.)
  • Design and implement model inference infrastructure for personalized, fine-tuned models using mLora or other techniques.
  • Develop internal tools for customization and optimization of models for real-time inference
  • Implements tests and telemetry to monitor and continuously improve our infrastructure.
  • Monitor and maintain deployed models, ensuring reliability and low latency at scale.
  • As part of the AI Architecture group, advise on model training best practices and project lifecycle.


REQUIREMENT SUMMARY

Min:1.0Max:4.0 year(s)

Computer Software/Engineering

IT Software - System Programming

Software Engineering

Graduate

Languages including but not limited to c c c java javascript or python

Proficient

1

Redmond, WA 98052, USA