Principal Researcher at Microsoft
Redmond, Washington, United States
Full Time


Start Date

Immediate

Expiry Date

20 Feb, 26

Salary

0.0

Posted On

22 Nov, 25

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Foundation Models, Language Models, Multimodal Models, Supervised Finetuning, Distillation, Reinforcement Learning, Low-Rank Adaptation, PyTorch, DeepSpeed, AzureML, Machine Learning Engineering, Data Science, Synthetic Data Generation, Publications, Software Libraries, Model Alignment

Industry

Software Development

Description
Research on foundation models: design and train language and multimodal models (and finetunes) using supervised finetuning, distillation, reinforcement learning, and low-rank adaptation.
Research on environment interaction, policy specification and verification, and human intent alignment.
Implementation: train, distill, and finetune language and vision models on the PyTorch, DeepSpeed, and AzureML stack.
Machine learning engineering: build pipelines to test designs, algorithms, and models.
Data science: research and develop synthetic data generation strategies.
Proactively follow state-of-the-art research, share the latest work, write papers, attend conferences, and share knowledge with the wider team.

Doctorate in a relevant field AND 3+ years of related research experience, OR equivalent experience.
Experience conducting research as part of a research program in academic or industry settings.
3+ years' experience training and/or finetuning large language models, model compression, and distillation to small language models.
3+ years' experience with Python and deep learning libraries.
Experience in large language/multimodal model training, finetuning, and distillation.
Demonstrated record of publications in top-tier conferences or journals (ICLR, ACL, EMNLP, ICML, CVPR, ICCV, ECCV, NeurIPS, TPAMI, etc.).
Experience creating reusable software libraries and packages.
Experience with dataset curation, data generation by prompting state-of-the-art LLMs, and/or model alignment.
Experience with reinforcement learning libraries or formal methods and verification is a plus.
Demonstrated ability and passion for incubating new ideas, solving problems, and building working systems.
Responsibilities
The Principal Researcher will focus on designing and training language and multimodal models, including supervised finetuning and reinforcement learning. They will also implement machine learning engineering practices and develop synthetic data generation strategies.