Applied Sciences IC4 at Microsoft

Israel

Full Time

Start Date

Immediate

Expiry Date

01 Mar, 26

Salary

0.0

Posted On

01 Dec, 25

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Natural Language Processing, Large Language Models, Machine Learning, Information Retrieval, Deep Learning, Applied Statistics, Data Analysis, Problem Solving, Reinforcement Learning, Model Alignment, Benchmarking, Agentic Flows, Customer Obsession, Communication Skills, Collaboration Skills, Evaluation Metrics

Industry

Software Development

Description

Responsibilities include close collaboration with engineering and product teams to innovate, design, and assess comprehensive AI solutions for millions of enterprise users. You will design, fine-tune, and deliver models and agentic flows for integration with Excel Agent and on-canvas experiences. You will leverage state-of-the-art LLM fine-tuning and retrieval methods, with robust evaluation metrics and A/B testing to ensure data-driven progress. You will gather and curate relevant benchmarks, build a comprehensive evaluation framework, and develop GPT-based evaluators (LLM-as-a-Judge). You will run controlled experiments to compare performance, efficiency, and scalability using data-driven metrics and A/B testing, focusing on reproducible and impactful results. You will continuously study emerging literature, share insights with leadership and peers during research reviews and deep dives, adapt quickly to new findings, integrate them into experiments, and, when applicable, share them with the broader research community.

M.Sc. or Ph.D. in Computer Science, Information Systems, or Data Science (Ph.D. strongly preferred). Candidates with a master's degree and proven industry experience or a strong publication record in the areas of LLMs, Information Retrieval, Machine Learning, Natural Language Processing, and Deep Learning are considered as well. We require strong hands-on experience (at least 3 years) in building and deploying Machine Learning products. Key areas of expertise include Natural Language Processing and Large Language Models, along with an understanding of concepts such as Privacy and Responsible AI. Candidates are expected to demonstrate a strong history of translating applied research into production-ready solutions, along with a proven track record of delivering projects within large-scale production environments. We are seeking candidates with proven expertise in the LLM domain and comprehensive knowledge of its relevant concepts. Ideal applicants should be proficient in areas such as LLM post-training (including CPT, SFT, and RL), LLM benchmarking, agentic flows, and model alignment. Outstanding proficiency in problem-solving and data analysis, with substantial expertise in applied statistics. Notable experience in evaluating the performance of large language models (LLMs) and developing benchmarks tailored to practical scenarios.

Ph.D. in Computer Science, Information Systems, or Data Science. Proven track record in training and post-training large language models using reinforcement learning or similar techniques. First-hand experience building LLM flows and agentic AI models. Customer-obsessed and passionate about making real-world product impact. Excellent verbal and written communication skills, with the ability to simplify and explain complex ideas. Effective collaboration skills within a globally distributed organization.

Responsibilities

The role involves collaborating with engineering and product teams to innovate and assess AI solutions for enterprise users. Responsibilities include designing and delivering models for integration with Excel Agent and conducting controlled experiments to evaluate performance and scalability.