Senior Researcher – LLM Systems at Microsoft
Redmond, Washington, United States -
Full Time


Start Date

Immediate

Expiry Date

24 Jan, 26

Salary

0.0

Posted On

26 Oct, 25

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Distributed Systems, Operating Systems, Large-Scale ML Serving, Algorithmic Innovations, Systems Innovations, Cloud Infrastructure, Machine Learning, Software Engineering, Research Collaboration, Product Innovation, Latency Improvement, Throughput Improvement, Cost Reduction, Reliability Enhancement, Deployment Safety, Endpoint Configuration

Industry

Software Development

Description
Generative AI is transforming how people create, collaborate, and communicate - redefining productivity across Microsoft 365 and our customers globally. At Microsoft, we run the biggest platform for collaboration and productivity in the world with hundreds of millions of consumer/enterprise users. Tackling AI efficiency challenges is crucial for delivering these experiences at scale. Within our Microsoft wide Systems Innovation initiative, we are working to advance efficiency across AI systems, where we look at novel designs and optimizations across AI stacks: models, AI frameworks, cloud infrastructure, and hardware. We are an Applied Research team driving mid- and long-term product innovations. We closely collaborate with multiple research teams and product groups across the globe who bring a multitude of technical knowledge in cloud systems, machine learning and software engineering. We communicate our research both internally and externally through academic publications, open-source releases, blog posts, patents, and industry conferences. Further, we also collaborate with academic and industry partners to advance the state of the art and target material product impact that will affect 100s of millions of customers. We are looking for a Senior Researcher – LLM Systems to invent, analyze, and productionize the next generation of serving architectures for transformer-based models across cloud and edge. The candidate will focus on algorithmic and systems innovations, including batching, routing, scheduling, caching, deployment safety, and endpoint configuration, that materially improve latency, throughput, cost, and reliability under real-world SLAs for Microsoft Copilots. The qualified candidate brings a solid background in distributed systems, operating systems, and/or large-scale ML serving, plus the ambition to translate research into impact in production environments. This role blends rigorous research (theory + measurement) with hands-on engineering, and includes publishing papers, filing patents, and collaborating across research and product teams to advance the state of the art.   Have a look at this link for reading: Efficient AI - Microsoft Research [https://www.microsoft.com/en-us/research/group/efficient-ai/]   Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities
The Senior Researcher will invent, analyze, and productionize next-generation serving architectures for transformer-based models across cloud and edge. The role involves algorithmic and systems innovations to improve latency, throughput, cost, and reliability for Microsoft Copilots.
Loading...