Senior Software Engineer at Microsoft
Beijing, Beijing, China -
Full Time


Start Date

Immediate

Expiry Date

17 Feb, 26

Salary

0.0

Posted On

19 Nov, 25

Experience

2 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Python, C/C++, Machine Learning, CUDA, Deep Learning, Model Compression, NVIDIA GPUs, AMD GPUs, Kernel Development, Algorithm Optimization, Scheduling, Parallelization, Infrastructure Maintenance, Technical Problem Solving, Growth Mindset, Learning

Industry

Software Development

Description
- Keep up to date with and utilize the latest developments in LLM system optimization. - Discover/solve impactful technical problems, advance state-of-the-art LLM technologies, and translate ideas into production. - Optimize LLM inference workloads through innovative kernel, algorithm, scheduling, and parallelization technologies. - Continuously maintain internal LLM inference infrastructure. - A bachelor's degree or higher in computer science, engineering, or a related field, PhD is preferred - Strong programming skills in Python and C/C++ - 2+ years of experience in machine learning system development and optimization - 2+ years of experience in CUDA kernel development and optimization - Experience in optimizing communication layer / kernels for deep learning systems - Experience in machine learning model compression - Experience on different hardware such as both NVIDIA and AMD GPUs is a plus - A growth mindset and a passion for learning new things This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled. *
Responsibilities
The Senior Software Engineer will optimize LLM inference workloads and maintain internal LLM inference infrastructure. They will also solve impactful technical problems and advance state-of-the-art LLM technologies.
Loading...