Deep Learning Performance Software Engineer at NVIDIA
Beijing, Beijing, China -
Full Time


Start Date

Immediate

Expiry Date

23 Dec, 25

Salary

0.0

Posted On

24 Sep, 25

Experience

2 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Deep Learning, Software Engineering, C/C++ Programming, Python, MLIR, AI Agent, Performance Modelling, Profiling, Debugging, Code Optimization, GPU Programming, CUDA, OpenCL, Performance Optimization, Software Design, Agile

Industry

Herstellung von Computerhardware

Description
We are now looking for a Deep Learning Performance Software Engineer! We are expanding our research and development for deep learning. We seek excellent Software Engineers and Senior Software Engineers to join our team. We specialize in developing GPU-accelerated Deep learning software. Researchers around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs in numerous areas. Join the team that builds software to enable new solutions. Your ability to work in a fast-paced customer-oriented team is required and excellent communication skills are necessary. What you’ll be doing: Develop deep learning compiler Develop highly optimized deep learning kernels End-to-end performance optimization Do performance optimization, analysis, and tuning What we need to see: Masters or PhD or equivalent experience in relevant discipline (CE, CS&E, CS, AI) SW Agile skills helpful Excellent C/C++ programming and software design skills Python experience a plus MLIR experience a plus AI agent experience a plus Performance modelling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU GPU programming experience (CUDA or OpenCL) desired 3 years of relevant work experience NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most brilliant and talented people on the planet working for us. If you're creative and autonomous, we want to hear from you! NVIDIA is the world leader in accelerated computing. NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest industries and profoundly impacting society. Learn more about NVIDIA.
Responsibilities
Develop deep learning compilers and highly optimized deep learning kernels. Perform end-to-end performance optimization, analysis, and tuning.
Loading...