Computer Architecture Intern - LLM, 2026 at NVIDIA
Shanghai, Shanghai, China -
Full Time


Start Date

Immediate

Expiry Date

17 Jan, 26

Salary

0.0

Posted On

20 Oct, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Software Engineering, GPU Programming, LLM Inference, Python, C++, CUDA, Deep Learning, Problem-Solving, Collaboration, Initiative, Results Delivery, PyTorch, Reinforcement Learning, Human Feedback, LLM Training Frameworks, FSDP, Megatron LLM

Industry

Computer Hardware Manufacturing

Description
We are now looking for Computer Architecture Interns in our group! The NVIDIA Architecture group is looking for world class architects and computer science interns to join and lead our various architecture efforts. A key part of NVIDIA's strength is to innovate in the graphics and parallel computing fields delivering the highest performance in the world for parallel processing algorithms. We are constantly looking for ways to improve our GPU architecture and maintain our leadership by developing new parallel programming models, new architectures and new infrastructure that is required to make this successful. What you'll be doing: Develop and refine software solutions to expedite LLM SW stack (could be within inference/post train or pre-train phase) by harnessing the power of GPU technology. Collaborate closely with a world-class team of engineers to implement and refine GPU-based algorithms. Analyze and determine the most effective methods to improve performance, ensuring seamless execution across diverse computing environments. Engage in both individual and team projects, contributing to NVIDIA's mission of leading the AI revolution. Work in an empowering and inclusive environment to successfully implement groundbreaking AI solutions. What we need to see: Proven experience in software engineering, particularly in GPU programming and LLM inference. Strong proficiency in programming languages such as Python, C++, and CUDA. A solid understanding of deep learning frameworks and techniques. Outstanding problem-solving skills and the ability to work collaboratively in a team setting. Ambitious approach with a proven track record of taking initiative and delivering results. A degree in Computer Science, Engineering, or a related field, or equivalent experience. Experience with PyTorch, RLHF (Reinforcement Learning with Human Feedback), and LLM training frameworks like FSDP/Megatron LLM is a plus. NVIDIA is the world leader in accelerated computing. NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest industries and profoundly impacting society. Learn more about NVIDIA.
Responsibilities
Develop and refine software solutions to expedite the LLM software stack by leveraging GPU technology. Collaborate with a team of engineers to implement and enhance GPU-based algorithms for improved performance.
Loading...