Principal Software Development Engineer - (LLM reinforcement learning) at Advanced Micro Devices
Helsinki, , Finland -
Full Time


Start Date

Immediate

Expiry Date

02 Nov, 25

Salary

0.0

Posted On

03 Aug, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Good communication skills

Industry

Information Technology/IT

Description
Responsibilities

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.
AMD together we advance_
Join AMD Silo AI’s Base Models team to work on open source post-training. You will turn strong base checkpoints into assistant-grade models through supervised fine-tuning, reward-modeling, and RL, while keeping multilingual and low-resource performance front-and-center.

THE ROLE

  • Design, implement and tune post-training methods (SFT, DPO/PPO/GRPO, RLVR) on large-scale HPC clusters.
  • Develop high-throughput synthetic-data pipelines with verifiable results.
  • Integrate relevant metrics with the Evaluation team to enable rapid feedback loops.
  • Publish code, data sets and training recipes under permissive licenses; upstream improvements to TRL or other similar frameworks.
  • Collaborate on OpenEuroLLM post-training efforts.
Loading...