Start Date
Immediate
Expiry Date
29 Oct, 25
Salary
211180.0
Posted On
29 Jul, 25
Experience
0 year(s) or above
Remote Job
Yes
Telecommute
Yes
Sponsor Visa
No
Skills
Good communication skills
Industry
Information Technology/IT
Job Summary
The Red Hat Performance and Scale Engineering team is looking for a Senior Performance Engineer to join us in the PSAP (Performance and Scale for AI Platforms) team, driving the performance and scalability of distributed inference for Large Language Models (LLMs).
Serving modern LLMs for production inference requires distributing the model, the computation, and the requests across numerous specialized hardware accelerators across multiple nodes. This introduces complex performance challenges, from optimizing inter-GPU and inter node communication and kernel execution to minimizing latency under concurrent loads. You will be responsible for characterizing, modeling, and enhancing the performance of these distributed systems, ensuring that Red Hat’s AI platforms offer industry-leading throughput, latency, and cost-efficiency.
This role needs a seasoned engineer that thinks creatively, adapts to rapid change, and has the willingness to learn and apply new technologies. You will be joining a vibrant open source culture, and helping promote performance and innovation in this Red Hat engineering team. The border mission of the Performance and Scale team is to establish performance and scale leadership of the Red Hat product and cloud services portfolio. The scope includes component level, system and solution analysis and targeted enhancements. The team collaborates with engineering, product management, product marketing and customer support as well as Red Hat’s hardware and software ecosystem partners.
At Red Hat, our commitment to open source innovation extends beyond our products - it’s embedded in how we work and grow. Red Hatters embrace change – especially in our fast-moving technological landscape – and have a strong growth mindset. That’s why we encourage our teams to proactively, thoughtfully, and ethically use AI to simplify their workflows, cut complexity, and boost efficiency. This empowers our associates to focus on higher-impact work, creating smart, more innovative solutions that solve our customers’ most pressing challenges.
What you will do:
What you will bring:
The following is considered a plus: