MLOps Engineer at Bjak
Deutschland, , Germany -
Full Time


Start Date

Immediate

Expiry Date

27 Nov, 25

Salary

0.0

Posted On

27 Aug, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Kubernetes, Platforms

Industry

Information Technology/IT

Description

TRANSFORM LANGUAGE MODELS INTO REAL-WORLD APPLICATIONS

We’re building AI systems for a global audience. We are living in an era of AI transition - this new project team will be focusing on building applications to enable more real world impact and highest usage for the world.
This role is a global role with hybrid work arrangement - combining flexible remote work with in-office collaboration at our HQ. You’ll work closely with regional teams across product, engineering, operations, infrastructure and data to build and scale impactful AI solutions.

REQUIREMENTS

  • Experience with model serving platforms such as vLLM or HuggingFace TGI
  • Proficiency in GPU orchestration using tools like Kubernetes, Ray, Modal, RunPod, LambdaLabs
  • Ability to monitor latency, costs, and scale systems efficiently with traffic demands
  • Experience setting up inference endpoints for backend engineers
Responsibilities

WHY THIS ROLE MATTERS

You’ll fine-tune state-of-the-art models, design evaluation frameworks, and bring AI features into production. Your work ensures our models are not only intelligent, but also safe, trustworthy, and impactful at scale.

WHAT YOU’LL DO

  • Run and manage open-source models efficiently, optimizing for cost and reliability
  • Ensure high performance and stability across GPU, CPU, and memory resources
  • Monitor and troubleshoot model inference to maintain low latency and high throughput
  • Collaborate with engineers to implement scalable and reliable model serving solutions
Loading...