Senior Site Reliability Engineer at Nordhealth

Helsinki, , Finland -

Full Time

Start Date

Immediate

Expiry Date

11 Sep, 25

Salary

0.0

Posted On

12 Jun, 25

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Python, Django, Infrastructure, Addition, Aws, Automation, Kubernetes, Octopus Deploy, Code, Operators, Scripting

Industry

Computer Software/Engineering

Description

WHO ARE WE?

Nordhealth’s mission is to build software that improves the daily lives of healthcare professionals. We build software that empowers veterinary and therapy professionals to provide the best possible care experiences to their patients. Our products are used daily by over 50,000 professionals in clinics and hospitals across 30+ countries. We excel with 20+ years of experience in healthcare and veterinary software.
We understand that talent comes from everywhere and anywhere. The greater our diversity, the better the products we deliver. That’s why we are a remote-first company, headquartered in Helsinki, Finland, with all 400+ employees working either remotely or from collaboration hubs. While our market presence is currently strongest in the Nordics, our customer base is rapidly growing in our other markets too, especially in Europe and North America (more at our website nordhealth.com.)

WHAT’S IN IT FOR YOU?

At Nordhealth, we do things a little bit differently. We value continuous improvement, diverse teams and autonomy which drive our collaboration. Our global healthcare domain is rapidly developing and we are seeking colleagues who enjoy working in this type of environment.

In addition, we offer:

The chance to work in a meaningful industry and in a fast-growing, global company on a path to changing digital healthcare
Competitive compensation and benefits
A fully remote role with a true async culture
Learning and professional growth opportunities
The tools you need, and enjoy using
Frequent company events and talented colleagues from around the world

If you enjoy working in a fast-growing and international environment with the possibility to make an impact, this might be the perfect job for you. Apply now! We’ll fill the position as soon as we find the right person

Ideally, you have already gained some experience from working in a fast growing, global SaaS company. In addition, our humble wishlist includes:

Ideally, 5+ years of experience in Site Reliability, Platform Engineering, or DevOps roles.
Expertise in Infrastructure-as-Code, especially with Terraform.
Solid hands-on experience with Kubernetes, including Helm, Kustomize, operators, and cluster-level debugging.
Strong understanding of observability principles and tools (e.g., Prometheus, Grafana, OpenTelemetry, Datadog).
Proficient in Python for scripting, automation, and tooling. Experience with Django for database migrations is also highly valued
Practical experience with CI/CD systems (e.g., GitHub Actions, GitLab CI, Octopus Deploy, ArgoCD, or similar).
Experience building or supporting Internal Developer Platforms or developer self-service tools.
Familiarity with cloud platforms (AWS, GCP, or Azure); AWS preferred.
Comfortable in a highly collaborative, remote-first environment

Responsibilities

ABOUT THE ROLE

We are here to create great healthcare products. That’s at our very core, but it’s the people who make companies great, not the other way around. We spend more time with our co-workers than anyone else in our lives. Being part of an exceptional team is not only important for your personal mindfulness but key for your professional progression.
We’re now looking for an SRE to support our growth and join our Platform Team!
This is a unique opportunity to join the engineering team of the Veterinary business unit that has a total of 65 software engineers!

YOUR KEY RESPONSIBILITIES INCLUDE:

Design, implement, and manage cloud infrastructure using Terraform and Infrastructure-as-Code best practices.
Operate and optimize Kubernetes clusters and associated tooling to support highly available, scalable services.
Help shape and evolve our observability maturity (metrics, tracing, logging) to improve system insight and reliability, including monitoring and alerting
Automate operational tasks and build scalable tooling in Python to reduce toil and accelerate development.
Drive CI/CD pipeline improvements and adoption, ensuring fast, reliable, and secure software delivery.
Lead the design and implementation of our Internal Developer Platform (IDP) to empower product and engineering teams with self-service capabilities.
Define and enforce SLOs, SLIs, and error budgets in collaboration with engineering teams.
Participate in and improve the incident response process, including on-call rotations and postmortems.

WHAT WILL HELP YOU TO BE SUCCESSFUL IN THIS ROLE?

Ideally, you have already gained some experience from working in a fast growing, global SaaS company. In addition, our humble wishlist includes:

Ideally, 5+ years of experience in Site Reliability, Platform Engineering, or DevOps roles.
Expertise in Infrastructure-as-Code, especially with Terraform.
Solid hands-on experience with Kubernetes, including Helm, Kustomize, operators, and cluster-level debugging.
Strong understanding of observability principles and tools (e.g., Prometheus, Grafana, OpenTelemetry, Datadog).
Proficient in Python for scripting, automation, and tooling. Experience with Django for database migrations is also highly valued
Practical experience with CI/CD systems (e.g., GitHub Actions, GitLab CI, Octopus Deploy, ArgoCD, or similar).
Experience building or supporting Internal Developer Platforms or developer self-service tools.
Familiarity with cloud platforms (AWS, GCP, or Azure); AWS preferred.
Comfortable in a highly collaborative, remote-first environment.