Start Date
Immediate
Expiry Date
19 Nov, 25
Salary
0.0
Posted On
20 Aug, 25
Experience
5 year(s) or above
Remote Job
Yes
Telecommute
Yes
Sponsor Visa
No
Skills
Python, Kubernetes, DevOps, Bash
Industry
Information Technology/IT
We are seeking a Site Reliability Engineer (SRE) to help scale and secure mission-critical platforms for a leading financial institution in Amsterdam. As part of a cross-functional engineering team, you’ll focus on observability, reliability, incident response and operational excellence across distributed systems.
This role demands both engineering skill and operational discipline. Previous experience in a banking or regulated enterprise environment is mandatory.
REQUIREMENTS:
• 3–5 years of experience in an SRE, DevOps or Platform Engineering role
• Strong skills in observability tooling (Prometheus, Grafana, ELK, Splunk, etc.)
• Experience with incident management and post-mortem analysis
• Proficient with Kubernetes and infrastructure automation (Terraform, Helm)
• Solid scripting skills (Bash, Python, Go)
RESPONSIBILITIES:
• Build and improve monitoring, logging and alerting for high-availability systems
• Support production reliability across Kubernetes, cloud and on-prem environments
• Define SLOs, SLIs and error budgets in collaboration with development teams
• Lead root cause analysis and incident response processes
• Automate operational tasks and drive reliability through infrastructure-as-code
• Contribute to playbooks, runbooks and operational readiness reviews