Site Reliability Engineer at Cost Engineering
3332 Zwijndrecht, Netherlands
Full Time


Start Date

Immediate

Expiry Date

30 Sep, 25

Salary

0.0

Posted On

01 Jul, 25

Experience

3 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Azure, Microsoft SQL Server, Python, Java, Infrastructure Technologies, Logging, Docker, Containerization

Industry

Information Technology/IT

Description

This team is the right hand of our management. With their advice, analytical expertise, and knowledge of business and finance, they make sure our company processes run smoothly.

DO YOU GET ENERGY FROM SOLVING COMPLEX TECHNICAL IT CHALLENGES WITHIN AN IN-HOUSE SOFTWARE DEVELOPMENT COMPANY? WE ARE LOOKING FOR SOMEONE WITH A PASSION FOR RELIABILITY, PERFORMANCE AND AVAILABILITY. ARE YOU THE PERSON WHO WILL BRING OUR SYSTEMS AND APPLICATIONS TO THE NEXT LEVEL?

We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our IT Operations team. As an SRE, you will be responsible for ensuring the reliability, availability, and performance of our systems and applications. You will collaborate closely with our development teams, system engineers, and other stakeholders to build and maintain robust and scalable infrastructure.

QUALIFICATIONS

  • At least 3 years of experience in maintaining and securing cloud-based infrastructure
  • Automation skills in the context of Infrastructure as Code (IaC)
  • Understanding of cloud infrastructure technologies (Azure) and containerization (Docker, Kubernetes)
  • Solid understanding of networking concepts and protocols
  • Experience with Windows Server 2016/2019 and Office 365
  • Experience with Microsoft SQL Server 2017/2019
  • Familiarity with monitoring and logging (e.g. ELK stack)
  • Knowledge of Python, Java, and/or another programming language is a plus

RESPONSIBILITIES

  • Design, implement, and maintain highly available and scalable infrastructure solutions.
  • Collaborate with software development teams to ensure proper integration of software and infrastructure components.
  • Monitor system performance and proactively identify and resolve any issues or bottlenecks.
  • Implement and enhance automation tools for deployment, configuration management, and monitoring.
  • Conduct incident response and root cause analysis to prevent system failures from recurring.
  • Optimize system performance through capacity planning, performance tuning, and resource utilization analysis.