Site Reliability Engineer (m/f/d) at Jedox GmbH
79098 Freiburg, Baden-Württemberg, Germany
Full Time


Start Date

Immediate

Expiry Date

28 May, 25

Salary

0.0

Posted On

28 Feb, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Good communication skills

Industry

Information Technology/IT

Description

YOUR PROFILE

You are a motivated, hands-on individual with a solid technical foundation in SRE or DevOps. You thrive in a collaborative environment and enjoy solving complex technical challenges with a focus on automation and innovation.

  • Proven experience in a Site Reliability Engineer or DevOps role
  • Hands-on experience with a major cloud provider, preferably Microsoft Azure
  • Solid expertise with Kubernetes, ideally including AKS
  • Understanding of operating systems, networking principles, and tools such as firewalls, reverse/forward proxies, and load balancers
  • Hands-on experience with observability tools like Grafana, Prometheus, Loki, Thanos, and Honeycomb
  • Proficiency in writing Helm charts and Bash scripts, working with Argo CD, creating GitHub Actions workflows, and containerizing applications
  • Solid understanding of networking and security best practices is a plus
  • Software development experience is a plus (we use Go, PHP, Python, JavaScript)
  • Strong communication and collaboration skills
  • Technical curiosity and persistence in ensuring smooth system operation
  • Strong analytical skills with a focus on automation to solve complex technical issues
  • Excellent verbal and written communication skills in English (German is a plus)

ABOUT US

Jedox is a leading software solution for business planning, budgeting, and forecasting across finance, sales, and other business functions. It uses leading-edge technology to drive digital transformation and provide tangible customer value. Constant innovation has made us a leader in the Enterprise Performance Management (EPM) sector.

Responsibilities

We are looking for a Site Reliability Engineer (SRE) (m/f/d) to join our growing team. In this role, you will help us build and maintain a cloud-only infrastructure to ensure our global customers can access our services with unparalleled reliability and performance. As part of our SRE team, you will focus on automation, observability, and the implementation of Service Level Objectives (SLOs) and Service Level Indicators (SLIs).

  • Improve and maintain our observability stack, ensuring the health and performance of our systems
  • Monitor alerts and respond promptly to ensure high availability and reliability
  • Collaborate with stakeholders to implement best practices for incident response and Root Cause Analysis
  • Maintain and enhance security measures to ensure compliance with organizational and industry standards
  • Document incident reports, standard operating procedures, and troubleshooting guides to foster knowledge-sharing within the team
  • Work closely with development, product, and operations teams to ensure seamless service delivery
  • Participate in post-incident reviews and implement improvements to prevent future issues
  • Track and report on SLOs and SLIs to ensure the reliability of our services