Senior Site Reliability Engineer (SRE) - Disaster Recovery Specialist (m/f/ at Dynatrace
Linz, OÖ, Austria -
Full Time


Start Date

Immediate

Expiry Date

29 Apr, 25

Salary

56000.0

Posted On

30 Jan, 25

Experience

5 year(s) or above

Remote Job

No

Telecommute

No

Sponsor Visa

No

Skills

Information Technology, Reliability Engineering, Infrastructure, Code, Aws, Azure, Communication Skills, Computer Science, Google Cloud, Disaster Recovery

Industry

Information Technology/IT

Description

COMPANY DESCRIPTION

Dynatrace exists to make software work perfectly. Our platform combines broad and deep observability and continuous runtime application security with advanced AIOps to provide answers and intelligent automation from data. This enables innovators to modernize and automate cloud operations, deliver software faster and more securely, and ensure flawless digital experiences.

JOB DESCRIPTION

  • Design, implement, and maintain disaster recovery solutions for our cloud-based SaaS environment, ensuring rapid and effective recovery in the event of system failures or disasters
  • Develop and document comprehensive disaster recovery plans, procedures, and runbooks, and regularly conduct drills and exercises to test and validate the effectiveness of these plans
  • Collaborate with engineering, operations, and security teams to identify (e.g by Chaos Engineering) and mitigate potential risks to system availability and data integrity while at the same time increase the system resilience
  • Monitor system performance and health metrics, proactively identify areas for improvement, and implement preventive measures to enhance system reliability and resilience
  • Participate in incident response and post-incident reviews, analyze root causes of failures, and implement corrective actions to prevent recurrence

QUALIFICATIONS

  • Degree in Computer Science, Information Technology, or related field
  • 5+ years of hands-on experience in site reliability engineering, ideally with a focus on disaster recovery, preferably in a cloud-based SaaS environment.
  • Strong expertise in designing and implementing disaster recovery solutions using industry-leading technologies and methodologies.
  • Proficiency in cloud platforms such as AWS, Azure, or Google Cloud Platform
  • Experience with infrastructure as code (IaC) tools such as Terraform or CloudFormation.
  • Excellent communication skills with the ability to effectively collaborate with cross-functional teams and communicate technical concepts to non-technical stakeholders.
Responsibilities

Please refer the Job description for details

Loading...