Platform Engineer (m/f/d) with a focus on Platform-Infra-Operations

at Serviceware SE

Germany

Start Date: Immediate
Expiry Date: 30 Apr, 2025
Salary: Not Specified
Posted On: 31 Jan, 2025
Experience: N/A
Skills: Good communication skills
Telecommute: No
Sponsor Visa: No

Description:

Your perspectives with us
Serviceware is a leading provider of software for managing services, offering IT infrastructure solutions and strategic consulting. Our hosting infrastructure supports developers in delivering their software as a cloud service to our valued customers. Our infrastructure spans multiple virtual datacenters hosted on public cloud providers (including AWS, Azure, and Hetzner Cloud) and a private cloud based entirely on Linux systems.
Role Overview: We are seeking a Platform Engineer with a focus on Platform-Infra-Operations to join our team. We are looking for engineers and administrators who can help us with any subset of the following topics our team is responsible for.
These tasks are waiting for you
Team Responsibilities:

Infrastructure Management

  • Base VM Management: We use Ansible and Terraform to provision and manage Linux VMs across our cloud providers: AWS and Hetzner Cloud as public cloud providers, and a VMware-based private cloud.
  • Network and Connectivity: We are responsible both for the internal network over which our VMs communicate, built on WireGuard, and for the secure reachability of our developers' applications. Depending on the cloud provider, we use different technologies such as Amazon's ELBs, as well as DNS, Consul DNS, Traefik, and Let’s Encrypt to automate certificate handling.
  • Performance Monitoring: We monitor and scale systems using Zabbix and closely cooperate with our product development and operation teams to ensure effective usage of our compute resources.
  • Capacity Planning: We assist in capacity planning and scalability efforts to accommodate growing data volumes and user traffic.
  • Security and Compliance: We try to solve centrally many of the requirements that are deal-breakers for our customers, such as security and compliance regulations or backup and restore requirements. This way, our development teams do not need to handle them individually.

Container Management Services

  • Container Scheduling: Our team provides our product teams with a container scheduling solution using HashiCorp Nomad, Consul and Vault. This gives our development teams easy and secure ways to schedule and deploy their applications and to manage secrets in a safe environment, and it allows for an effective, loose coupling between the applications and our infrastructure services.
  • Automation: We support our development teams in achieving a high level of automation and repeatability for their deployments and applications in production. This simplifies daily processes for both teams, as we can cooperate by relying on automation instead of coordination through processes wherever possible.
  • Runtime Security: Our product development already uses various security assessments in its build processes, such as Snyk and SonarQube. However, the security community may well discover security issues later that affect containers which are already built and deployed. It is our responsibility to ensure we detect, manage and report these threats.
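
To illustrate the runtime security task described above, here is a minimal sketch in Python. It is not a description of the team's actual tooling: the data shapes, the `Advisory` class, the job names and the advisory ID are all hypothetical, and versions are assumed to be purely numeric and dot-separated. It shows the core idea of re-checking containers that are already deployed against an advisory published after they were built.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Advisory:
    """A newly published vulnerability (hypothetical, simplified shape)."""
    advisory_id: str
    package: str
    fixed_in: tuple[int, ...]  # first version that contains the fix


def parse_version(text: str) -> tuple[int, ...]:
    """Turn '1.2.10' into (1, 2, 10) so versions compare numerically."""
    return tuple(int(part) for part in text.split("."))


def affected_jobs(deployed, advisories):
    """Return {job_name: [advisory_id, ...]} for jobs running a vulnerable package.

    `deployed` maps a job name to the packages inside its container image,
    e.g. {"billing-api": {"openssl": "3.0.1"}} (hypothetical inventory format).
    """
    findings = {}
    for job, packages in deployed.items():
        for advisory in advisories:
            installed = packages.get(advisory.package)
            # A job is affected if it ships the package in a version older than the fix.
            if installed is not None and parse_version(installed) < advisory.fixed_in:
                findings.setdefault(job, []).append(advisory.advisory_id)
    return findings


if __name__ == "__main__":
    deployed = {
        "billing-api": {"openssl": "3.0.1", "zlib": "1.3.1"},
        "search-worker": {"openssl": "3.0.13"},
    }
    advisories = [Advisory("EXAMPLE-0001", "openssl", parse_version("3.0.8"))]
    print(affected_jobs(deployed, advisories))
```

In practice the package inventory would come from an image scanner and the advisories from a vulnerability feed; the sketch only covers the matching step between the two.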

Application Persistence Management

  • Persistence Ownership: In the context of application persistence, our team is responsible for providing performant, highly available and durable persistence solutions for applications, including their backups and restores.
  • Database Management: The main persistence most applications use in our environment is PostgreSQL. We utilize PostgreSQL, Patroni and pgBackRest to provide our development teams with highly available PostgreSQL clusters and point-in-time recovery if necessary.
  • Object Storage Management: We provide our developers with S3-compatible storage based on self-hosted MinIO clusters.
  • File Storage Management: For rare cases, we also offer GlusterFS volumes for our applications.

Collaboration and Optimization

  • Close Cooperation: We work closely with both datacenter operations and development teams. We are not an isolated silo at a great distance, but rather work very closely with the development teams, even forming virtual teams across product development and operations.
  • Expert Support: Often enough, development teams require expertise on how to handle certain cases. It is part of our responsibility as a team to share our experiences with the solutions from other teams. This includes but is not limited to specialized PostgreSQL Extensions for specific cases, configuration automation and structuring, and many more cases.
  • Documentation: We create and maintain documentation using different tools chosen based on the context. The most fundamental documentation on infrastructure operations is managed in Obsidian to have minimal dependencies. Documentation meant for the broader Serviceware organization is written in or imported into our own product, Serviceware Knowledge, and distributed from there in the Serviceware Platform Services.

Infrastructure Figures

  • 2 of 6 major products hosted in the SCE, 1 currently being migrated
  • 800 Nomad jobs in production, 400 Nomad jobs in dev
  • 500 VMs across 3 cloud providers and 6 private HVs
  • 18 Postgres Clusters ranging from 25GB to 4.5TB in size, with up to 1.5k tps

What you have to offer

  • Proven experience in Linux system administration, network administration, or cloud infrastructure management.
  • Experience with automated management and provisioning of Linux servers. We use Ansible and Terraform here, but the mindset is more important to us.
  • Some contact with Containers or Container Orchestration systems. In our case, this is HashiCorp Nomad, Consul and Vault running on Docker.
  • You have provided persistence for applications or users in the past. We are using PostgreSQL, MinIO and, to a lesser extent, GlusterFS and Elasticsearch for this.
  • Experience with scripting languages (Python, Bash).
  • An interesting war story about troubleshooting a tough problem in your career.
  • Ability to work collaboratively in a team environment and communicate effectively with stakeholders.

What we offer you

  • Opportunities for professional growth and development
  • A collaborative and innovative work environment
  • Flexible working hours and remote working options

Your contact at SERVICEWARE
Vanessa Rivera
Human Resources
+49 6434 9450 0
Be part of Serviceware



REQUIREMENT SUMMARY

Min: N/A, Max: 5.0 year(s)

Information Technology/IT

IT Software - Application Programming / Maintenance

Software Engineering

Graduate

Proficient

1

Deutschland, Germany