[TECH] [ENGINEERING] Senior Platform Engineer SRE [FOUNDATIONS]
at Alumni Network Job Board
Berlin, Germany
Start Date | Expiry Date | Salary | Posted On | Experience | Skills | Telecommute | Sponsor Visa |
---|---|---|---|---|---|---|---|
Immediate | 06 Sep, 2024 | Not Specified | 07 Jun, 2024 | N/A | Good communication skills | No | No |
Description:
WHAT YOU’LL BRING
- Experience with observability tools and infrastructure (e.g. Prometheus, VictoriaMetrics, Grafana)
- Proficiency in CI/CD tools (e.g. GitHub Actions, Argo CD, Argo Rollouts) and principles (e.g. GitOps, Progressive Delivery)
- Good knowledge of infrastructure-as-code tools such as Terraform
- Familiarity with at least one cloud platform (e.g. AWS, Azure, or GCP) and its services
- Familiarity with SRE best practices and principles (e.g. SLIs/SLOs, incident management, post-mortems)
- Experience building highly available and observable systems at scale (preferably in Go or Python)
- Strong asynchronous communication and collaboration skills
Responsibilities:
THE ROLE
The Foundations Alliance builds the tools, services, systems, and infrastructure that engineering teams across HelloFresh use daily.
As a Platform Engineer in the Site Reliability team, you will play a key role in upholding high reliability and performance standards for critical components at HelloFresh. You will build self-service tools that let HelloFresh engineers release code confidently and observe their applications seamlessly.
We’d love to hear from you if you’re passionate about reliability, observability, and automation!
WHAT YOU’LL DO
- Architect and build infrastructure automation at scale for 1000+ engineers
- Implement self-service observability and release management tools
- Drive improvements in change failure rate, mean time to detect, and mean time to restore
- Consult with and educate engineers on observability, release, and incident management best practices
REQUIREMENT SUMMARY
Min: N/A, Max: 5.0 year(s)
Information Technology/IT
IT Software - Application Programming / Maintenance
Software Engineering
Graduate
Proficient
1
Berlin, Germany