Reliability Engineer for Forecast Delivery

at  ECMWF

Bologna, Emilia-Romagna, Italy -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate09 Jul, 2024Not Specified10 Apr, 2024N/AAws,Scripting,Cloud,Docker,Essential Training,Ruby,Linux System Administration,Python,Ansible,Perl,Logging,AnalyticsNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

THE TEAM

The role sits in the Forecast Delivery Team, within the Application Delivery Section of our Computing Department. The Section provides platforms and services that enable ECMWF teams to consume computing resources at different levels (PaaS, SaaS) and to consistently deploy applications with different levels of support to a high degree of quality and reliability.
The Section achieves this through innovation in the areas of computer systems administration automation, application deployment and operation, reliability engineering, identity and access management, container orchestration, observability (monitoring, logging, and tracing), and PaaS/SaaS application development.

EDUCATION AND EXPERIENCE

  • A university degree (EQF Level 6) or equivalent industry experience
  • Experience in a mission-critical 24/7 operational environment
  • Experience in an NWP and/or suites and/or forecast production domain is desirable but not essential training will be provided

Responsibilities:

THE ROLE

We are in search of a highly motivated analyst to work in the newly formed Application Delivery section at ECMWF. In this role, you will support ECMWFs critical operational production systems, in particular data acquisition and observation pre-processing, and weather product generation and delivery. Much of the work involves analysis of events which arise in the systems and working with developers and infrastructure teams to implement improvements to system observability, quality, and reliability.

MAIN DUTIES AND KEY RESPONSIBILITIES

  • Supporting the ECMWF critical operational forecast production systems, in particular:
  • data acquisition and observation pre-processing
  • product generation, dissemination and archiving
  • Automation of the deployment of software to containers, VMs, or bare metal
  • Developing observability capabilities for services and their underlying systems
  • Advising on quality assurance for new contributions and changes to critical operational systems
  • Advising on and testing new developments targeted for operational implementation
  • Contributing to documentation and training, including cross-training within the team
  • Advocating for reliability engineering within ECMWF and its partners
  • Participating in regular 24-hour on-call rotas for any critical systems and services in the relevant areas
  • Any other relevant domains related to the teams portfolio


REQUIREMENT SUMMARY

Min:N/AMax:5.0 year(s)

Information Technology/IT

IT Software - Application Programming / Maintenance

Software Engineering

Graduate

Proficient

1

Bologna, Emilia-Romagna, Italy