Emergency Call Management - Site Reliability Engineer

at  Motorola Solutions

Gatineau, QC, Canada -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate15 Sep, 2024Not Specified18 Jun, 2024N/ADevops,Architecture,Systems Programming,Disaster Recovery,Incident Response,Root Cause,Scrum,Kanban,High Availability Architecture,Communication Skills,Capacity Planning,Software Development,Creativity,Computer Engineering,Hybrid CloudNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

COMPANY OVERVIEW

At Motorola Solutions, we’re guided by a shared purpose – helping people be their best in the moments that matter – and we live up to our purpose every day by solving for safer. Because people can only be their best when they not only feel safe, but are safe. We’re solving for safer by building the best possible technologies across every part of our safety and security ecosystem. That’s mission-critical communications devices and networks, AI-powered video security & access control and the ability to unite voice, video and data in a single command center view. We’re solving for safer by connecting public safety agencies and enterprises, enabling the collaboration that’s critical to connect those in need with those who can help. The work we do here matters.

JOB DESCRIPTION

As a software engineer on the Emergency Call Management site reliability engineering (ECM-SRE) team you will join a team of talented software engineers who work directly with product and engineering teams to constantly improve reliability across our suite of public safety products.

Your responsibilities will include:

  • Architecture and implementation of Monitoring/Observability objectives. This includes maintenance of Alert response playbooks.
  • Creation and reinforcement of the HA and reliability strategy.
  • Triage of customer-reported incidents and problems to the proper software team, requiring troubleshooting and problem management skills.
  • Maintenance and reporting of SLOs and error budget.
  • Facilitation of Chaos Engineering activities with multiple engineering teams.
  • Developing the SRE culture and sharing best practices across Motorola Solutions’ Emergency Call Management organization.
  • On-call support alongside multiple engineering teams for products and services in production. This role focuses on Incident Command to maintain focus and direction of the incident process. This is essential to meet regulatory reporting requirements.
  • Assist Motorola Solutions’ customer support teams in creating customer facing communication documents, requiring strong communication skills.
  • Facilitation of Failure Mode and Effects Analysis with multiple engineering teams.

The right individual will have a passion for observability, reliability, automation, incident response, and enabling innovation.
-
En tant qu’ingénieur logiciel au sein de l’équipe d’ingénierie de la fiabilité des sites de gestion des appels d’urgence (ECM-SRE), vous rejoindrez une équipe d’ingénieurs logiciels talentueux qui travaillent directement avec les équipes de produits et d’ingénierie pour améliorer constamment la fiabilité de notre ensemble de produits de sécurité publique.

Vos responsabilités comprendront :

  • Architecture et mise en œuvre des objectifs de surveillance/observabilité. Cela comprend la maintenance des livres de jeu de réponse aux alertes.
  • Création et renforcement de la stratégie de disponibilité élevée et de fiabilité.
  • Traitement des incidents et problèmes signalés par les clients et attribution au service logiciel approprié, nécessitant des compétences en dépannage et en gestion des problèmes.
  • Maintenance et rapport des objectifs de niveau de service (SLO) et du budget d’erreurs.
  • Facilitation des activités d’ingénierie du chaos avec plusieurs équipes d’ingénierie.
  • Développement de la culture SRE et partage des meilleures pratiques au sein de l’organisation de gestion des appels d’urgence de Motorola Solutions.
  • Support d’astreinte aux côtés de plusieurs équipes d’ingénierie pour les produits et services en production. Ce rôle met l’accent sur le commandement des incidents pour maintenir la concentration et la direction du processus d’incident. Cela est essentiel pour répondre aux exigences de rapport réglementaires.
  • Aide aux équipes de support client de Motorola Solutions dans la création de documents de communication destinés aux clients, nécessitant de solides compétences en communication.
  • Facilitation de l’analyse des modes de défaillance et de leurs effets avec plusieurs équipes d’ingénierie.

La bonne personne aura une passion pour l’observabilité, la fiabilité, l’automatisation, la gestion des incidents et la promotion de l’innovation.

BASIC REQUIREMENTS

  • BS in Computer Engineering (or equivalent degree)
  • 4+ years of professional software development
  • Excellent communication skills
  • Experience developing cloud-based applications
  • Experience developing REST-based APIs and implementing microservice principles and architectures
  • Experience with modern DevOps tooling (including CI/CD pipelines)
  • Familiarity with the concepts involved in designing a high availability architecture
  • Familiarity with observability and monitoring
  • Familiarity with automated testing
  • Creativity and persistence when solving complex problems
  • Enthusiasm for learning key technologies, architectures, processes, and best practices

PREFERRED SKILLS

  • Familiarity with SRE or DevOps
  • Familiarity with container deployment and orchestration technologies at scale
  • Familiarity with SLOs and SLIs
  • Familiarity with incident response, disaster recovery, root cause analysis, and postmortems
  • Familiarity with IaC
  • Familiarity with chaos engineering
  • Familiarity with redundancy and failovers
  • Familiarity with capacity planning and load balancing
  • Familiarity with service mesh
  • Familiarity with feature flags, canary releases, or blue/green deployments
  • Familiarity with hybrid cloud architecture
  • Familiarity developing cloud-based applications with a multi-tenant database architecture
  • Familiarity with systems programming (network stack, file system, OS services) and networking (L2 vs. L3, network architecture, VLANs, etc)
  • Experience working in Agile teams leveraging Scrum, Kanban, or other methodologies and/or understanding of Agile development concepts
  • Experience being on-call for a product in production
    -

BASIC REQUIREMENTS

  • BS in Computer Engineering (or equivalent degree)
  • 4+ years of professional software development experience developing cloud-based applications, developing REST-based APIs and implementing microservice principles and architectures and experience with modern DevOps tooling (including CI/CD pipelines)

TRAVEL REQUIREMENTS

Under 10%

Responsibilities:

  • Architecture and implementation of Monitoring/Observability objectives. This includes maintenance of Alert response playbooks.
  • Creation and reinforcement of the HA and reliability strategy.
  • Triage of customer-reported incidents and problems to the proper software team, requiring troubleshooting and problem management skills.
  • Maintenance and reporting of SLOs and error budget.
  • Facilitation of Chaos Engineering activities with multiple engineering teams.
  • Developing the SRE culture and sharing best practices across Motorola Solutions’ Emergency Call Management organization.
  • On-call support alongside multiple engineering teams for products and services in production. This role focuses on Incident Command to maintain focus and direction of the incident process. This is essential to meet regulatory reporting requirements.
  • Assist Motorola Solutions’ customer support teams in creating customer facing communication documents, requiring strong communication skills.
  • Facilitation of Failure Mode and Effects Analysis with multiple engineering teams


REQUIREMENT SUMMARY

Min:N/AMax:5.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

BSc

Engineering

Proficient

1

Gatineau, QC, Canada