Site Reliability Engineer - SRE

at  Coveo

Quebec City, QC, Canada -

Start DateExpiry DateSalaryPosted OnExperienceSkillsTelecommuteSponsor Visa
Immediate21 Apr, 2025Not Specified22 Jan, 2025N/AGood communication skillsNoNo
Add to Wishlist Apply All Jobs
Required Visa Status:
CitizenGC
US CitizenStudent Visa
H1BCPT
OPTH4 Spouse of H1B
GC Green Card
Employment Type:
Full TimePart Time
PermanentIndependent - 1099
Contract – W2C2H Independent
C2H W2Contract – Corp 2 Corp
Contract to Hire – Corp 2 Corp

Description:

DRIVING SYSTEM RELIABILITY THROUGH AUTOMATION, MONITORING, AND PROACTIVE INCIDENT MANAGEMENT.

At Coveo, the Site Reliability Engineer (SRE) for the Index team will focus on improving operational efficiency and automating manual tasks. The SRE’s workload will be evenly split, with 50% dedicated to operational tasks such as troubleshooting, debugging, and communication, and 50% focused on improvements, including automation, tool development, dashboard creation, and documentation.
Ultimately, the SRE will help foster a proactive, collaborative culture that anticipates and addresses performance issues

Responsibilities:

HERE IS A GLIMPSE AT YOUR RESPONSIBILITIES:

  • Define critical KPIs to monitor system health, and develop centralized dashboards for real-time visibility and performance tracking.
  • Act as the first line for all unplanned requests, maintain awareness of incoming tasks, and ensure progress tracking and proactive communication. Present maintenance summaries at each sprint review.
  • Identify common debugging workflows and create runbooks, tools, and dashboards to streamline the debugging process, improving resolution speed and system predictability.
  • Design and implement automation solutions to reduce manual interventions, and collaborate with developers to improve operational efficiency.
  • Propose and manage system limits to enhance predictability and prevent issues, and identify tools or processes to proactively address customer performance challenges.
  • Establish regular syncs with the Support team to align on priorities, gather feedback, and ensure visibility into ongoing efforts and challenges.

HERE IS WHAT WILL QUALIFY YOU FOR THE ROLE:

  • Solid technical knowledge of programming and scripting, particularly with Python.
  • Strong analytical and problem-solving skills.
  • Great communications skills and the desire to connect with developers and stakeholders.

What would make you stand out :

  • Ability to evaluate the broader impact of actions, balancing innovation with caution.
  • Experience with cloud based distributed systems.

DO YOU THINK YOU CAN BRING THIS ROLE TO LIFE?

You don’t need to check every single box; passion goes a long way and we appreciate that skillsets are transferable.
Send us your application, we want to get to know you! Join the Coveolife!
We encourage all qualified candidates to apply regardless of, for example, age, gender, disability, gaps in CV, national or ethnic background. We know that applying for a new role is a lot of work and we really appreciate your time.

li-hybri


REQUIREMENT SUMMARY

Min:N/AMax:5.0 year(s)

Information Technology/IT

IT Software - Other

Software Engineering

Graduate

Proficient

1

Quebec City, QC, Canada