Infrastructure Engineer at Gracemark
Montréal, QC, Canada -
Full Time


Start Date

Immediate

Expiry Date

25 Sep, 25

Salary

95.0

Posted On

26 Jun, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

French, Infrastructure, Code, Dns Management, Security

Industry

Information Technology/IT

Description

REQUIRED SKILLS AND EXPERIENCE

  • Expertise in Infrastructure as Code (IaC) tools, especially Terraform and/or Crossplane.
  • Strong background in AWS VPC architecture, including subnets, routing, security groups, and cross-region networking.
  • Experience with Site-to-Site VPN, Direct Connect, and Transit Gateway routing.
  • Proficiency in AWS ALB, Route53 DNS management, and cloud observability tools (e.g., Datadog).
  • Knowledge of security best practices, including least privilege and blast radius minimization.
  • Familiarity with Kubernetes Load Balancer Controllers and External DNS Controllers.
  • Experience operating in PCI-compliant environments is a plus.
  • Knowledge of PrivateLink, BGP, and service meshes (e.g., Cilium) is a plus.
    Job Type: Full-time

Language:

  • French (required)
Responsibilities

ROLE OVERVIEW

We are seeking an experienced Infrastructure Consultant to join the Cloud Platform Infrastructure team. This role is critical in helping transition customers to a new generation platform and in evolving the platform for use across multiple global markets and business models.
The successful candidate will use Site Reliability Engineering (SRE) principles to ensure platform stability, scalability, and performance. This role offers the opportunity to work on large-scale production systems, implement infrastructure automation, and optimize cloud resources.

KEY RESPONSIBILITIES

  • Apply SRE principles: change management, monitoring, emergency response, capacity planning, and production readiness reviews.
  • Lead infrastructure projects aimed at improving system resilience and reducing operational risk.
  • Develop infrastructure automation and tooling to reduce manual work and system downtime.
  • Drive visibility improvements across SLIs, SLOs, SLAs, and dependency mapping.
  • Manage and scale production infrastructure, with a focus on AWS cloud resources.
  • Participate in on-call rotation and improve the team’s on-call processes.
  • Optimize cloud resource usage and assist with cost management efforts.
  • Collaborate with cross-functional teams to drive infrastructure improvements.
Loading...