Senior Site Reliability Engineering at Royal Bank of Canada
Toronto, Ontario, Canada -
Full Time


Start Date

Immediate

Expiry Date

31 Jul, 26

Salary

0.0

Posted On

02 May, 26

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Site Reliability Engineering, Incident Management, Problem Management, Technical Leadership, Automation, Cloud Computing, Mainframe, Ansible, Dynatrace, PagerDuty, ServiceNow, GitHub, Elastic, Logstash, Kibana, Grafana

Industry

Financial Services

Description
Job Description WHAT IS THE OPPORTUNITY? This role will be responsible for leading the design, development, implementation and support of Site Reliability Engineering (SRE) solutions for applications supported by the Commercial Payments Technology (CPT) SRE organization. The incumbent will need advanced knowledge and experience working in an application development and/or technology operations organization. Perform production support role and partner with SRE Delivery team in incident management and problem management. WHAT WILL YOU DO? Technical Leadership: Lead code and non-functional (performance, security, maintainability, compliance, change management) reviews of all production bound SRE solutions Drive transformation by continuously looking for ways to automate existing processes, Run engineering mindset meetups accelerating breadth and depth of knowledge in community Manage SRE application assets (virtual machines, cloud instances, mainframe, source code repositories, etc.)Publish technical design for SRE solutions Publish and/or review implementation plans for SRE solutions bound to production, Explore new capabilities and technologies to drive innovation (including coding and publishing how-to documentation) Track, audit, monitor and implement on technical work streams, Act as portfolio SME (Subject Matter Expert) – understand & document common components, core functionalities, infrastructure of supported application Production Support: Escalation point in the on-call rotation, and support our maintenance, scheduled work, support and release deployment requirements Lead in incident management and problem management for applications in scope Incident management and problem management for applications in scope and RCA Action items fulfillment/ownership Focus on Continuous improvement and technical standards – Drive improvements in productivity, monitoring, tooling and best practices Manage technology currency (server patching, certificate renewal, compliance, etc.) with keen eye on automating opportunities Ensure availability and uptime of applications in scope, as per service level objectives, Manage PagerDuty rules/tuning/tagging, Moogsoft Situation management, Dynatrace tuning (RUM, Problem Card reduction), Provide expertise, direction, coaching and development to build the SRE teams capability Provide assistance with selecting & building a high performing diverse team that leverages individual capabilities & strengths WHAT DO YOU NEED TO SUCCEED? Must have: Advanced knowledge of industry practices, with focus on SRE Advanced experience in a variety of environments (Cloud, distributed and mainframe, business workflows and services/APIs, databases) Excellent communication skills, direct style (e.g. I did or did not do something, it does or does not work as opposed I believe or I understand it to be) Effective negotiation skills, stakeholder management, Ability to influence at the Director level (unit and other partner units) Mainframe knowledge and work experience Hands-on experience in a variety of SRE languages and tools (Ansible, Dynatrace Managed, Moog, PagerDuty, ServiceNow, GitHub, Slack, Elastic, Logstash, Kibana, Grafana , Catch Point, RedHat OCP) Nice-to-have: Computer Engineering, Computer Science, related (technical) degree/diploma, or related breadth of experience Exposure to Azure, docker and OCP Exposure to UCD, GitHub Experience in agile ways of working Middleware experience What’s in it for you? We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper. We care about each other, reaching our potential, making a difference to our communities, and achieving success that is mutual. A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable Leaders who support your development through coaching and managing opportunities Ability to make a difference and lasting impact in technology transformation Work in a dynamic, collaborative, progressive, and high-performing team A world-class training program in financial services and technology Flexible work/life balance options Opportunities to do challenging work Opportunities to take on progressively greater accountabilities Opportunities to building close relationships with clients #LI-POST Job Skills Agile Methodology, Application Infrastructure, Group Problem Solving, IT Automation, IT Monitoring, Operations Support, Problem Solving, Production Support, Site Reliability Engineering, Software Development Life Cycle (SDLC), Software Engineering, Software Product Technical Knowledge, System Applications, Systems Software Additional Job Details Address: RBC WATERPARK PLACE, 88 QUEENS QUAY W:TORONTO City: Toronto Country: Canada Work hours/week: 37.5 Employment Type: Full time Platform: TECHNOLOGY AND OPERATIONS Job Type: Regular Pay Type: Salaried Posted Date: 2026-03-13 Application Deadline: 2026-05-29 Note: Applications will be accepted until 11:59 PM on the day prior to the application deadline date above Our Employment Opportunities At RBC, we are guided by living shared values of Client First, Integrity, Collaboration, Respect and Excellence and winning together as One RBC. We believe an inclusive workplace that has diverse perspectives is core to our continued growth as one of the largest and most successful banks in the world. Maintaining a workplace where our employees feel supported to perform at their best, effectively collaborate, drive innovation, and grow professionally helps to bring our Purpose to life and create value for our clients and communities. RBC strives to deliver this through policies and programs intended to foster a workplace based on respect, belonging and opportunity for all. Join our Talent Community Stay in-the-know about great career opportunities at RBC. Sign up and get customized info on our latest jobs, career tips and Recruitment events that matter to you. Expand your limits and create a new future together at RBC. Find out how we use our passion and drive to enhance the well-being of our clients and communities at jobs.rbc.com. RBC is presently inviting candidates to apply for this existing vacancy. Applying to this posting allows you to express your interest in this current career opportunity at RBC. Qualified applicants may be contacted to review their resume in more detail.
Responsibilities
The role involves leading the design, development, and support of SRE solutions while driving automation and technical transformation across the organization. You will act as an escalation point for production support, managing incident resolution and ensuring application availability through rigorous monitoring and standards.
Loading...