Senior Manager, Site Reliability Engineering (SRE) – Digital Banking

Toronto, ON
  • Number of positions available: 1

  • To be discussed
  • Starting date: 1 position to fill as soon as possible

Application Deadline:

07/03/2025

Address:

33 Dundas Street West

Job Family Group:

Technology

Role Overview

We are seeking a hands-on and strategic Senior Manager to lead our Site Reliability Engineering (SRE) and Infrastructure Patching teams supporting the Digital Banking Platform. This role is crucial to our mission of providing always-on, secure, and high-performing banking services for millions of customers.

Key Responsibilities

Technical Leadership & Incident Management

  • Provide strategic oversight for incident resolution efforts led by the SRE team, ensuring rapid restoration and comprehensive root cause analysis (RCA).
  • Collaborate across engineering, platform, and security teams to troubleshoot issues spanning full-stack environments (cloud, container, and legacy platforms).
  • Maintain high availability and performance of digital banking applications (primarily AWS, OpenShift, Linux, with some legacy WebSphere).
  • Champion proactive monitoring, observability, and alerting (e.g., Dynatrace, OpenSearch).

SRE & Reliability Engineering

  • Define and implement best practices for reliability, scalability, and availability tailored to large-scale digital banking.
  • Continuously improve CI/CD pipelines, release automation, and deployment practices.
  • Drive rigorous postmortem analysis and a culture of blameless continuous improvement.
  • Optimize for scalability, redundancy, and resilience, minimizing customer impact from incidents.

Infrastructure Patching

  • Oversee patching and maintenance for cloud and on-prem environments (AWS, OpenShift, Red Hat VMs, some WebSphere).
  • Ensure zero-downtime patching strategies and automation to mitigate operational risk and security vulnerabilities.
  • Partner with security teams to enforce compliance, harden platforms, and remediate vulnerabilities.

Reporting & Analytics

  • Provide strategic direction and oversight for reporting frameworks and analytics capabilities, ensuring actionable insights into platform reliability and operational performance.
  • Collaborate with teams to refine dashboards, metrics, and reporting tools that provide clear visibility for stakeholders and leadership.
  • Drive initiatives to improve data accuracy and alignment with organizational goals, ensuring reporting supports decision-making and strategic priorities.

Team Leadership & Process Improvement

  • Lead, mentor, and grow a high-performing team of 8-10 SREs.
  • Drive a culture of ownership, operational excellence, and continuous learning.
  • Establish and enforce best practices for incident management, operational documentation, and process automation.
  • Collaborate with development, infrastructure, and product teams to enhance observability, deployment, and proactive issue detection.

Required Skills

  • Hands-on troubleshooting skills in complex, distributed, or high-availability technical environments.
  • Experience in observability, monitoring, and incident management for critical platforms.
  • Demonstrated leadership in technical settings; this may include leading projects or initiatives, or mentoring teams, even without prior experience as a formal people manager.
  • Strong ability to provide oversight and strategic direction for reporting and analytics frameworks, ensuring alignment with organizational goals.
  • Excellent communicator, able to translate technical detail for both engineers and executives.

Why Join?

  • Direct impact: Drive the reliability and performance of a critical digital banking platform.
  • Technical leadership: Autonomy to shape best practices and modernize operations.
  • Challenging & rewarding: Work on complex, large-scale systems supporting millions.

Designs how code is deployed, configured, and monitored, and manages the availability, latency, change management, emergency response, and capacity management of services in production. Helps teams determine which new features can be incorporated, and when, by using service-level agreements (SLAs) to define the required reliability of the system through service-level indicators (SLIs) and service-level objectives (SLOs). Applies software engineering to automate IT operations tasks such as production system management, change management, incident response, and emergency response. Acts as a link between the development and operations teams. Applies expertise to conduct chaos tests and performance tests for critical business requirements.
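As a rough illustration of how an SLI, an SLO, and the resulting error budget relate, here is a minimal Python sketch. It assumes a simple request-based availability SLI; the class name, function names, and the 99.9% target used in the note below are hypothetical and are not taken from BMO's actual tooling or targets.

```python
from dataclasses import dataclass


@dataclass
class AvailabilityWindow:
    """Request counts observed over one SLO window (hypothetical structure)."""
    total_requests: int
    failed_requests: int


def availability_sli(window: AvailabilityWindow) -> float:
    """SLI: fraction of requests in the window that were served successfully."""
    if window.total_requests == 0:
        return 1.0  # assumption: no traffic counts as fully available
    return 1 - window.failed_requests / window.total_requests


def error_budget_remaining(window: AvailabilityWindow, slo_target: float) -> float:
    """Fraction of the error budget still unspent (negative once the SLO is breached)."""
    if not 0 < slo_target < 1:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if window.total_requests == 0:
        return 1.0  # nothing served, so nothing spent
    # The error budget is the number of failures the SLO tolerates in this window.
    allowed_failures = (1 - slo_target) * window.total_requests
    return 1 - window.failed_requests / allowed_failures
```

For example, with 1,000,000 requests, 250 failures, and a 99.9% target, the SLO tolerates about 1,000 failures, so roughly 75% of the error budget remains. A team could use a figure like this to decide whether to keep shipping features or to pause and invest in reliability work.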

  • Deploys, configures, and monitors code, and manages the availability, latency, change management, emergency response, and capacity management of services in production.
  • Helps the development and operations teams establish service-level indicators (SLIs), service-level objectives (SLOs), and error budgets.
  • Automates tasks such as log analysis, performance tuning, patch application, testing of production settings, incident response, and post-mortem analysis to increase efficiency and decrease risk.
  • Supports system design consulting, platform management, and capacity planning.
  • Debugs production issues across services and levels of the technology stack.
  • Improves service health visibility by recording metrics, logs, and traces across all services in order to pinpoint the causes of an incident.
  • Computes the cost of SLA breaches and assists management in calculating the impact of system reliability. Helps development and operations teams understand the cost of downtime.
  • Fosters a culture aligned to BMO's purpose, values, and strategy, and role-models BMO values and behaviours in all that they do.
  • Ensures alignment between values and behaviour that fosters diversity and inclusion.
  • Regularly connects work to BMO’s purpose, sets inspirational goals, defines clear expected outcomes, and ensures clear accountability for follow-through.
  • Builds interdependent teams that collaborate across functional and operating groups to create the highest value for all stakeholders.
  • Attracts, retains, and enables the career development of top talent.
  • Improves team performance, recognizes and rewards performance, coaches employees, supports their development, and manages poor performance.
  • Operates at a group/enterprise-wide level and serves as a specialist resource to senior leaders and stakeholders.
  • Applies expertise and thinks creatively to address unique or ambiguous situations and to find solutions to problems that can be complex and non-routine.
  • Implements changes in response to shifting trends.
  • Broader work or accountabilities may be assigned as needed.

Qualifications:

Intermediate level of proficiency:

  • DevOps.
  • Cybersecurity and privacy concepts, principles and solutions.
  • Emotional agility.

Advanced level of proficiency:

  • IT Infrastructure Library (ITIL).
  • Robotic Process Automation.
  • Cloud Computing.
  • Configuration Management.
  • Container Orchestration.
  • System Design and Implementation.
  • Incident management.
  • Learning Agility.
  • Building and managing relationships.
  • API Management.
  • Automation and Automation Pipelines.
  • Automated Testing.
  • Quality Assurance and Control.
  • Verbal & written communication skills.
  • Analytical and problem solving skills.
  • Collaboration & team skills; with a focus on cross-group collaboration.
  • Able to manage ambiguity.
  • Data driven decision making.
  • Typically 7+ years of relevant experience and post-secondary degree in related field of study or an equivalent combination of education and experience.
  • Seasoned professional with a combination of education, experience and industry knowledge.

Salary:

$92,400.00 - $171,600.00

Pay Type:

Salaried

The above represents BMO Financial Group's pay range and type.

Salaries will vary based on factors such as location, skills, experience, education, and qualifications for the role, and may include a commission structure. Salaries for part-time roles will be pro-rated based on the number of hours regularly worked. For commission roles, the salary listed above represents BMO Financial Group's target for the first year in the role.

The total compensation offered by BMO will vary based on the pay type associated with the position and may include performance-based incentives, discretionary bonuses, and other perks and rewards. BMO also offers health insurance, tuition reimbursement, accident and life insurance, and retirement savings plans. To learn more about our benefits, visit: https://jobs.bmo.com/ca/fr/R%C3%A9mun%C3%A9ration-globale

About Us

At BMO, we are driven by a shared purpose: Boldly Grow the Good in business and life. This purpose calls on us to create lasting, positive change for our customers, our communities, and our people. By working together, innovating, and pushing boundaries, we transform lives and businesses and power economic growth around the world.

As a member of the BMO team, you are valued, respected, and heard, and you have more ways to grow and make an impact. We strive to help you make an impact from day one, for yourself and our customers. We will give you the tools and resources you need to reach new milestones as you help our customers reach theirs. Through in-depth training and coaching, manager support, and networking opportunities, we will help you gain valuable experience and broaden your skill set.

To find out more, visit us at https://jobs.bmo.com/ca/fr.

BMO is committed to an inclusive, equitable, and accessible workplace. We learn from our differences and draw strength from our people and their different perspectives. Accommodations are available on request for candidates taking part in all aspects of the selection process. To request accommodation, please contact your recruiter.

Note to Recruiters: BMO does not accept unsolicited resumes from any source other than directly from the candidate. Any unsolicited resume sent to BMO, directly or indirectly, will be considered BMO property. BMO will not pay a fee for any placement resulting from the receipt of an unsolicited resume. A recruiting agency must first have a valid, written, and fully executed service agreement before submitting resumes.

