This recruiter is online.

This is your chance to shine!

Apply Now

Senior SRE Engineer with Dynatrace Experience to Accelerate Observability Implementation for Major Banking Client

Toronto, ON
  • Number of positions available : 1

  • To be discussed
  • Contract job

  • Starting date : 1 position to fill as soon as possible

Senior Site Reliability Engineer (SRE) - Dynatrace SME

Location: Toronto (Hybrid)

Duration: 6 months, with strong possibility of extension

Start Date: ASAP


Overview

Our Major Banking Client is seeking two Senior Site Reliability Engineers (SREs) with deep, hands-on Dynatrace expertise to accelerate their enterprise observability transformation. These are SME-level roles responsible for designing, orchestrating, and optimizing the full Dynatrace platform to enhance visibility, resilience, and reliability across complex banking systems.


Key Responsibilities
  • Lead the end-to-end orchestration of the Dynatrace platform, including Infrastructure, Synthetic Monitoring, Operating Systems, Databases, and Incident Management.
  • Integrate Dynatrace with ServiceNow, ensuring seamless ticketing, event correlation, and automated incident workflows.
  • Leverage Dynatrace Davis AI to enable predictive insights, automated root cause analysis, and self-healing capabilities.
  • Define and implement Dynatrace Site Reliability Guardian (SRG) including objectives, workflows, and integrations with JIRA and ServiceNow.
  • Embed SRG checkpoints within CI/CD pipelines to support go/no-go deployment decisions.
  • Utilize the Dynatrace WCCS (Well Calibrated Customer Success) framework to achieve true end-to-end observability from a business-centric lens.
  • Apply SRE best practices and golden signals across application, web, and database tiers, configuring thresholds and alerts aligned with SLOs, SLIs, and SLAs.
  • Develop and optimize Dynatrace Query Language (DQL) scripts for advanced monitoring, dashboards, and analytics.
  • Collaborate with application, infrastructure, and DevOps teams to mature the bank’s observability strategy and governance model.
  • Create technical documentation, runbooks, and operational standards to support ongoing reliability initiatives.


Must Have Skills & Experience
  • 7+ years of progressive experience in Site Reliability Engineering within large enterprise or financial environments.
  • 3+ years of deep, hands-on experience with Dynatrace, including deployment, configuration, AI enablement, SRG, and WCCS frameworks.
  • Proven experience integrating Dynatrace with ServiceNow, JIRA, and CI/CD pipelines.
  • Expertise with Dynatrace DQL, alerting policies, and advanced dashboard configuration.
  • Solid understanding of SRE principles, including golden signals, error budgets, and post-incident reviews.
  • Experience working in hybrid or multi-cloud environments (AWS, Azure, GCP).
  • Proficiency in Infrastructure as Code (Terraform, Ansible) and scripting languages (Python, Bash, PowerShell).
  • Strong communication and documentation skills, capable of mentoring peers and influencing cross-functional teams.


Nice to Have
  • Dynatrace Professional or Master Certification
  • Hands-on experience with AIOps, Kubernetes, or microservices observability
  • Exposure to GitOps, Splunk, AppDynamics, or DataDog
  • Previous participation in large-scale observability transformation programs

Apply

Requirements

Level of education

undetermined

Work experience (years)

undetermined

Written languages

undetermined

Spoken languages

undetermined