Ce recruteur est en ligne!

Voilà ta chance d'être vu en premier!

Postuler maintenant

Senior SRE Engineer with Dynatrace Experience to Accelerate Observability Implementation for Major Banking Client

Toronto, ON
  • Nombre de poste(s) à combler : 1

  • À discuter
  • Emploi Contrat

  • Date d'entrée en fonction : 1 poste à combler dès que possible

Senior Site Reliability Engineer (SRE) - Dynatrace SME

Location: Toronto (Hybrid)

Duration: 6 months, with strong possibility of extension

Start Date: ASAP


Overview

Our Major Banking Client is seeking two Senior Site Reliability Engineers (SREs) with deep, hands-on Dynatrace expertise to accelerate their enterprise observability transformation. These are SME-level roles responsible for designing, orchestrating, and optimizing the full Dynatrace platform to enhance visibility, resilience, and reliability across complex banking systems.


Key Responsibilities
  • Lead the end-to-end orchestration of the Dynatrace platform, including Infrastructure, Synthetic Monitoring, Operating Systems, Databases, and Incident Management.
  • Integrate Dynatrace with ServiceNow, ensuring seamless ticketing, event correlation, and automated incident workflows.
  • Leverage Dynatrace Davis AI to enable predictive insights, automated root cause analysis, and self-healing capabilities.
  • Define and implement Dynatrace Site Reliability Guardian (SRG) including objectives, workflows, and integrations with JIRA and ServiceNow.
  • Embed SRG checkpoints within CI/CD pipelines to support go/no-go deployment decisions.
  • Utilize the Dynatrace WCCS (Well Calibrated Customer Success) framework to achieve true end-to-end observability from a business-centric lens.
  • Apply SRE best practices and golden signals across application, web, and database tiers, configuring thresholds and alerts aligned with SLOs, SLIs, and SLAs.
  • Develop and optimize Dynatrace Query Language (DQL) scripts for advanced monitoring, dashboards, and analytics.
  • Collaborate with application, infrastructure, and DevOps teams to mature the bank’s observability strategy and governance model.
  • Create technical documentation, runbooks, and operational standards to support ongoing reliability initiatives.


Must Have Skills & Experience
  • 7+ years of progressive experience in Site Reliability Engineering within large enterprise or financial environments.
  • 3+ years of deep, hands-on experience with Dynatrace, including deployment, configuration, AI enablement, SRG, and WCCS frameworks.
  • Proven experience integrating Dynatrace with ServiceNow, JIRA, and CI/CD pipelines.
  • Expertise with Dynatrace DQL, alerting policies, and advanced dashboard configuration.
  • Solid understanding of SRE principles, including golden signals, error budgets, and post-incident reviews.
  • Experience working in hybrid or multi-cloud environments (AWS, Azure, GCP).
  • Proficiency in Infrastructure as Code (Terraform, Ansible) and scripting languages (Python, Bash, PowerShell).
  • Strong communication and documentation skills, capable of mentoring peers and influencing cross-functional teams.


Nice to Have
  • Dynatrace Professional or Master Certification
  • Hands-on experience with AIOps, Kubernetes, or microservices observability
  • Exposure to GitOps, Splunk, AppDynamics, or DataDog
  • Previous participation in large-scale observability transformation programs

Apply

Exigences

Niveau d'études

non déterminé

Années d'expérience

non déterminé

Langues écrites

non déterminé

Langues parlées

non déterminé