Senior Cloud Operations Engineer to manage reliability of critical infrastructure platforms (GCP or Azure) with one of our major banking clients- 38265/382
S.i. Systems
Toronto, ON-
Number of positions available : 1
- Salary To be discussed
-
Contract job
- Published on September 15th, 2025
-
Starting date : 1 position to fill as soon as possible
Description
Senior Cloud Operations Engineer to manage reliability of critical infrastructure platforms (GCP or Azure) with one of our major banking clients- 38265/38274
Location Address: Toronto - hybrid - onsite 2x/week min. - Subject to change: *3-4 days onsite may be required based on business needs*
Contract Duration: ASAP - 01/30/2026 (Possibility of extension & conversion to FTE)
Schedule Hours: 9am-5pm Monday-Friday; standard 37.5 hrs/week (Possible OT)
Story Behind the Need
- Business group: Ops - Public Cloud TechOps DevOps SRE
- Project: Agile Cloud Enablement
Combining the competencies of DevOps, Systems Administration, and Cloud Engineering, the role of Cloud Operations Engineer provides the opportunity to combine your technical ability, strategic thinking and detail-oriented execution in a fast-paced, dynamic environment. You will join a team with the purpose of constantly improving the reliability of our systems through continuous improvements to running infrastructure. You will work with application teams to deliver continuous improvements to applications and support the transformation of our approach to both operations and development. You will work with the Cloud Engineering team to design and implement tools and processes that monitor and respond to the state of our systems.
Typical Day in Role:
• Manage reliability of critical infrastructure platforms on our Public Cloud Platforms (Google and Azure)
• Improve and maintain site availability, scalability, service and system performance and reliability
• Investigate system errors and problems, bottleneck analysis of the system at scale, etc.
• Provide solutions for performance management, disaster recovery, monitoring and access management
• Participate in solution design sessions
• Participate in planning and retrospective sessions, attending stand-ups, etc.
• Build and operate highly available and scalable software and infrastructure.
• Supporting application teams on the use of the platform including providing guidance on design patterns, best practices, and security considerations.
• Our teams are flexible and fast - you will be asked to provide peer review and quality control on a daily basis.
• Be part of on call rotation
Candidate Requirements/Must Have Skills:
1. 10+ years of experience in an Operations role
2. 3+ years of experience supporting Kubernetes (GKE & AKS) clusters in GCP and/or Azure
3. 3+ years of experience working with pipelines, and security tools such as Aquasec
4. 3+ Years of demonstrated experience with Terraform and Github
5. 2+ years of experience developing in any of the following languages (Java, Javascript, Python, Ruby, Go, C#)
Nice-To-Have Skills:
• Terraform Cloud Experience
• Strong knowledge of Agile & Lean methodologies for requirements / design methodology
• Experience supporting containers, container orchestration platforms.
• Knowledge of software design patterns, infrastructure architecture, DevOps, or security considerations.
• Experience designing and implementing tasks in Continuous Integration systems (Jenkins, Travis, CircleCI, etc.).
• Understanding of software release process (environments, binary repositories, CI/CD).
• Experience with Tanzu (PCF), Pipelines, and other cloud development platforms
• Knowledge of network engineering - DNS, TCP/IP, Load balancing, DMZ, routing protocols, etc.
• Knowledge of Cloud security - Cryptographic key management, certificate infrastructures/PKI, secure coding practices, etc.
• System Administration experience and/or Enterprise Operations skills
• Fluency in Spanish
Education:
• Post-secondary degree in a technical field such as computer science, computer engineering or related IT field is an asset.
Best VS. Average Candidate:
• A self-starter with a strong sense of personal accountability and team responsibility.
• Detail oriented, analytical and capable of investigating complex / technical issues and provide alternative solutions, project & production methodologies.
• Strong Experience in supporting containers, container orchestration platforms.
•Experience designing and implementing tasks in Continuous Integration systems (Jenkins, Travis, CircleCI, etc.).
Candidate Review & Selection
• 1st round MS Teams video interview - Hiring Manager (30 minutes)
• 2nd round MS Teams video/in-person interview - Panel with Cloud Engineers (1 hour)
• The interviews with assess both soft skills (situational questions) and technical skills (discuss how the candidate has utilized specific technologies)
Hiring Manager’s availability to interview: ASAP
Requirements
undetermined
undetermined
undetermined
undetermined
Other S.i. Systems's offers that may interest you