Skip to content
mimi

Senior Site Reliability Engineer

Manulife

Waterloo · Hybrid Full-time Senior CA$113k – CA$163k/yr Today

About the role

About

As a Senior Site Reliability Engineer, you will play a pivotal role in designing, deploying, and operating scalable, secure, and highly reliable cloud‑based platforms. You will leverage your deep expertise in Infrastructure‑as‑Code, Azure cloud services, and DevOps practices to support data, analytics, and AI‑enabled workloads. This role contributes directly to advancing platform reliability, automation‑first delivery, and operational excellence. You will partner closely with engineering, security, risk, and architecture teams to ensure platforms meet enterprise standards and business needs.

Responsibilities

  • Design, build, and manage cloud infrastructure using Terraform, cloud‑native technologies, and automation‑first principles to deliver scalable and resilient platforms.
  • Develop, maintain, and optimize CI/CD pipelines and developer toolchains using Jenkins, GitOps, and related technologies.
  • Operate and support Azure‑based platforms, including Databricks and APIM/APIOps, ensuring performance, security, and regulatory compliance.
  • Monitor, troubleshoot, and resolve production issues and persistent platform problems to minimize downtime and operational impact.
  • Collaborate with cross‑functional stakeholders (engineering, security, risk, architecture) to deliver reusable platform patterns, reference architectures, and continuous improvements.

Required Qualifications

  • 7+ years of experience with Infrastructure‑as‑Code using Terraform.
  • 7+ years of experience deploying and managing Azure cloud infrastructure.
  • 5+ years of experience with CI/CD pipelines, Jenkins, GitOps, and Groovy scripting.
  • 3+ years of experience supporting Databricks or similar data/analytics platforms.
  • 5+ years of experience with API Management and APIOps solutions (GWAM).

Preferred Qualifications

  • Experience supporting AI/GenAI platforms and AI operations workloads.
  • Experience working on data and analytics platforms at scale.
  • Platform automation experience using Python or other scripting languages.
  • Strong communication and technical documentation skills.
  • Experience optimizing performance across CPU, GPU, or accelerator‑based workloads.

Benefits

  • Empowerment to learn and grow the career you want.
  • Flexible environment where well‑being and inclusion are prioritized.
  • Support as part of a global team to shape the future you want to see.
  • Eligible employees receive customizable benefits including health, dental, mental health, vision, short‑ and long‑term disability, life and AD&D insurance, adoption/surrogacy and wellness benefits, and employee/family assistance plans.
  • Retirement savings plans (including pension and a global share ownership plan with employer matching contributions) and financial education and counseling resources.
  • Generous paid time off program in Canada including holidays, vacation, personal, and sick days, plus statutory leaves of absence.

Requirements

  • 7+ years of experience with Infrastructure-as-Code using Terraform.
  • 7+ years of experience deploying and managing Azure cloud infrastructure.
  • 5+ years of experience with CI/CD pipelines, Jenkins, GitOps, and Groovy scripting.
  • 3+ years of experience supporting Databricks or similar data/analytics platforms.
  • 5+ years of experience with API Management and APIOps solutions (GWAM).

Responsibilities

  • Design, build, and manage cloud infrastructure using Terraform, cloud-native technologies, and automation-first principles to deliver scalable and resilient platforms.
  • Develop, maintain, and optimize CI/CD pipelines and developer toolchains using Jenkins, GitOps, and related technologies.
  • Operate and support Azure-based platforms, including Databricks and APIM/APIOps, ensuring performance, security, and regulatory compliance.
  • Monitor, troubleshoot, and resolve production issues and persistent platform problems to minimize downtime and operational impact.
  • Collaborate with cross-functional stakeholders (engineering, security, risk, architecture) to deliver reusable platform patterns, reference architectures, and continuous improvements.

Benefits

health insurancedental insurancemental health insurancevision insuranceshort-term disability insurancelong-term disability insurancelife insuranceAD&D insurance coverageadoption/surrogacy benefitswellness benefitsemployee/family assistance plansretirement savings planspension planglobal share ownership planpaid time offholidaysvacation dayspersonal dayssick daysleaves of absence

Skills

API ManagementAPIOpsAzureCI/CDDatabricksDevOpsGitOpsGroovyInfrastructure-as-CodeJenkinsTerraform

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free