SI
Senior Manager – Site Reliability Engineering
Siri InfoSolutions, Inc.
Toronto · Hybrid Contract Senior 2w ago
About the role
Job Description
- Define and own the SRE transformation roadmap aligned with business objectives and platform priorities.
- Demonstrate strong understanding of SRE principles including SLOs, SLIs, error budgets, toil management, and observability.
- Lead SRE maturity assessments across Observability, Incident Management, Problem Management, Shift Left, and Operational Readiness.
- Establish and govern SRE operating models, collaborating with Development, Operations, Security, and Architecture teams.
- Drive adoption of SLOs, SLIs, SLAs, and error budgets across critical services.
- Act as the primary interface between engineering teams, service management, leadership, and external partners.
- Manage multi-workstream SRE programs, ensuring delivery against scope, timelines, risks, and dependencies.
- Prepare executive-level status updates, dashboards, and steering committee communications.
- Oversee improvements in incident reduction, MTTR, availability, and resiliency.
- Ensure blameless postmortems, root cause analysis (RCA), and action tracking are consistently executed.
- Govern automation initiatives including runbooks, self-healing, alert tuning, and capacity management.
- Track and report reliability KPIs, toil reduction metrics, and automation ROI.
- Partner with platform teams on observability and SRE tooling strategy including monitoring, logging, tracing, and APM.
- Ensure effective integration of ServiceNow, alerting platforms, and SRE tools.
- Support training, enablement, and onboarding of teams into SRE practices and mindset.
Qualifications
- 10+ years of experience in IT delivery, operations, reliability engineering, DevOps, or platform engineering roles.
- 5+ years of experience managing large-scale programs or transformations related to SRE, DevOps, ITIL, or Cloud Operations.
- Experience with Mainframe and other legacy technologies is highly preferred.
- Familiarity with distributed systems, cloud platforms, and enterprise application environments.
- Proven experience driving cross-functional transformation across engineering and operations teams.
Skills
APMCloud OperationsCloud platformsDevOpsITILMainframeSREServiceNow
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free