Lead Platform Reliability Engineer
Manulife Financial
About the role
About
Enhance the reliability of cutting-edge AI solutions as a Lead Platform Reliability Engineer. Engage in performance tracking and incident management while ensuring developer-friendly systems in a hybrid role.
Your commitment to operational excellence will shine through defining SLOs and reducing MTTR. With a background in DevOps and 5-8 years of relevant experience, you will drive automation, optimize cloud resources, and ensure compliance with security protocols. Collaborate with engineering teams to deliver outstanding platform reliability.
Responsibilities
- Track performance metrics and operational budgets
- Maintain observability through metrics and logging
- Lead incident resolution and postmortem activities
- Develop automated provisioning and deployment tools
- Implement policies for infrastructure as code
Requirements
- 5-8 years of DevOps or Platform Engineering experience
- Familiarity with cloud-native capabilities like Azure
- Expertise in large-scale systems management
- Knowledge of container deployment methodologies
- Understanding of data governance in AI contexts
Shape the future of AI engineering through robust platform reliability and innovative solutions in a collaborative atmosphere.
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free