Lead Site Reliability Engineer in Markham
Artech LLC
About the role
Join our Markham team as a Site Reliability Engineering Lead focused on enterprise observability with Dynatrace. As a key member of our IT operations, you will reduce incidents and strengthen system resilience. The role draws on your expertise in infrastructure and reliability engineering, ideally backed by five years of direct experience. Your hands-on Dynatrace experience will drive the development of effective monitoring across various platforms, ensuring performance and reliability. This position suits professionals who thrive in high-impact environments and want to directly influence our operational success.

Key Responsibilities:
• Own enterprise observability across cloud and on-prem environments
• Design dashboards, alerts, and service topologies
• Apply SRE principles to enhance system resilience
• Lead post-incident reviews to identify and fix systemic issues
• Define observability standards for efficient team usage

Requirements:
• 5 years in infrastructure, platform, or reliability engineering
• Practical experience with SRE concepts and Dynatrace
• Strong understanding of distributed systems
• Experience in high-impact incident environments
• Prior work in the client's industry

Bring your expertise in observability and reliability to lead our team in Markham.