Skip to content
mimi

Kubernetes SRE Observability Expert

Astra North Infoteck Inc.

Winnipeg · On-site Full-time 3d ago

About the role

Join as a Kubernetes SRE Observability Expert responsible for enhancing reliability and observability using Dynatrace. Drive initiatives that improve monitoring and incident management across platforms.

This role focuses on developing the enterprise's reliability capabilities while leading the implementation of observability standards. You will work cross-functionally to ensure comprehensive monitoring and utilize SRE principles to optimize system performance. Your hands-on experience with Dynatrace will play a crucial role in this endeavor.

Key Responsibilities: • Own enterprise-wide observability with Dynatrace integration • Collaborate across teams for effective monitoring solutions • Design alert systems and dashboards that reflect impact • Implement SRE principles to reduce failures and enhance resilience • Lead post-incident analysis focusing on root cause issues

Requirements: • Minimum of 5 years in operations or reliability engineering • Expertise in implementing Dynatrace within environments • Strong knowledge of cloud and hybrid systems • Practical application of SRE concepts • Comfortable managing high-stakes incidents in production

Utilize your expertise in Kubernetes and Dynatrace to elevate system reliability and performance. #J-18808-Ljbffr

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free