Senior Site Reliability Engineer

Citi

Canada · On-site · Full-time · Senior · Posted 2d ago

About the role

Take charge of platform oversight and reliability as a Senior Site Reliability Engineer. Elevate service standards across AI applications and developer tools while managing a skilled team.

We are looking for a leader with 6+ years of experience in support and technical roles. In this role, you will oversee the incident resolution process and collaborate with development teams to drive improvements in application stability and supportability. Your insights will be key to enhancing operational workstreams and performance tuning.

Build a strong platform by fostering team collaboration and innovation in application support, with the goal of long-term stability and performance.

Responsibilities

  • Manage team operations to ensure platform stability
  • Enhance incident management and knowledge-sharing
  • Coordinate vendor and offshore service engagements
  • Support development of onboarding guidelines
  • Engage in capacity and latency management

Requirements

  • 6+ years in a technical support leadership role
  • Proven track record in process improvement
  • Strong ability to communicate technical concepts
  • Familiarity with CI/CD, Kubernetes, and observability tools
  • Experience with Java, Python, or similar languages preferred

Skills

CI/CD, Docker, Java, Kubernetes, Observability tools, Python
