Skip to content
mimi

Site Reliability Engineer

Yelp

Remote · Canada Full-time 1mo ago

About the role

About

Join a fully remote team as a Site Reliability Engineer. Leverage your skills in scalable systems, automation, and problem-solving while supporting a platform that serves over 100 million users monthly.

As a core member of the engineering team, you’ll tackle the challenges of building and maintaining self-healing infrastructures. You'll integrate monitoring tools, scale Kubernetes clusters, and ensure the reliability of key datastores. Collaborating across teams will empower you to deliver effective solutions that improve user experiences at an immense scale.

Key Responsibilities:

  • Support new features and services across teams
  • Monitor platform stability and performance
  • Scale AWS-based infrastructure and Kubernetes clusters
  • Troubleshoot issues with industry-leading tools
  • Automate processes using various technologies

Requirements:

  • Mastery of Linux and understanding OS behaviors
  • Proficient in modern programming languages like Python and Go
  • Experience with cloud platforms like AWS
  • Familiarity with container orchestration tools
  • Self-motivated with a drive for improvement

Embrace a unique opportunity to enhance reliability while embracing creative solutions and collaboration.

Skills

AWSGoKubernetesLinuxPython

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free