Principal Site Reliability Engineer for AWS EKS Infrastructure

Parallel Domain

Vancouver · On-site · Full-time · Lead · 3d ago

About the role

Join as a Principal Site Reliability Engineer focused on the reliability and scalability of AWS cloud infrastructure. You will apply your expertise to managing simulation workloads for autonomous vehicle development.

This hands-on role invites you to be the primary architect of a multi-region AWS/EKS platform. You will steer ongoing improvements and security measures while working collaboratively across various engineering teams, ensuring top-notch service availability and customer satisfaction.

Key Responsibilities:

  • Oversee AWS infrastructure and optimize performance
  • Manage EKS operations including autoscaling and health checks
  • Drive GitOps practices for effective application deployment
  • Handle complex networking challenges including VPC design
  • Implement proactive SLO measurement and incident management

Requirements:

  • 5+ years in SRE, DevOps, or similar technical roles
  • Experience with infrastructure-as-code tools; proficient in Terraform
  • Extensive AWS expertise including EKS and network services
  • In-depth Kubernetes management capabilities required
  • Knowledge of monitoring tools; experience with Grafana desirable

Elevate cloud reliability and security as you drive infrastructure projects, ensuring seamless service delivery for high-performance autonomous vehicle solutions.

Skills

AWS · EKS · Grafana · GitOps · Kubernetes · Terraform
