Skip to content
mimi

Principal Site Reliability Engineer

Parallel Domain

Vancouver · On-site Full-time Lead 1w ago

About the role

About the Role

Join as a Principal Site Reliability Engineer, focused on the reliability and scalability of AWS Cloud infrastructure. Contribute highly to managing simulation workloads for autonomous vehicle development with expertise. This hands-on role invites you to be the primary architect of a multi-region AWS/EKS platform. You will steer ongoing improvements and security measures while working collaboratively across various engineering teams, ensuring top-notch service availability and customer satisfaction.

Key Responsibilities

  • Oversee AWS infrastructure and optimize performance
  • Manage EKS operations including autoscaling and health checks
  • Drive GitOps practices for effective application deployment
  • Handle complex networking challenges including VPC design
  • Implement proactive SLO measurement and incident management

Requirements

  • 5+ years in SRE, DevOps or similar technical roles
  • Experience with infrastructure-as-code tools; proficient in Terraform
  • Extensive AWS expertise including EKS and network services
  • In-depth Kubernetes management capabilities required
  • Knowledge of monitoring tools; experience with Grafana desirable

Elevate cloud reliability and security as you drive infrastructure projects, ensuring seamless service delivery for high-performance autonomous vehicle solutions.

Skills

AWSCloudEKSGrafanaGitOpsKubernetesSRETerraformVPC

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free