Principal Site Reliability Engineer for AWS EKS Infrastructure
Parallel Domain
About the role
About
Join as a Principal Site Reliability Engineer, focused on the reliability and scalability of AWS Cloud infrastructure. Contribute highly to managing simulation workloads for autonomous vehicle development with expertise.
This hands-on role invites you to be the primary architect of a multi-region AWS/EKS platform. You will steer ongoing improvements and security measures while working collaboratively across various engineering teams, ensuring top-notch service availability and customer satisfaction.
Key Responsibilities:
- Oversee AWS infrastructure and optimize performance
- Manage EKS operations including autoscaling and health checks
- Drive Git Ops practices for effective application deployment
- Handle complex networking challenges including VPC design
- Implement proactive SLO measurement and incident management
Requirements:
- 5+ years in SRE, Dev Ops or similar technical roles
- Experience with infrastructure-as-code tools; proficient in Terraform
- Extensive AWS expertise including EKS and network services
- In-depth Kubernetes management capabilities required
- Knowledge of monitoring tools; experience with Grafana desirable
Elevate cloud reliability and security as you drive infrastructure projects, ensuring seamless service delivery for high-performance autonomous vehicle solutions.
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free