Senior Site Reliability Engineer
Andiamo
About the role
Elevate system reliability with our Platform team as a Senior Site Reliability Engineer. Contribute your expertise in AWS, Terraform, and modern DevOps practices to ensure our systems' stability and scalability.
This senior role focuses on system optimization and continuous improvement. Drawing on more than 10 years of Site Reliability Engineering experience, you will manage AWS infrastructure using Infrastructure as Code tools like Terraform. Your proficiency with Docker, Kubernetes, and CI systems will help keep our services highly available, and you will support developers by resolving issues efficiently.
Responsibilities
- Manage AWS infrastructure using Terraform and IaC practices
- Enhance the reliability of Continuous Integration systems
- Troubleshoot issues across local, staging, and production environments
- Establish best practices for monitoring and alerting systems
- Define actionable alerts for proactive incident management
Requirements
- 10+ years as a Site Reliability Engineer or equivalent
- Expertise in Terraform, Docker, and Kubernetes
- Experience with CI tools like CircleCI or Jenkins
- Strong troubleshooting and debugging skills
- Experience using AI in recent projects
As a Senior Site Reliability Engineer, you will support and optimize our systems to ensure robust performance.