Site Reliability Engineer (AWS DevOps)
RARR Technologies
India · On-site · Full-time · 1w ago
About the role
Key Responsibilities
- Strong proficiency in Python programming, with experience in automation and scripting.
- Expertise in DevOps and Site Reliability Engineering (SRE) practices.
- Hands‑on experience with AWS services (EC2, S3, Lambda, RDS, etc.).
- Proficient in managing infrastructure as code (IaC) using tools like Terraform or CloudFormation.
- Experience with containerization technologies like Docker and orchestration tools like Kubernetes.
- Strong knowledge of CI/CD pipelines and automation tools (Jenkins, GitLab CI, etc.).
- Implement and manage monitoring, alerting, and logging solutions in AWS using CloudWatch, Prometheus, or similar.
- Optimize and scale cloud infrastructure to meet performance and cost requirements.
- Troubleshoot and resolve issues in production environments to ensure high availability and reliability.
- Collaborate with development teams to enhance the software delivery lifecycle and improve system reliability.
Please note that the above responsibilities are essential for this role.
Requirements
- Strong proficiency in Python programming, with experience in automation and scripting.
- Expertise in DevOps and Site Reliability Engineering (SRE) practices.
- Hands-on experience with AWS services (EC2, S3, Lambda, RDS, etc.).
- Proficient in managing infrastructure as code (IaC) using tools like Terraform or CloudFormation.
- Experience with containerization technologies like Docker and orchestration tools like Kubernetes.
- Strong knowledge of CI/CD pipelines and automation tools (Jenkins, GitLab CI, etc.).
Responsibilities
- Implement and manage monitoring, alerting, and logging solutions in AWS using CloudWatch, Prometheus, or similar.
- Optimize and scale cloud infrastructure to meet performance and cost requirements.
- Troubleshoot and resolve issues in production environments to ensure high availability and reliability.
- Collaborate with development teams to enhance the software delivery lifecycle and improve system reliability.
Skills
AWS, CI/CD, CloudFormation, CloudWatch, Docker, EC2, GitLab CI, Jenkins, Kubernetes, Lambda, Prometheus, Python, RDS, S3, Terraform