Site Reliability Engineer (AWS DevOps)

RARR Technologies

India · On-site · Full-time · 1w ago

About the role

Key Responsibilities

  • Strong proficiency in Python programming, with experience in automation and scripting.
  • Expertise in DevOps and Site Reliability Engineering (SRE) practices.
  • Hands‑on experience with AWS services (EC2, S3, Lambda, RDS, etc.).
  • Proficient in managing infrastructure as code (IaC) using tools like Terraform or CloudFormation.
  • Experience with containerization technologies like Docker and orchestration tools like Kubernetes.
  • Strong knowledge of CI/CD pipelines and automation tools (Jenkins, GitLab CI, etc.).
  • Implement and manage monitoring, alerting, and logging solutions in AWS using CloudWatch, Prometheus, or similar.
  • Optimize and scale cloud infrastructure to meet performance and cost requirements.
  • Troubleshoot and resolve issues in production environments to ensure high availability and reliability.
  • Collaborate with development teams to enhance the software delivery lifecycle and improve system reliability.

Please note that the above responsibilities are essential for this role.

Requirements

  • Strong proficiency in Python programming, with experience in automation and scripting.
  • Expertise in DevOps and Site Reliability Engineering (SRE) practices.
  • Hands-on experience with AWS services (EC2, S3, Lambda, RDS, etc.).
  • Proficient in managing infrastructure as code (IaC) using tools like Terraform or CloudFormation.
  • Experience with containerization technologies like Docker and orchestration tools like Kubernetes.
  • Strong knowledge of CI/CD pipelines and automation tools (Jenkins, GitLab CI, etc.).

Responsibilities

  • Implement and manage monitoring, alerting, and logging solutions in AWS using CloudWatch, Prometheus, or similar.
  • Optimize and scale cloud infrastructure to meet performance and cost requirements.
  • Troubleshoot and resolve issues in production environments to ensure high availability and reliability.
  • Collaborate with development teams to enhance the software delivery lifecycle and improve system reliability.

Skills

AWS, CI/CD, CloudFormation, CloudWatch, Docker, EC2, GitLab CI, Jenkins, Kubernetes, Lambda, Prometheus, Python, RDS, S3, Terraform
