Skip to content
mimi

Site Reliability Engineer III - DevOps Engineer

JPMC Candidate Experience page

Seattle · On-site Full-time Senior 2w ago

About the role

About

There’s nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.

As a Site Reliability Engineer III - DevOps Engineer at JPMorgan Chase within the Commercial and Investment Bank, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.

Responsibilities

  • Design, implement, and manage scalable, reliable, and secure cloud infrastructure on AWS or Cloud Foundry or Azure
  • Deploy, manage, and scale containerized applications using Kubernetes (EKS) and ECS.
  • Develop and maintain infrastructure as code using Terraform to automate provisioning and configuration management.
  • Supports the adoption of site reliability engineering best practices within your team.
  • Implement logging and tracing using ELK Stack, Splunk, Dynatrace, and AWS CloudWatch.
  • Ensure disaster recovery strategies are in place using multi-region deployments, backups, and failover mechanisms.
  • Integrate security tools such as SonarQube, Snyk, Trivy, Aqua Security into Jenkins or AWS CodePipeline.
  • Collaborate with development teams to ensure smooth deployment and operation of applications.
  • Implement and manage CI/CD pipelines to streamline the software development lifecycle.
  • Troubleshoot and resolve infrastructure-related issues in a timely manner.
  • Participate in on-call rotations to provide 24/7 support for critical systems.
  • Continuously evaluate and implement modern technologies and tools to improve operational efficiency.

Qualifications

  • Bachelor's degree in computer science, Information Technology, or a related field, or equivalent practical experience.
  • 3 years of experience in Site Reliability Engineering, DevOps, or a related field.
  • Strong expertise in AWS services, including EC2, S3, RDS, VPC, IAM, and networking.
  • Hands‑on experience with Kubernetes (EKS) and ECS for container orchestration.
  • Proficiency in using Terraform for infrastructure as code.
  • Solid understanding of CI/CD concepts and tools such as Jenkins, GitLab CI, CircleCI, AWS CodePipeline, Spinnaker.
  • Experience with observability, monitoring, and logging tools like Prometheus, Grafana, ELK Stack, or CloudWatch.
  • Experience in Java language, Springboot framework or scripting languages such as Bash, Node.JS, Shell, Python
  • Excellent problem‑solving skills and attention to detail.
  • Strong communication and collaboration skills.

Preferred Qualifications

  • AWS Certified Solutions Architect or DevOps Engineer or similar certifications.
  • Strong problem‑solving skills and ability to troubleshoot complex CI/CD issues.
  • Experience with microservices architecture and serverless computing.
  • Familiarity with security best practices in cloud environments.

Requirements

  • Bachelor's degree in computer science, Information Technology, or a related field, or equivalent practical experience.
  • 3 years of experience in Site Reliability Engineering, DevOps, or a related field.
  • Strong expertise in AWS services, including EC2, S3, RDS, VPC, IAM, and networking.
  • Hands-on experience with Kubernetes (EKS) and ECS for container orchestration.
  • Proficiency in using Terraform for infrastructure as code.
  • Solid understanding of CI/CD concepts and tools such as Jenkins, GitLab CI, CircleCI, AWS CodePipeline, Spinnaker.
  • Experience with observability, monitoring, and logging tools like Prometheus, Grafana, ELK Stack, or CloudWatch.
  • Experience in Java language, Springboot framework or scripting languages such as Bash, Node.JS, Shell, Python
  • Excellent problem-solving skills and attention to detail.
  • Strong communication and collaboration skills.

Responsibilities

  • Design, implement, and manage scalable, reliable, and secure cloud infrastructure on AWS or Cloud Foundry or Azure
  • Deploy, manage, and scale containerized applications using Kubernetes (EKS) and ECS.
  • Develop and maintain infrastructure as code using Terraform to automate provisioning and configuration management.
  • Supports the adoption of site reliability engineering best practices within your team.
  • Implement logging and tracing using ELK Stack, Splunk, Dynatrace, and AWS CloudWatch.
  • Ensure disaster recovery strategies are in place using multi-region deployments, backups, and failover mechanisms.
  • Integrate security tools such as SonarQube, Snyk, Trivy, Aqua Security into Jenkins or AWS CodePipeline.
  • Collaborate with development teams to ensure smooth deployment and operation of applications.
  • Implement and manage CI/CD pipelines to streamline the software development lifecycle.
  • Troubleshoot and resolve infrastructure-related issues in a timely manner.
  • Participate in on-call rotations to provide 24/7 support for critical systems.
  • Continuously evaluate and implement modern technologies and tools to improve operational efficiency.

Skills

AWSAWS CloudWatchAWS CodePipelineBashCircleCICloud FoundryDockerECSEC2ELK StackGitLab CIGrafanaIAMJenkinsJavaKubernetesNode.JSPrometheusPythonS3ShellSonarQubeSplunkSpringbootTerraformTrivyVPC

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free