Skip to content
mimi

Lead Site Reliability Engineer

JPMC Candidate Experience page

New York · On-site Full-time Lead 5d ago

About the role

About

As a Site Reliability Engineering at JPMorgan Chase within the Enterprise technology, liquidity risk team, you are the non-functional requirement owner and champion for the applications in your remit. You are a key influencer in your team’s strategic planning, driving continual improvement in customer experience, resiliency, security, scalability, monitoring, instrumentation, and automation of the software in your area. You act in a blameless, data-driven manner and navigate difficult situations with composure and tact.

Responsibilities

  • Lead SRE practices that balance delivery speed, efficiency, and system stability
  • Partner with engineering peers and senior stakeholders to drive strong, shared outcomes
  • Scale SRE adoption across application and platform teams
  • Set reliability expectations and show progress through stability and reliability metrics
  • Run blameless, data-driven post-incident reviews and regular debriefs to turn lessons into improvements
  • Build a continuous-improvement culture by gathering feedback and improving the customer experience
  • Coach entry- to mid-level engineers and promote knowledge sharing through internal forums and communities

Required qualifications, capabilities, and skills

  • Formal training or certification in software engineering concepts plus 5+ years of applied experience
  • Advanced knowledge of SRE principles and a track record of implementing SRE across application and platform teams while avoiding common pitfalls
  • Experience leading technologists to manage and resolve complex technology issues at a firmwide level
  • Ability to influence team culture by championing innovation and driving change
  • Experience hiring, developing, and recognizing talent
  • Proficiency in at least one programming language (preferred: JavaScript, Go, Python)
  • Hands-on experience with CI/CD tools (e.g., Jenkins, GitLab, Terraform)
  • Experience with containers and orchestration (e.g., Docker, Kubernetes, ECS)
  • Strong troubleshooting skills across common networking technologies and issues
  • Working knowledge of modern service and integration patterns, including GraphQL fundamentals, event-driven architecture (Kafka or equivalent), and observability/telemetry with OpenTelemetry

Preferred qualifications, capabilities, and skills

  • Ability to code, troubleshoot, and demonstrate strong data fluency

Requirements

  • Formal training or certification in software engineering concepts plus 5+ years of applied experience
  • Advanced knowledge of SRE principles and a track record of implementing SRE across application and platform teams while avoiding common pitfalls
  • Experience leading technologists to manage and resolve complex technology issues at a firmwide level
  • Ability to influence team culture by championing innovation and driving change
  • Experience hiring, developing, and recognizing talent
  • Proficiency in at least one programming language (preferred: JavaScript, Go, Python)
  • Hands-on experience with CI/CD tools (e.g., Jenkins, GitLab, Terraform)
  • Experience with containers and orchestration (e.g., Docker, Kubernetes, ECS)
  • Strong troubleshooting skills across common networking technologies and issues
  • Working knowledge of modern service and integration patterns, including GraphQL fundamentals, event-driven architecture (Kafka or equivalent), and observability/telemetry with OpenTelemetry

Responsibilities

  • Lead SRE practices that balance delivery speed, efficiency, and system stability
  • Partner with engineering peers and senior stakeholders to drive strong, shared outcomes
  • Scale SRE adoption across application and platform teams
  • Set reliability expectations and show progress through stability and reliability metrics
  • Run blameless, data-driven post-incident reviews and regular debriefs to turn lessons into improvements
  • Build a continuous-improvement culture by gathering feedback and improving the customer experience
  • Coach entry- to mid-level engineers and promote knowledge sharing through internal forums and communities

Skills

CI/CDDockerECSevent-driven architectureGitLabGoGraphQLJenkinsJavaScriptKubernetesOpenTelemetryPythonSRETerraform

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free