Skip to content
mimi

Senior Site Reliability Engineer (Senior SRE)

Mondo

Remote · US Contract Senior $70 – $90/hr Today

About the role

About

The Senior Site Reliability Engineer is responsible for improving reliability, scalability, and operational excellence across distributed systems through automation, observability, and engineering best practices.

Responsibilities

  • Lead reliability and recovery planning for critical systems and services
  • Define and maintain SLIs, SLOs, and error budget practices
  • Drive incident response efforts and lead complex outage investigations
  • Conduct root cause analysis and implement corrective actions
  • Design and build automation to reduce operational toil
  • Improve CI/CD pipelines, deployment workflows, and rollback strategies
  • Develop and maintain observability tooling for metrics, logs, and tracing
  • Partner with release and change management teams on release readiness
  • Mentor engineers and promote operational excellence across teams
  • Influence system architecture to improve reliability and scalability

Requirements

Must-Haves:

  • Bachelor's degree in Computer Science, Engineering, or related field
  • 6–10 years of experience in SRE, DevOps, platform engineering, or software engineering
  • Strong experience with AWS and distributed systems
  • Professional experience performing root cause analysis and incident management
  • Experience documenting SRE systems and operational processes
  • Strong programming skills across multiple languages
  • Experience with observability tools, monitoring, logging, and tracing
  • Experience with CI/CD pipelines and deployment automation
  • Familiarity with containerization and orchestration technologies
  • Strong troubleshooting, analytical, and problem-solving skills
  • Ability to work effectively in regulated or enterprise environments

Nice-to-Haves:

  • Experience with Infrastructure as Code tools such as CloudFormation
  • Experience with Spring, Spring Boot, React, or Angular
  • Experience with Tomcat, Netty, Node.js, or Next.js
  • Experience with relational and non-relational databases
  • Experience working in Agile/Scrum environments
  • Experience with ITSM tools such as ServiceNow
  • Familiarity with ITIL-based release and change management processes
  • Knowledge of security compliance frameworks such as ISO 27001 or SOC 2

Benefits

  • Health insurance
  • Retirement plan
  • Paid sick leave

Skills

AWSCI/CDCloudFormationDockerKubernetesNode.jsReactSpringSpring Boot

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free