Site Reliability Engineer (SRE)
Highspring
About the role
About Highspring
Highspring is a consulting and professional services firm specialized in technology delivery, digital transformation, and high‑performance engineering. We partner with organizations of all sizes to accelerate innovation, strengthen operational excellence, and build scalable, future‑ready platforms. Our teams combine technical depth, modern delivery practices, and a people‑focused mindset to help clients achieve meaningful, sustainable impact.
The Opportunity
We're looking for a Site Reliability Engineer (SRE) to help one of our major clients strengthen the reliability, resilience, and performance of their production platforms. This role is ideal for someone who thrives in complex distributed environments and enjoys working closely with development, architecture, and product teams to drive engineering excellence.
What You'll Do
- Ensure the stability, reliability, and resilience of critical production platforms.
- Automate end‑to‑end deployment, testing, and quality controls using modern Infrastructure‑as‑Code and continuous delivery practices.
- Design and industrialize observability solutions (logs, metrics, alerts) to support service‑level objectives.
- Guide and influence development teams to improve reliability, performance, and security from design through operations.
- Identify, prioritize, and implement technical improvements by replacing outdated technologies with sustainable, business‑aligned solutions.
- Participate in technical planning with engineering leadership and product owners, contributing to shared standards, tooling, and documentation.
What You Bring to the Table
- Bachelor's degree in computer science, software engineering, or a related field and 5+ years of relevant experience;
- OR a Master's degree and 4+ years of experience;
- OR a university certificate and 8+ years of experience.
- Strong expertise in Site Reliability Engineering practices, Infrastructure‑as‑Code, continuous deployment, and automated testing.
- Hands‑on experience with ecosystems such as Docker, Kubernetes, Git‑based delivery pipelines, Terraform, Ansible, and observability tools (e.g., Splunk, Datadog, SonarQube).
- Professional experience in Java development or system administration in distributed environments.
- AWS certification
Core Skills Required
- Reliability engineering
- CI/CD & IaC automation
- Observability architecture
- Distributed systems
- Performance and resiliency optimization
- Cloud engineering (AWS)
- Modern DevOps tooling
Our Stack
Typical tools used on this mandate include:
- Containerization & Orchestration: Docker, Kubernetes
- IaC & Automation: Terraform, Ansible
- CI/CD: Git‑based pipelines
- Observability: Splunk, Datadog, SonarQube
- Languages: Java (or equivalent experience)
- Cloud: AWS (certified)
Why Join Highspring
At Highspring, you'll join a team of experienced consultants who value collaboration, autonomy, and continuous learning. We offer the opportunity to work on impactful projects, contribute to modern engineering practices, and grow your career in a supportive, forward‑thinking environment. If you're passionate about reliability, automation, and technical excellence, we'd love to meet you.
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free