SS
Systems Reliability Engineer (SRE)
Soho Square Solutions
Montreal · Hybrid Contract Mid Level 1mo ago
About the role
Role Overview
We are seeking a Systems Reliability Engineer to help design, operate, and scale highly available, reliable production systems. This role focuses on automation, observability, performance, and stability across large‑scale, distributed platforms.
Key Responsibilities
- Design, build, and support reliable production systems in partnership with engineering teams
- Troubleshoot issues across infrastructure, OS, application, and network layers
- Drive automation for deployments, monitoring, and operational workflows
- Identify and mitigate reliability and production risks
- Participate in design reviews, operational readiness, and on-call rotations
- Collaborate with global teams in a follow‑the‑sun support model
Required Skills & Experience
- 2+ years of experience in SRE, production support, or systems engineering
- Strong scripting or automation skills (Python, Bash, Perl preferred)
- Experience with UNIX/Linux environments and multi-tier architectures
- Familiarity with CI/CD, source control, and containerization (Git, Jenkins, Docker)
- Exposure to monitoring/observability tools (Grafana, Dynatrace, AppDynamics)
- Knowledge of distributed systems, microservices, and operating system fundamentals
Skills
BashCI/CDDockerGitGrafanaJenkinsLinuxMicroservicesMonitoringObservabilityPerlProduction SupportPythonSRESystems EngineeringUNIX
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free