OPENSHIFT SRE {Site Reliability Engineer}
Randstad Canada
About the role
Below is a polished, ready‑to‑post version of the OpenShift Engineer / Administrator role. I’ve kept all of the key details you provided while tightening the language, adding a clear “About Us” section, and formatting the qualifications for easy scanning. Feel free to copy‑paste it directly into your applicant‑tracking system or make any tweaks you see fit.
OpenShift Engineer / Administrator (Enterprise‑Scale)
Location: Canada (remote / hybrid – specify if needed)
Employment Type: Full‑time, permanent
Travel: Minimal / occasional onsite visits
About the Organization
We are a trusted technology partner for Canada’s largest institutions, delivering secure, innovative solutions that drive digital transformation. Our mission‑critical platforms support sectors such as finance, health, and government, and we pride ourselves on fostering an inclusive workplace where every employee feels valued, supported, and connected.
Why Join Us?
- Impact: Build and harden enterprise‑scale OpenShift/Kubernetes clusters that power mission‑critical services.
- Autonomy: Own the full lifecycle—from architecture and provisioning to long‑term operations and security.
- Support: Work with a skilled Platform Team Lead, modern tooling, and a culture that encourages continuous learning.
- Inclusivity: We are committed to equity, diversity, and inclusion, and provide accommodations throughout the hiring process.
Key Responsibilities
| Area | What You’ll Do |
|---|---|
| Design & Build | • Provision new OpenShift clusters (including AKS/ARO where relevant). • Define architecture, networking, storage, and security baselines. • Document design decisions and runbooks. |
| Resilience Engineering | • Harden clusters against threats (network policies, RBAC, pod security, etc.). • Optimize performance, networking, and storage. • Diagnose and resolve production incidents swiftly. |
| Collaboration & Leadership | • Partner with the Platform Team Lead and other engineering groups to plan and scale infrastructure initiatives. • Mentor junior staff and share best practices. |
| Automation & Observability | • Contribute to IaC/automation (Ansible, Terraform, Helm, etc.). • Implement monitoring, alerting, and self‑healing mechanisms. |
| On‑Call | • Participate in a compensated 24/7 on‑call rotation (primary/secondary) for rare, high‑impact incidents. |
Required Qualifications
- Deep expertise with OpenShift (4.x/5.x) and Kubernetes (including AKS/ARO experience a plus).
- Linux administration (RHEL/CentOS/Ubuntu) – networking, storage, SELinux, system tuning.
- Security hardening of clusters (RBAC, network policies, pod security standards, vulnerability scanning).
- Production troubleshooting – ability to isolate and resolve complex, multi‑layer issues.
- Infrastructure‑first mindset – not just application deployment.
- Scripting/automation – Python, Bash, Ansible, or similar (nice‑to‑have).
Preferred (Nice‑to‑Have) Skills
- GitOps workflows (Argo CD, GitLab CI/CD, Flux).
- Operator experience (e.g., Logging, ACS, Elasticsearch, Portworx, CrunchyData).
- Broader cloud/Kubernetes infrastructure experience (AWS, Azure, GCP).
What We Offer
- Competitive salary + benefits package.
- Professional development budget and access to certifications.
- Flexible work arrangements.
- A culture that celebrates diversity, equity, and inclusion.
Commitment to Accessibility & Inclusion
We are dedicated to creating an accessible hiring process. If you require accommodations at any stage, please email [accommodations@yourcompany.com] with your request.
Ready to own enterprise‑scale OpenShift engineering and make a lasting impact?
Apply today by submitting your resume and a brief cover letter outlining your most relevant OpenShift projects.
Feel free to let me know if you’d like any additional sections (e.g., salary range, benefits details, interview process) or if you’d prefer a more concise version.
Requirements
- Strong expertise in OpenShift and Kubernetes (AKS/ARO experience also valuable).
- Proven experience with Linux administration, networking, storage, and cluster security hardening.
- Solid background in infrastructure-level operations—not just application deployment.
- Ability to troubleshoot complex production issues with precision.
- Independent, proactive, and ready to hit the ground running—this team has no bandwidth for extended training.
Responsibilities
- Provision new OpenShift clusters, define architecture, and implement secure configurations.
- Harden security, optimize networking/storage, and troubleshoot production issues.
- Partner with the Team Lead, Platforms, to plan, deliver, and scale infrastructure initiatives.
- Contribute to automation, monitoring, and healing of OpenShift clusters.
- Participate in a compensated 24/7 on-call rotation (primary/secondary), ensuring availability for rare but critical incidents.
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free