Senior Dev Ops Engineer
Rethinkit
About the role
About Us
We are looking for a Senior Dev Ops Engineer to help us design, implement, and operate enterprise-grade logging, metrics, monitoring, and alerting across our production platforms.
You will be a key contributor in managing a production RKE2 Kubernetes cluster, operated via Rancher, and improving our CI/CD pipelines using Git Lab.
The role supports high-performance, highly secure systems for one of the largest stock brokers in the United States.
Note: Candidates must be able to work, when required, US Eastern Time hours (14:00 - 00:00 SAST) as this will be required for production deployments, troubleshooting and collaboration with engineering teams.
This is a fully remote role, but candidates must reside in the greater Cape Town area for team lunches and get togethers. It is a hands‑on role working closely with backend, web, and mobile engineering teams as well as the platform team in the US.
Responsibilities
Observability & Reliability
- Design and implement enterprise-grade logging, metrics, monitoring, and alerting
- Build and maintain centralised log aggregation pipelines
- Define alerting strategies, SLOs, and operational dashboards
- Ensure system health, performance, and reliability in production
- Set up, configure, and manage production Kubernetes clusters (RKE2)
- Deploy and manage workloads using Helm
- Troubleshoot cluster, networking, and workload-level issues
- Apply security and hardening best practices for production systems
CI/CD & Automation
- Build, migrate, and maintain Git Lab CI/CD pipelines
- Migrate existing services into standardised pipelines
- Support and maintain pipelines for new services
- Improve developer experience through automation and reusable templates
Security & Networking
- Implement and maintain security best practices
- Manage SSL certificates
- Secure Kubernetes workloads, networks, and CI/CD pipelines
- Work with complex, high-security networking environments
Containers & Software Collaboration
- Set up and maintain Docker containers and registries
- Work closely with backend teams and assist where required
C++ experience is a bonus
Required Experience
- Senior-level experience in Dev Ops / Platform Engineering
- Experience with RKE2 and Rancher
- Strong experience with Helm
- Hands‑on experience with Git Lab CI/CD
- Experience implementing:
- Metrics and monitoring
- Alerting strategies
- Strong networking and security fundamentals
- Experience supporting mission‑critical production systems
Nice to Have
- Observability tools such as:
- Grafana
- Loki, ELK, or Open Search
- Open Telemetry
- Experience with Vector for log aggregation
- AWS/GCP experience
- Infrastructure as Code (Terraform, Ansible)
- Git Ops workflows
- Experience in financial or regulated environments
What We Offer
- Fully remote role, but candidates must reside in the greater Cape Town area for team get togethers.
- Work on large-scale, high-performance systems
- Opportunity to contribute to platforms used by one of the largest US stock brokers
- Collaborative engineering environment
- Competitive compensation
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free