Skip to content
mimi

Senior Dev Ops Engineer

Rethinkit

Remote · South Africa Full-time Senior 6d ago

About the role

About Us

We are looking for a Senior Dev Ops Engineer to help us design, implement, and operate enterprise-grade logging, metrics, monitoring, and alerting across our production platforms.

You will be a key contributor in managing a production RKE2 Kubernetes cluster, operated via Rancher, and improving our CI/CD pipelines using Git Lab.

The role supports high-performance, highly secure systems for one of the largest stock brokers in the United States.

Note: Candidates must be able to work, when required, US Eastern Time hours (14:00 - 00:00 SAST) as this will be required for production deployments, troubleshooting and collaboration with engineering teams.

This is a fully remote role, but candidates must reside in the greater Cape Town area for team lunches and get togethers. It is a hands‑on role working closely with backend, web, and mobile engineering teams as well as the platform team in the US.

Responsibilities

Observability & Reliability

  • Design and implement enterprise-grade logging, metrics, monitoring, and alerting
  • Build and maintain centralised log aggregation pipelines
  • Define alerting strategies, SLOs, and operational dashboards
  • Ensure system health, performance, and reliability in production
  • Set up, configure, and manage production Kubernetes clusters (RKE2)
  • Deploy and manage workloads using Helm
  • Troubleshoot cluster, networking, and workload-level issues
  • Apply security and hardening best practices for production systems

CI/CD & Automation

  • Build, migrate, and maintain Git Lab CI/CD pipelines
  • Migrate existing services into standardised pipelines
  • Support and maintain pipelines for new services
  • Improve developer experience through automation and reusable templates

Security & Networking

  • Implement and maintain security best practices
  • Manage SSL certificates
  • Secure Kubernetes workloads, networks, and CI/CD pipelines
  • Work with complex, high-security networking environments

Containers & Software Collaboration

  • Set up and maintain Docker containers and registries
  • Work closely with backend teams and assist where required

C++ experience is a bonus

Required Experience

  • Senior-level experience in Dev Ops / Platform Engineering
  • Experience with RKE2 and Rancher
  • Strong experience with Helm
  • Hands‑on experience with Git Lab CI/CD
  • Experience implementing:
    • Metrics and monitoring
    • Alerting strategies
  • Strong networking and security fundamentals
  • Experience supporting mission‑critical production systems

Nice to Have

  • Observability tools such as:
    • Grafana
    • Loki, ELK, or Open Search
    • Open Telemetry
  • Experience with Vector for log aggregation
  • AWS/GCP experience
  • Infrastructure as Code (Terraform, Ansible)
  • Git Ops workflows
  • Experience in financial or regulated environments

What We Offer

  • Fully remote role, but candidates must reside in the greater Cape Town area for team get togethers.
  • Work on large-scale, high-performance systems
  • Opportunity to contribute to platforms used by one of the largest US stock brokers
  • Collaborative engineering environment
  • Competitive compensation

Skills

AnsibleAWSC++DockerELKGCPGrafanaGitGitLab CI/CDHelmInfrastructure as CodeLokiOpen SearchOpen TelemetryRancherRKE2TerraformVector

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free