Skip to content
mimi

Senior Site Reliability Engineer (m/f/d)

Hays

On-site 4w ago

About the role

Ihre Aufgaben:

  • Deploy, install, and manage Kubernetes clusters in on-prem and hybrid environments
  • Configure and maintain GitOps Workflows, Helm/Kustomize, and artifact registries in restricted networks
  • Design, operate, and lead incident response for the observability stack (Prometheus, Grafana) and enforce disaster recovery
  • Harden environments with network segmentation, mTLS, IAM, and vulnerability remediation
  • Produce compliance documentation, runbooks, and train agency and client teams on operations

Ihre Qualifikationen:

  • Excellent skills with SRE/Platform Engineering/DevOps, on-call ownership, and operating production systems
  • Delivered services for regulated customers (public sector, finance, healthcare)
  • Worked directly with customer IT/security teams and handled compliance audits
  • Familiar with data privacy obligations, logging, and legal evidence standards
  • Kubernetes operations (on-prem/hybrid) with GitOps, Helm/Kustomize, infrastructure automation (Ansible, Terraform) and Google Cloud
  • Observability tooling (Prometheus, Grafana, Loki, OpenTelemetry) and incident response practices

Ihre Vorteile:

  • A highly motivated team and an open way of communication

Requirements

  • Kubernetes operations (on-prem/hybrid) with GitOps, Helm/Kustomize, infrastructure
  • automation (Ansible, Terraform) and Google Cloud
  • Observability tooling (Prometheus, Grafana, Loki, OpenTelemetry) and incident response practices

Responsibilities

  • Deploy, install, and manage Kubernetes clusters in on-prem and hybrid environmentsConfigure and maintain GitOps
  • Workflows, Helm/Kustomize, and artifact registries in restricted networks
  • Design, operate, and lead incident response for the observability stack (Prometheus, Grafana) and enforce disaster recovery
  • Harden environments with network segmentation, mTLS, IAM, and vulnerability remediation
  • Produce compliance documentation, runbooks, and train agency and client teams on operations

Skills

AnsibleGitOpsGoogle CloudGrafanaHelmIAMInfrastructure automationKubernetesLokimTLSOpenTelemetryPrometheusTerraform

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free