H
Senior Site Reliability Engineer (m/f/d)
Hays
On-site 4w ago
About the role
Ihre Aufgaben:
- Deploy, install, and manage Kubernetes clusters in on-prem and hybrid environments
- Configure and maintain GitOps Workflows, Helm/Kustomize, and artifact registries in restricted networks
- Design, operate, and lead incident response for the observability stack (Prometheus, Grafana) and enforce disaster recovery
- Harden environments with network segmentation, mTLS, IAM, and vulnerability remediation
- Produce compliance documentation, runbooks, and train agency and client teams on operations
Ihre Qualifikationen:
- Excellent skills with SRE/Platform Engineering/DevOps, on-call ownership, and operating production systems
- Delivered services for regulated customers (public sector, finance, healthcare)
- Worked directly with customer IT/security teams and handled compliance audits
- Familiar with data privacy obligations, logging, and legal evidence standards
- Kubernetes operations (on-prem/hybrid) with GitOps, Helm/Kustomize, infrastructure automation (Ansible, Terraform) and Google Cloud
- Observability tooling (Prometheus, Grafana, Loki, OpenTelemetry) and incident response practices
Ihre Vorteile:
- A highly motivated team and an open way of communication
Requirements
- Kubernetes operations (on-prem/hybrid) with GitOps, Helm/Kustomize, infrastructure
- automation (Ansible, Terraform) and Google Cloud
- Observability tooling (Prometheus, Grafana, Loki, OpenTelemetry) and incident response practices
Responsibilities
- Deploy, install, and manage Kubernetes clusters in on-prem and hybrid environmentsConfigure and maintain GitOps
- Workflows, Helm/Kustomize, and artifact registries in restricted networks
- Design, operate, and lead incident response for the observability stack (Prometheus, Grafana) and enforce disaster recovery
- Harden environments with network segmentation, mTLS, IAM, and vulnerability remediation
- Produce compliance documentation, runbooks, and train agency and client teams on operations
Skills
AnsibleGitOpsGoogle CloudGrafanaHelmIAMInfrastructure automationKubernetesLokimTLSOpenTelemetryPrometheusTerraform
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free