Skip to content
mimi

(Senior) Site Reliability Engineer / Distributed Cloud - STACKIT (m/w/d) in Heilbronn

Energy Jobline ZR

Heilbronn · On-site Senior Today

About the role

About Schwarz Digits

Schwarz Digits schafft das technologische Fundament für digitale Entscheidungsfreiheit in Europa. Als IT- und Digitalsparte der Schwarz Gruppe entwickeln und verantworten wir einerseits die IT-Infrastrukturen für die Handelssparten Lidl und Kaufland sowie die Schwarz Produktion und PreZero. Gleichzeitig agieren wir als unabhängiger Anbieter am externen Markt, um Unternehmen in ganz Europa bei ihrer digitalen Transformation zu unterstützen.

Unsere Kernleistungen bündeln wir in den Bereichen Cloud, Cyber Security, Data & AI, Communication und Workspace. Trage auch du zur digitalen Entscheidungsfreiheit in Europa bei. Bei uns arbeitest du an der Schnittstelle zwischen Agilität und Sicherheit: Du profitierst von den schnellen Entscheidungswegen, genießt echte Gestaltungsspielräume in deinen Projekten und baust dabei auf das stabile Fundament der Schwarz Gruppe.

Your Responsibilities

  • You operate and optimize our highly complex platforms (Kubernetes, KubeVirt, Cilium, Ceph, Talos) as well as the underlying infrastructure with a focus on end-to-end stability, scalability, and costs.
  • You develop and maintain our monitoring and logging systems (Metrics, Logs, Traces) to ensure deep insights into the system status at all times and to proactively identify bottlenecks.
  • You implement consistent synthetic monitoring and trace tests to continuously validate the end-to-end functionality of critical services.
  • You define and monitor clear Service Level Objectives (SLOs) and consistently reduce 'Toil' through code. Runbooks are only the last line of defense for you.
  • You document your work comprehensibly, because the best system is worthless without good Markdown.

Your Profile

  • You have a completed degree in Computer Science or a related field.
  • At least 2 years of active experience as an SRE/DevOps Engineer, where you learned that 'Works on my machine' is not an answer.
  • Sound experience in operating cloud infrastructures with Kubernetes and/or virtualization technologies.
  • You have good knowledge of software development with Golang or a comparable systems language and use it to automate processes and build your own tools.

Skills

CephCiliumCloudCommunicationContainerizationCyber SecurityData & AIDevOpsDockerElasticsearchGitGolangGrafanaInfrastructureITKibanaKubernetesKubeVirtLinuxLoggingMonitoringNetworkingObservabilityPostgreSQLPrometheusPythonSRETalosTerraformVirtualizationWorkspace

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free