GI
Kafka Platform Engineer (m/f/d)
GULP Information Services GmbH
Remote (Global) Full-time Today
About the role
About
We are currently seeking a Kafka Platform Engineer (m/f/d) for our client.
Details
- Employment Type: Full-time
- End Date: December 31, 2026 (with the option to extend)
- Location: Primarily remote
Responsibilities
- Design, deployment, and operation of scalable Kafka clusters (on-premises, Azure, OpenShift)
- Configuration and maintenance of Kafka components (Broker, Connect, ksqlDB, Schema Registry, MirrorMaker)
- Implementation of security and authentication mechanisms (TLS, SASL, ACLs, Kerberos)
- Monitoring, log analysis, and performance tuning (Prometheus, Grafana, JMX, Kafka Metrics)
- Incident management: fault diagnosis, root cause analysis, recovery, and post-mortem reporting
- Automation of deployments and configurations using IaC & GitOps workflows
- Planning and execution of capacity and cost optimization measures (partition strategies, tiered storage, dynamic scaling)
- Creation of documentation, best practice guides, and training for developers and operations teams
Requirements
- In-depth expertise in Apache Kafka / Confluent Kafka (Broker installation, cluster setup, topic management, partitioning, replication, quotas, ACLs)
- Experience with Kafka Connect, ksqlDB / Kafka Streams, Schema Registry
- Monitoring & performance tuning (JMX metrics, Prometheus, Grafana, Kafka metrics)
- Troubleshooting & incident response in a production Kafka environment
- Basic knowledge of Kubernetes/OpenShift (deployment of Kafka clusters in container environments)
- Solid understanding of network and security concepts (TLS/SSL, Kerberos, SASL)
Ideally
- Experience with GitOps automation (ArgoCD/Flux, Helm charts, CRDs) for Kafka deployment
- Infrastructure as Code (IaC) with Terraform for Azure / Kubernetes resources
- Knowledge of hybrid cloud environments (Azure + on-premises) and their Network integration
- Experience with container security frameworks (OPA, Kyverno, image scanning)
- Practical implementation of disaster recovery and HA strategies (Raft Mode, Zookeeper-Free, MirrorMaker)
- Optimization of Kafka costs and capacities (Dynamic Scaling, Tiered Storage)
Skills
ACLsApache KafkaAzureConfluent KafkaGitOpsGrafanaHelmIaCJMXKerberosKafka ConnectKafka MetricsKafka StreamsKubernetesksqlDBMirrorMakerOpenShiftOPAPrometheusRaft ModeSASLSchema RegistryTerraformTiered StorageTLSZookeeper-Free
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free