KA
AI Ops Engineer
Knapp AG
Graz · flexible Full-time From €3k/mo Today
About the role
Your role in the team
- As an AI Ops Engineer, you are a central part of our team and shape the future of our intelligent IT operations solutions.
- With your expertise in automation, observability, and ML-driven system analysis, you help us implement highly available, robust, and autonomous platforms.
- You develop AI-powered automation solutions to make IT operations more efficient, stable, and predictive.
- You implement event correlation and noise reduction mechanisms to quickly identify important signals from large data sets.
- You develop anomaly detection and predictive analytics systems for early prediction of critical failures.
- You conduct root cause analyses (RCA) using modern ML and data analysis methods.
- You implement automated remediation and self-healing mechanisms that automatically resolve or mitigate incidents.
- You are building a comprehensive observability architecture (logs, metrics, traces) to ensure complete system transparency.
- You automate recurring processes through scripting and infrastructure automation and assist in incident response and troubleshooting.
- You work closely with AI engineers, platform teams, and infrastructure experts to develop intelligent, scalable operations solutions.
What we offer
- We offer you professional and personal development programs as well as the opportunity to work in a motivated team where the willingness to take responsibility and enthusiasm for implementing ideas are welcomed.
- The minimum gross salary for this position is EUR 3,415.01 per month for full-time employment.
- However, we offer market-compliant compensation depending on your qualifications and prior experience.
Technologies and skills
- Python
- Apache Spark
- Ansible
- Elastic
- Google Cloud Platform
- Grafana
- Podman
- OpenTelemetry
- Prometheus
- Terraform
- AWS
- Azure
- Kubernetes
- Apache Kafka
Our expectations:
Qualifications
- You have expertise in Observability stacks such as Prometheus, Grafana, ELK/Elastic, OpenTelemetry.
- You have practical experience with Big Data/Streaming technologies (Kafka, Spark) for processing large telemetry data sets.
- You are familiar with cloud platforms (AWS/Azure/GCP) in the areas of monitoring, telemetry, and automated operations.
- You have knowledge of container and orchestration platforms (Podman, Kubernetes).
- Excellent German and English skills, as well as clear and structured communication abilities, and an independent and systematic way of working, distinguish you.
Experience
- You have extensive experience with Python for automation and data analysis.
- You have experience with infrastructure automation (Ansible, Terraform, Argo, GitOps tools).
Education
- You have a technical education (HTL Informatics, university or university of applied sciences degree in Informatics/Data Science, etc.) or possess equivalent practical experience.
Benefits
- No Physical Barriers
- Employee Parking Space
- Fresh Fruit
- Health Care Benefits
- Company Retirement Provision
- Day Care for Kids
- Employee Discount
- Excellent Traffic Connections
- Company Doctor
- Flexible Working Hours
- Employee Stock Option
Skills
AnsibleApache KafkaApache SparkAWSAzureElasticGoogle Cloud PlatformGrafanaKubernetesOpenTelemetryPodmanPrometheusPythonTerraform
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free