Lead DevOps Engineer - On-prem, Linux, Containers - (Healthcare or Highly regulated Industry)
Discovered MENA
About the role
Lead DevOps Engineer
Location: Dubai
Salary: Competitive (Tax Free) + Benefits
About
We are seeking an experienced Lead DevOps Engineer from either healthcare or highly regulated industries to define and oversee platform engineering practices for a premier data and technology organization. This role sets the standards for secure and scalable automated environments supporting data, AI, and application workloads across the enterprise.
Role Overview
As a Lead DevOps Engineer, you will design and operate Kubernetes and OpenShift clusters mainly in on-prem environments and some hybrid environments. You will own the cluster lifecycle, including upgrades, storage integration, and production troubleshooting to ensure engineering teams can deliver quickly.
You will lead a team of 3/4 engineers while enforcing DevOps standards across CI/CD and Infrastructure as Code. You will be responsible for building observability frameworks and partnering with security teams to embed identity and network guardrails.
This role requires strong hands-on experience operating Linux-based systems and Kubernetes/OpenShift clusters in on-prem and hybrid environments, including ownership of cluster lifecycle, upgrades, storage integration, and production troubleshooting.
Responsibilities
- Strong hands-on experience operating Linux-based systems and Kubernetes/OpenShift clusters in on-prem and hybrid environments
- Champion DevOps principles within an enterprise hybrid architecture involving Azure cloud and on-prem storage solutions
- Standardize CI/CD frameworks using Azure DevOps and GitHub Actions
- Ensure Infrastructure as Code practices such as Terraform and Ansible are applied consistently
- Implement guardrails for container orchestration platforms including scaling and networking
- Establish observability standards for metrics and logs across all workloads
- Define operational SLOs and SLAs for platform services to ensure peak performance
- Oversee the creation of runbooks and automated remediation processes
- Partner with DataOps and AI teams to provide secure environments for model training and deployment
- Optimize infrastructure utilization to maintain a balance between cost and performance
Requirements
- 10+ years in engineering or DevOps roles with at least 5 years leading platform teams
- Deep expertise in CI/CD automation and container orchestration
- Advanced proficiency in Azure DevOps, GitHub Actions, Terraform, and Ansible
- Strong hands-on knowledge of Kubernetes and OpenShift cluster management
- Familiarity with observability stacks like Prometheus and Grafana
- Bachelor’s or Master’s degree in Computer Science or a related engineering field
- Understanding of hybrid infrastructure and data governance frameworks
- Exceptional communication and leadership skills with a focus on developing others
Requirements
- Deep expertise in CI/CD automation and container orchestration.
- Advanced proficiency in Azure DevOps, GitHub Actions, Terraform, and Ansible.
- Strong hands-on knowledge of Kubernetes and OpenShift cluster management.
- Familiarity with observability stacks like Prometheus and Grafana.
- Understanding of hybrid infrastructure and data governance frameworks.
- Exceptional communication and leadership skills with a focus on developing others.
Responsibilities
- Design and operate Kubernetes and OpenShift clusters mainly in on-prem environments and some hybrid environments.
- Own the cluster lifecycle, including upgrades, storage integration, and production troubleshooting to ensure engineering teams can deliver quickly.
- Lead a team of 3/4 engineers while enforcing DevOps standards across CI/CD and Infrastructure as Code.
- Build observability frameworks and partner with security teams to embed identity and network guardrails.
- Champion DevOps principles within an enterprise hybrid architecture involving Azure cloud and on-prem storage solutions.
- Standardize CI/CD frameworks using Azure DevOps and GitHub Actions.
- Ensure Infrastructure as Code practices such as Terraform and Ansible are applied consistently.
- Implement guardrails for container orchestration platforms including scaling and networking.
- Establish observability standards for metrics and logs across all workloads.
- Define operational SLOs and SLAs for platform services to ensure peak performance.
- Oversee the creation of runbooks and automated remediation processes.
- Partner with DataOps and AI teams to provide secure environments for model training and deployment.
- Optimize infrastructure utilization to maintain a balance between cost and performance.
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free