Site Reliability and DevOps Engineer
ZIWO
About the role
About the RoleWe re looking for a skilled and motivated Site Reliability and DevOps Engineer to join our Infrastructure team in Dubai. This position is remote-friendly, giving you the flexibility to work from anywhere while remaining closely connected to our Dubai office.In this role, you ll play a critical part in designing, maintaining, and optimizing our infrastructure to support ZIWO s global services. You ll collaborate with cross-functional teams to streamline deployments, automate tasks, and ensure our systems run smoothly, securely, and reliably.What You ll Be DoingDesign, build, and operate scalable and reliable infrastructure based on business and engineering needs.Take ownership of the entire service lifecycle from design to deployment, operations, and refinement.Identify manual or error-prone workflows and automate them to improve system efficiency.Provide system design consulting, capacity planning, and launch support.Monitor and maintain system health, ensuring optimal performance and minimal downtime.Use automation to scale systems sustainably and improve delivery velocity.Utilize public cloud tools (AWS/OCI) to enhance speed-to-market and reliability.Lead or contribute to blameless incident responses and postmortems.Participate in on-call rotations to support alerts or incidents during off-hours when required.What We re Looking ForBachelor s degree in Computer Science or a related field.3 5 years of experience in a DevOps or Site Reliability Engineering role.Strong experience with:Linux (Ubuntu)Ansible and TerraformDocker and Kubernetes (production-grade)CI/CD pipeline design and managementGit & Git FlowsPublic Cloud providers: AWS and/or OCIMonitoring tools: Prometheus, InfluxDB, ElasticsearchPostgreSQLLanguages: Python and/or GoBonus skills:VoIP: FreeSwitch, Asterisk, KamailioAWS services: SNS, SQS, Firehose, RedshiftGrafanaNetwork and virtualization knowledge: KVM, LAN/WAN, Cisco switches
Requirements
- Bachelor's degree in Computer Science or a related field
- 3-5 years of experience in a DevOps or Site Reliability Engineering role
- Strong experience with Linux (Ubuntu), Ansible, Terraform, Docker, Kubernetes, CI/CD pipeline design, Git, public cloud providers, monitoring tools, and programming languages
Responsibilities
- Design, build, and operate scalable and reliable infrastructure
- Take ownership of the entire service lifecycle
- Identify manual or error-prone workflows and automate them
- Provide system design consulting, capacity planning, and launch support
- Monitor and maintain system health
- Use automation to scale systems sustainably
- Utilize public cloud tools to enhance speed-to-market and reliability
- Lead or contribute to blameless incident responses and postmortems
- Participate in on-call rotations
Benefits
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free