Application Administrator (DevOps / SRE)
Appzone Limited
About the role
Job Opening ID: 292
Job Description
• Log Management: Ensure critical logs are visualized, efficiently aggregated, and properly archived to blob storage.
• CI/CD & Pipeline Management: Ensure smooth automation, testing, and deployment cycles, and continuously improve system monitoring tools and processes.
• Application Support: Provide second-level application support and troubleshooting for users.
• Service Uptime: Ensure prompt incident detection, logging, and recovery to minimize service disruption.
• Automation: Automate repetitive tasks and manual processes so that work is completed more efficiently.
• Disaster Recovery: Maintain and improve disaster recovery plans to ensure business continuity during unforeseen situations.
• Performance Monitoring: Monitor application performance and ensure an optimal user experience.
• System Health: Create and manage health checks for services and systems, and monitor and ensure the availability of third-party services integrated into the system.
• Documentation: Ensure proper documentation of system configurations and changes.
• Collaboration & Cross-Functional Sync: Collaborate with other teams to improve cross-functional processes and communication.
• Alerting & Visualization: Implement proactive alerting processes for critical infrastructure and system-related issues.
• Change Management: Implement a robust version control and change management process.
• Incident Management: Provide timely incident reports to the relevant stakeholders for decision-making purposes.
• Operational Excellence: Evaluate and integrate new tools or services that improve operational efficiency.
Requirements
• Bachelor's degree in Computer Science or another highly technical or scientific discipline, with at least 2 years' experience as a Site Reliability Engineer, DevOps Engineer, and/or Application Administrator.
• Ability to program in one or more high-level languages, such as Python, Java, C/C++, Ruby, and JavaScript.
• Experience with distributed storage technologies such as NFS, HDFS, Ceph, and S3, as well as dynamic resource management frameworks such as Mesos, Kubernetes, and YARN.
• Hands-on experience with CI/CD tools (e.g., Jenkins, GitHub Actions, Azure DevOps) and version control systems.
• Experience with cloud computing technologies such as AWS, Azure, and GCP, and with basic database management, Linux, and Windows Server OS.
• Proficiency in scripting/automation (Python, Bash, or similar) and familiarity with containerization technologies (Docker, Kubernetes).
• Experience with monitoring tools such as AWS CloudWatch, Grafana, ELK, and New Relic, and with working in 24/7 shift-based structures.
• A proactive approach to spotting problems, areas for improvement, and performance bottlenecks.
• An analytical, problem-solving mindset, with good communication and collaboration skills, and experience in automation, system reliability, and continuous improvement.
• Good knowledge of cloud platforms (AWS, Azure, or GCP) and modern infrastructure practices, with relevant certifications in any of these platforms.