Senior Site Reliability Engineer/ DevOps Engineer/ Platform or Cloud Engineer
Zema Global Data Corporation
About the role
You will be working as a Backend-focused SRE / DevOps Engineer at Zema Global Data Corporation, responsible for ensuring system reliability, stability, and automation across AWS-based environments. Your role will involve troubleshooting production issues, improving infrastructure, automating processes, and collaborating closely with internal product and engineering teams. This is a hands-on infrastructure and reliability engineering position with no direct client-facing exposure. • *Key Responsibilities:** - Maintain and support products and data systems by proactively monitoring events, investigating issues, analyzing solutions, and driving problems to resolution. - Collaborate with the product team to define application hardening and opportunities for chaos engineering. - Utilize operational tools and monitoring platforms for in-depth knowledge, understanding, and ongoing monitoring of system availability, performance, and capacity. - Establish Service Level Indicators and Objectives (SLIs and SLOs) in collaboration with business partners. - Implement an alerting strategy to make alerts actionable and unique. - Support Unix/Linux servers running Commodity Data custom services. - Adhere to best practices, develop efficiencies, and enhance department scalability. • *Required Qualifications:** • *Must Have** - Bachelors degree in Computer Science or equivalent practical experience. - 3-5+ years of experience in DevOps, Site Reliability Engineering, or infrastructure-focused engineering roles. - Working knowledge of Linux systems administration and AWS services such as EC2, IAM, VPC, S3, and CloudWatch. - Strong scripting skills in Python and Bash. - Experience troubleshooting production incidents and working with CI/CD tools like Jenkins. - Understanding of DevOps principles, automation practices, and monitoring tools. - Strong communication skills in English and ability to collaborate effectively. • *Nice Have** - AWS certification(s), defining SLIs / SLOs, designing alerting strategies, chaos engineering concepts. - Windows systems knowledge, relational database experience, programming skills in popular languages. - Experience managing AWS permissions and IAM policies, improving system scalability and performance. • *Why Zema Global:** - Join a rapidly growing company that influences decision-making in energy and commodities. - Work with cutting-edge technology alongside industry experts. - Opportunity to impact strategy, revenue growth, and decision-making. - Be part of a culture that values innovation, collaboration, and autonomy for meaningful change. • Please submit your PDF CV highlighting relevant experience. Only shortlisted candidates will be contacted. No visa sponsorship available for this position.* • Equality and Diversity:* Zema Global is committed to diversity and inclusion without discrimination based on race, gender, sexual orientation, disability, or any other protected status. You will be working as a Backend-focused SRE / DevOps Engineer at Zema Global Data Corporation, responsible for ensuring system reliability, stability, and automation across AWS-based environments. Your role will involve troubleshooting production issues, improving infrastructure, automating processes, and collaborating closely with internal product and engineering teams. This is a hands-on infrastructure and reliability engineering position with no direct client-facing exposure. • *Key Responsibilities:** - Maintain and support products and data systems by proactively monitoring events, investigating issues, analyzing solutions, and driving problems to resolution. - Collaborate with the product team to define application hardening and opportunities for chaos engineering. - Utilize operational tools and monitoring platforms for in-depth knowledge, understanding, and ongoing monitoring of system availability, performance, and capacity. - Establish Service Level Indicators and Objectives (SLIs and SLOs) in collaboration with business partners. - Implement an alerting strategy to make alerts actionable and unique. - Support Unix/Linux servers running Commodity Data custom services. - Adhere to best practices, develop efficiencies, and enhance department scalability. • *Required Qualifications:** • *Must Have** - Bachelors degree in Computer Science or equivalent practical experience. - 3-5+ years of experience in DevOps, Site Reliability Engineering, or infrastructure-focused engineering roles. - Working knowledge of Linux systems administration and AWS services such as EC2, IAM, VPC, S3, and CloudWatch. - Strong scripting skills in Python and Bash. - Experience troubleshooting production incidents and working with CI/CD tools like Jenkins. - Understanding of DevOps principles, automation practices, and monitoring tools. - Strong communication skills in English and ability to collaborate effectively. • *Nice Have** - AWS certification(s), defining SLIs / SLOs, designing alerting strategies, chaos engineering concepts.
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free