Skip to content
mimi

Site Reliability Engineer GCP Azure

Boost-IT

India · On-site Full-time 2w ago

About the role

As a Site Reliability Engineer (SRE) at Boost IT, you will play a key role in ensuring the reliability and scalability of our cloud infrastructure on Google Cloud Platform (GCP) and Microsoft Azure. Your expertise in data engineering will be crucial in optimizing performance, reliability, and cost-efficiency. Below are the details of your responsibilities and qualifications:

**Role Overview:** Boost IT is seeking an experienced Site Reliability Engineer (SRE) to lead technical initiatives and provide guidance to the team. Your role will involve managing cloud infrastructure, leading complex problem-solving efforts, and developing strategies to overcome operational challenges.

**Key Responsibilities:** - **Technical Leadership:** Provide guidance and mentorship to the team while leading by example with hands-on contributions. - **Complex Problem Solving:** Take ownership of complex tasks, drive solutions from conception to implementation. - **Coordination:** Ensure tasks are on track, resolve blockers, and facilitate a smooth workflow. - **Infrastructure Management:** Design, deploy, and maintain reliable and scalable cloud environments on GCP and Azure. - **Strategy Development:** Identify challenges, devise strategies to optimize performance, reliability, and cost-efficiency. - **Monitoring & Incident Resolution:** Implement monitoring systems, lead incident response efforts, and perform root cause analysis. - **Data Pipeline Management:** Collaborate with data teams to optimize and manage large-scale data pipelines for reliability and efficiency. - **Automation & CI/CD:** Develop automation scripts, implement CI/CD pipelines for improved deployment efficiency. - **Security & Compliance:** Ensure infrastructure follows security best practices and compliance requirements.

**Qualifications Required:** - 6 years of experience as a Site Reliability Engineer, DevOps Engineer, or in a related technical role. - Expertise in GCP and Azure with in-depth knowledge of cloud architecture, services, and best practices. - Strong data engineering background with hands-on experience in data pipelines, databases, and big data technologies. - Proven leadership experience with mentoring abilities and problem-solving mindset. - Proficiency in automation tools like Terraform, Ansible, and scripting languages such as Python and Bash. - Experience with monitoring tools like Prometheus, Grafana, or ELK stack. - Familiarity with containerization and orchestration tools like Docker and Kubernetes. - CI/CD expertise including the design and implementation of automated pipelines. - Excellent communication and leadership skills to drive cross-team collaboration.

Boost IT is a dynamic and energetic technology consultancy company looking for individuals passionate about technology and keen to work on cutting-edge projects. Join us in our mission to do IT better and make a significant impact in the technology industry. As a Site Reliability Engineer (SRE) at Boost IT, you will play a key role in ensuring the reliability and scalability of our cloud infrastructure on Google Cloud Platform (GCP) and Microsoft Azure. Your expertise in data engineering will be crucial in optimizing performance, reliability, and cost-efficiency. Below are the details of your responsibilities and qualifications:

**Role Overview:** Boost IT is seeking an experienced Site Reliability Engineer (SRE) to lead technical initiatives and provide guidance to the team. Your role will involve managing cloud infrastructure, leading complex problem-solving efforts, and developing strategies to overcome operational challenges.

**Key Responsibilities:** - **Technical Leadership:** Provide guidance and mentorship to the team while leading by example with hands-on contributions. - **Complex Problem Solving:** Take ownership of complex tasks, drive solutions from conception to implementation. - **Coordination:** Ensure tasks are on track, resolve blockers, and facilitate a smooth workflow. - **Infrastructure Management:** Design, deploy, and maintain reliable and scalable cloud environments on GCP and Azure. - **Strategy Development:** Identify challenges, devise strategies to optimize performance, reliability, and cost-efficiency. - **Monitoring & Incident Resolution:** Implement monitoring systems, lead incident response efforts, and perform root cause analysis. - **Data Pipeline Management:** Collaborate with data teams to optimize and manage large-scale data pipelines for reliability and efficiency. - **Automation & CI/CD:** Develop automation scripts, implement CI/CD pipelines for improved deployment efficiency. - **Security & Compliance:** Ensure infrastructure follows security best practices and compliance requirements.

**Qualifications Required:** - 6 years of experience as a Site Reliability Engineer, DevOps Engineer, or in a related technical role. - Expertise in GCP and Azure with in-depth knowledge of cloud architecture, services, and best practices. - Strong

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free