Lead Cloud GCP Systems Engineer
EPAM
About the role
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi‑national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting‑edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
We are seeking a talented Lead Cloud GCP Engineer to join our team and contribute to the design, implementation, and management of Google Cloud Platform (GCP) solutions. If you are passionate about cloud technologies, infrastructure as code, and optimizing cloud‑based environments, this role offers an exciting opportunity to shape our cloud strategy and drive innovation. We require an experienced Architect who can contribute to critical application and product development projects.
Requirements
- 8–12 years of overall professional experience, with at least 5 years of production experience on a public cloud
- Minimum 2 years of leadership experience managing or leading teams
- Excellent communication skills
- Strong understanding of security, networking, and hybrid‑cloud concepts
- Good knowledge of GCP architecture and deployment patterns
- Basic exposure to Site Reliability Engineering (SRE) practices
- Experience designing and implementing solutions that span on‑premises infrastructure and GCP
- Hands‑on experience with GCP services such as Compute Engine, App Engine, Cloud Functions, GKE, Cloud Run, Pub/Sub, Cloud Storage, data tools, and Cloud Build
- Experience with GCP networking (VPC, VPN, peering, interconnect)
- Experience with GCP security (IAM, service accounts, workload identity)
- Container orchestration experience (Kubernetes/GKE)
- Experience with Google Container Registry (GCR)
- Knowledge of load balancers, auto‑scaling, databases, firewall rules, and related services
- Working knowledge of backup and recovery methodologies on GCP
- Proficiency with Infrastructure as Code tools, especially Terraform
- Experience with at least one configuration‑management tool (e.g., Ansible, Packer, Salt, Chef, Puppet)
- Containerization experience with Docker and Kubernetes
- Windows or Linux/Unix administration and Unix shell scripting (bash)
- Experience with CI/CD pipelines and related tooling
Responsibilities
- Work closely with the Cloud Architect to translate requirements into automation for production workload deployment
- Provide first‑tier support for internal and external users
- Design, install, configure, and maintain cloud resources and develop recovery strategies
- Monitor and evaluate upgrades and additions of IaaS/PaaS and software to meet growth requirements
- Ensure fault‑tolerance, high‑availability, scalability, and security on Google Cloud Infrastructure and Platform
- Implement CI/CD pipelines with automated build and test systems
- Manage production deployments using multiple deployment strategies
- Automate Google Cloud infrastructure deployment using Infrastructure as Code
- Automate system configurations with configuration‑management tools
- Apply microservices concepts and best practices
- Coordinate requirements, schedules, and activities to enable application development
- Address issues promptly and foster continuous improvement
- Conduct proofs of concept to validate designs and technologies
- Rapidly learn and adopt new services used in the environment