Site Reliability Engineer
Motion Recruitment
About the role
A model development platform redefining how engineering drawings and supply chain intelligence move across the manufacturing ecosystem is seeking a Site Reliability Engineer to help build infrastructure with aerospace-grade reliability.
This role centers on designing resilient cloud environments that support mission-critical AI workloads while protecting sensitive U.S. customer data tied to complex industrial programs. You’ll own core U.S. operational infrastructure and collaborate with a global engineering organization delivering systems that must scale, perform, and remain continuously available.
The environment emphasizes GCP-first architecture, multi-cloud redundancy, and platform tooling that enables hundreds of developers and AI engineers to ship safely. This role will pair GCP features with national security compliance practices to create a disciplined, growing Cloud/AI department with upward momentum.
Required Skills & Experience
- Strong production experience designing and operating Google Cloud Platform (GCP) infrastructure
- Infrastructure-as-Code expertise using Terraform to manage scalable, repeatable environments
- Deep Kubernetes experience operating highly available, resilient container platforms
Desired Skills & Experience
- Multi-cloud architecture supporting redundancy and regional isolation strategies
- Automated security testing, vulnerability scanning, and compliance enforcement within CI/CD pipelines
- Performance tuning, capacity planning, and disaster recovery engineering for distributed systems
Daily Responsibilities
- Hands-On Engineering: 70%
- Team Collaboration & Cross-Functional Work: 30%
Applicants must be currently authorized to work in the United States on a full-time basis now and in the future.