Site Reliability Engineer

Scaleops

New York · On-site Full-time Senior 1mo ago

About the role

About ScaleOps

ScaleOps, the leader in real-time automated cloud resource management, is revolutionizing how DevOps teams manage their cloud-native application infrastructures. Backed by venture capital and software industry titans, ScaleOps’ platform removes the organizational friction between application owners and DevOps teams by fully automating the resource management process to meet real-time demand.

The ScaleOps platform dynamically manages the application’s resource allocation, eliminating the need for manual intervention. The result is improved application performance, 60%- 80% cloud cost savings, and a fully automated allocation process.

With well over $210 million in backing, ScaleOps has seen tremendous business growth, attracting global industry leaders to its customer base. ScaleOps automatically manages the production environments of over 50 enterprises, including Adobe, Salseorce,Wiz, Docusign, EA (EA Sports), Coupa.

What You’ll Do

Own ScaleOps’ infrastructure end-to-end - our self-hosted product, its installation and onboarding flows, our SaaS platform, and AI infrastructure
Manage ScaleOps’ cloud infrastructure across AWS, GCP, and Azure - networking, security, SSO, and compute
Work closely with customers on deployments and troubleshooting
Collaborate closely with backend, product, and R&D teams to support rapid feature delivery without compromising reliability
Identify and eliminate operational toil through automation and tooling improvements
Maintain security and compliance best practices across the infrastructure stack

Requirements

What You’ll Bring

5+ years of hands-on development experience in infrastructure, platform engineering, or SRE roles in high-scale distributed systems
Hands-on experience with at least two major cloud providers (AWS, GCP, or Azure)
Solid understanding of networking, security groups, IAM, SSO/OIDC, and cloud-native security principles
A strong ownership mentality — you don’t just flag problems, you fix them
Full professional fluency in Hebrew and English

Advantage

Deep expertise with Kubernetes, Helm, and Go — you don’t just deploy, you understand the internals and can build on them
Experience with cloud cost optimization, resource management, or working at a company that sells infrastructure tooling

Skills

AWSAzureGCPGoHelmIAMKubernetesOIDCSSO

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free

Site Reliability Engineer

About the role

About ScaleOps

What You’ll Do

Requirements

What You’ll Bring

Advantage

Skills

Similar roles

Software Engineer

Senior Database Engineer

Team Leads

Don't send a generic resume