Skip to content
mimi

Site Reliability Engineer

Scaleops

New York · On-site Full-time Senior Yesterday

About the role

About ScaleOps

ScaleOps, the leader in real-time automated cloud resource management, is revolutionizing how DevOps teams manage their cloud-native application infrastructures. Backed by venture capital and software industry titans, ScaleOps’ platform removes the organizational friction between application owners and DevOps teams by fully automating the resource management process to meet real-time demand.

The ScaleOps platform dynamically manages the application’s resource allocation, eliminating the need for manual intervention. The result is improved application performance, 60%- 80% cloud cost savings, and a fully automated allocation process.

With well over $210 million in backing, ScaleOps has seen tremendous business growth, attracting global industry leaders to its customer base. ScaleOps automatically manages the production environments of over 50 enterprises, including Adobe, Salseorce,Wiz, Docusign, EA (EA Sports), Coupa.

What You’ll Do

  • Own ScaleOps’ infrastructure end-to-end - our self-hosted product, its installation and onboarding flows, our SaaS platform, and AI infrastructure
  • Manage ScaleOps’ cloud infrastructure across AWS, GCP, and Azure - networking, security, SSO, and compute
  • Work closely with customers on deployments and troubleshooting
  • Collaborate closely with backend, product, and R&D teams to support rapid feature delivery without compromising reliability
  • Identify and eliminate operational toil through automation and tooling improvements
  • Maintain security and compliance best practices across the infrastructure stack

Requirements

What You’ll Bring

  • 5+ years of hands-on development experience in infrastructure, platform engineering, or SRE roles in high-scale distributed systems
  • Hands-on experience with at least two major cloud providers (AWS, GCP, or Azure)
  • Solid understanding of networking, security groups, IAM, SSO/OIDC, and cloud-native security principles
  • A strong ownership mentality — you don’t just flag problems, you fix them
  • Full professional fluency in Hebrew and English

Advantage

  • Deep expertise with Kubernetes, Helm, and Go — you don’t just deploy, you understand the internals and can build on them
  • Experience with cloud cost optimization, resource management, or working at a company that sells infrastructure tooling

Skills

AWSAzureGCPGoHelmIAMKubernetesOIDCSSO

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free