Lead Site Reliability Engineer - Storage Layer Services
MongoDB
About the role
Overview
Join MongoDB's innovative Storage Layer Services (SLS) team, dedicated to re-architecting our cloud storage layer. As a pivotal unit in our next‑generation cloud architecture, we are crafting high‑performance, multi‑tenant distributed storage services to enhance our Atlas storage stack and optimize customer workloads.
As the Lead Site Reliability Engineer for SLS, you will collaborate closely with engineering teams to define service level objectives (SLOs), shape capacity plans, and ensure the reliability and durability of the foundational storage layer for Atlas. In this role, you will be instrumental in building and leading a skilled team of Site Reliability Engineers (SREs) as we execute on a multi‑year roadmap for MongoDB's cloud storage architecture.
Location: New York City office (hybrid work model) or remote from the East Coast.
Key Responsibilities
- Build and lead a talented team of 6‑8 engineers, fostering a positive work culture, guiding career growth and performance, and proactively addressing obstacles.
- Define and communicate a clear technical vision and strategic roadmap for our multi‑tenant storage systems, aligning long‑term infrastructure goals with immediate engineering needs.
- Contribute through hands‑on technical engagement by leading architectural design reviews, reviewing pull requests, and assisting the team through complex operational challenges.
- Serve as the primary liaison for the SLS SRE team, closely collaborating with engineering leaders to ensure alignment across platforms and manage stakeholder expectations.
You May Be a Great Fit If You
- Have 10+ years of experience in software development and operating distributed systems, with 2+ years in a management role.
- Possess a customer‑focused mindset, treating internal developers as primary users and advocates.
- Value efficient processes and operations, showcasing a proven history of optimizing workflows.
- Prefer automation over manual processes, fostering a culture of software solutions that minimize toil.
- Have deep technical knowledge of Kubernetes ecosystems, containerization technologies, and modern Infrastructure as Code (IaC) tools such as Terraform, Crossplane, or Operators, allowing you to guide the team's technical decisions effectively.
- Have experience operating or supporting stateful storage or database systems at scale, with comfort in navigating durability, consistency, and recovery trade‑offs.
- Excel in translating complex business and engineering requirements into actionable, phased technical roadmaps.
- Exhibit a high level of empathy, responsibility, ownership, and accountability.
- Demonstrate excellent verbal and written communication skills in technical contexts.
Strong Candidates May Also Have Experience With
- Leading significant architectural transitions, such as upgrading from legacy storage systems to modern multi‑tenant architectures, including planning and executing large‑scale data and workload migrations with stringent availability and durability standards.
- Managing and scaling infrastructure across multiple cloud services (AWS, GCP, or Azure).
- Designing secure multi‑tenant runtime environments at scale.
About MongoDB
MongoDB empowers innovation, enabling our customers and our teams to adapt swiftly to market changes. We have transformed the database landscape for the AI era, allowing creators to revolutionize industries through software. Our unified database platform is the most widely available, globally distributed database, assisting organizations in modernizing legacy workloads and embracing new innovations. MongoDB Atlas is the only globally distributed, multi‑cloud database available across AWS, Google Cloud, and Microsoft Azure.
With a global presence and over 60,000 customers—including 75% of the Fortune 100 and numerous AI‑native startups—MongoDB is facilitating the next generation of software solutions.
Our company culture is shaped by our Leadership Commitment, guiding our decision‑making processes, collaboration, and victories. This commitment is what makes MongoDB distinctive.
We are dedicated to fostering personal growth and maximizing our employees' business impact while ensuring a supportive and enriching workplace environment. We offer a range of employee benefits, from affinity groups to comprehensive parental leave policies, reflecting our commitment to the wellbeing of our team.
MongoDB is committed to providing necessary accommodations for individuals with disabilities throughout our application and interview processes. Please contact your recruiter to request any accommodations.
MongoDB, Inc. is an equal opportunity employer, prohibiting discrimination and harassment of any kind. All employment decisions are made without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, and any other characteristic protected by federal, state, or local laws.
Compensation & Benefits
- Req ID: 1273396229
- Base salary range (U.S.): $151,000‑$297,000 USD. Compensation during the offer stage is unique to each candidate, based on factors such as skills, experience, qualifications, and work location.
- Salary is just one portion of MongoDB's comprehensive compensation and benefits package, which may include equity, participation in our employee stock purchase program, flexible paid time off, 20 weeks of fully‑paid gender‑neutral parental leave, fertility and adoption assistance, 401(k) plan, mental health counseling, and various health benefits offerings.
- Please note that the salary range and benefits mentioned are applicable only to U.S.-based candidates.
Requirements
- Have 10+ years of experience in software development and operating distributed systems, with 2+ years in a management role
- Possess a customer-focused mindset, treating internal developers as primary users and advocates
- Value efficient processes and operations, showcasing a proven history of optimizing workflows
- Prefer automation over manual processes, fostering a culture of software solutions that minimize toil
- Have deep technical knowledge of Kubernetes ecosystems, containerization technologies, and modern Infrastructure as Code (IaC) tools such as Terraform, Crossplane, or Operators, allowing you to guide the team's technical decisions effectively
- Have experience operating or supporting stateful storage or database systems at scale, with comfort in navigating durability, consistency, and recovery trade-offs
- Excel in translating complex business and engineering requirements into actionable, phased technical roadmaps
- Exhibit a high level of empathy, responsibility, ownership, and accountability
- Demonstrate excellent verbal and written communication skills in technical contexts
- Strong candidates may also have experience with
- Leading significant architectural transitions, such as upgrading from legacy storage systems to modern multi-tenant architectures, including planning and executing large-scale data and workload migrations with stringent availability and durability standards
- Managing and scaling infrastructure across multiple cloud services (AWS, GCP, or Azure)
- Designing secure multi-tenant runtime environments at scale
Responsibilities
- In this role, you will be instrumental in building and leading a skilled team of Site Reliability Engineers (SREs) as we execute on a multi-year roadmap for MongoDB's cloud storage architecture
- Build and lead a talented team of 6-8 engineers, fostering a positive work culture, guiding career growth and performance, and proactively addressing obstacles
- Define and communicate a clear technical vision and strategic roadmap for our multi-tenant storage systems, aligning long-term infrastructure goals with immediate engineering needs
- Contribute through hands-on technical engagement by leading architectural design reviews, reviewing pull requests, and assisting the team through complex operational challenges
- Serve as the primary liaison for the SLS SRE team, closely collaborating with engineering leaders to ensure alignment across platforms and manage stakeholder expectations
Benefits
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free