M
Senior SRE for Cloud Storage Systems
MongoDB
Toronto · On-site Full-time Senior 1w ago
About the role
About
Join a dynamic team as a Senior Site Reliability Engineer in cloud storage. Focus on enhancing reliability and operational safety in distributed systems tailored for diverse customer workloads.
With 6+ years of industry experience, leverage your coding skills in Python or Go and your expertise in container technologies, especially Kubernetes. Your background should reflect a commitment to efficiency, self-healing infrastructures, and customer-centered design—critical for our multi-year roadmap.
Key Responsibilities:
- Design and implement distributed storage systems
- Build resilient, available, and fault-tolerant infrastructures
- Establish metrics for service health and performance
- Engage in a 24/7 on-call rotation
- Enhance performance from kernel to application layers
Requirements:
- Minimum of 6 years in distributed systems engineering
- Proficient with cloud services: AWS, GCP, or Azure
- Practical experience with container management
- Familiarity with Linux and networking concepts
- Preference for automation in operational tasks
Transform cloud storage reliability while optimizing performance through innovative engineering practices.
Skills
AWSGCPGoKubernetesLinuxPython
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free