Senior Site Reliability Engineer
HiBob
About the role
About
Drive cutting-edge AI initiatives as a Senior Site Reliability Engineer. Enjoy remote work while ensuring the reliability of our scalable AWS / Kubernetes environment and collaborating with global teams.
The ideal candidate boasts over 5 years of experience in a production engineering role, emphasizing cloud orchestration skills and coding proficiency in Python or Go. Your deep understanding of AI technologies will allow you to design innovative solutions that enhance operations. This is an opportunity to have a significant voice in technology decisions while managing production workloads independently.
Key Responsibilities
- Build and manage Kubernetes infrastructure on AWS
- Develop AI agents for incident responses
- Establish GitOps CI / CD pipelines using GitHub Actions
- Create developer self-service platforms and tools
- Oversee observability and monitoring via Datadog
Requirements
- Minimum 5 years as a Senior SRE or Production Engineer
- Proven experience in high-traffic SaaS environments
- Expertise in AWS and Kubernetes
- Proficient in Python or Go programming
- Knowledge of AI technology and applications
Shape the AI-driven future of operations while working remotely across diverse teams.
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free