Senior Site Reliability Engineer
Persistent Systems
About the role
About Persistent
We are a trusted Digital Engineering and Enterprise Modernization partner, combining deep technical expertise and industry experience to help our clients anticipate what’s next. Our offerings and proven solutions create unique competitive advantage for our clients by giving them the power to see beyond and rise above.
We are experiencing tremendous growth, with $566 million in revenue in FY21, representing 12.9% year-over-year growth. Along with that growth, we onboarded over 3,000 new employees in the past year, bringing our total employee count to over 15,000 people located in 18 countries across the globe.
At Persistent, our values are more than a list of ideals to improve our corporate image. We’re dedicated to building an inclusive culture that reflects what’s important to our employees and is based on what they value. As a result, 95% of our employees approve of the CEO and 83% recommend working at Persistent to a friend.
About Position
We are searching for motivated self-starters to join our team as a Cloud Engineer. The ideal candidate will be proficient in a range of AWS services and related technologies, with a proven ability to design, implement, and manage cloud infrastructure in a dynamic environment.
Role: Senior Site Reliability Engineer (SRE) - Release & Observability
Location: Scottsdale, AZ (onsite)
Hire Type: Contract
Experience: 8+ years of experience
What You'll Do
- Own release automation, deployment strategies, rollback mechanisms, and release validation
- Drive continuous improvements in release safety, reliability, monitoring, alerting, and operational readiness
- Lead troubleshooting of complex production incidents and service degradations
- Participate in on-call rotations and lead incident response and post-incident reviews
Expertise You'll Bring
- Solid hands-on experience in SRE or Release Engineering roles
- Strong experience deploying and operating containerized applications on Kubernetes across on-prem and AWS Cloud
- Strong knowledge of Linux and networking fundamentals
- Proven experience supporting REST API services in production environments
- Experience with monitoring and observability tools such as Splunk and Prometheus/Grafana
Benefits
- Competitive salary and benefits package
- Culture focused on talent development with quarterly promotion cycles and company-sponsored higher education and certifications
- Opportunity to work with cutting-edge technologies
- Employee engagement initiatives such as project parties, flexible work hours, and ‘Long Service’ awards
- Annual health check-ups as well as insurance:
  - Group term life insurance
  - Personal accident insurance
  - Mediclaim hospitalization insurance for self, spouse, two children, and parents
Why Persistent is an employer of choice
- Technology Innovation: culture of innovation using cutting-edge technology to bring value to clients.
- Growth and Career Progression: learning opportunities for growth, including quarterly promotion cycles.
- One Persistent Culture: global outlook with diversity and inclusion at its core.
- Mental and Physical Wellness: employee health and mindfulness programs.