Site Reliability Engineer III - Incident Management, Linux, Root Cause Analysis
Groupon
About the role
About
As a Site Reliability Engineer (Incident Management) at Groupon, you will be responsible for supporting and optimizing the process, implementation, and operational support of internal systems that span business side and engineering departments. You will have the opportunity to leverage Site Reliability Engineering best practices and ITIL Solutions Architecture framework to devise incident management strategies.
At Groupon, we value engineers who are customer‑focused, team players, fast learners, pragmatic, and owners. If you are passionate, energetic, and a technology enthusiast who enjoys debugging infrastructure platforms and solving problems, then Groupon is the right place for you. Join us on our mission to become the ultimate destination for local experiences and services.
Responsibilities
- Acting as Incident Commander, change manager, and a senior technical resource to prevent, identify, triage, document, investigate, mitigate, and recover from site/service impacting incidents across Groupon's ~300+ globally dispersed services.
- Facilitating the coordination and resolution of Post Mortems through best practices and overseeing Problem Management.
- Dedicated project time to work on a number of interesting and engaging projects.
- Working as part of the Incident Management team (Shift Monday‑Friday with one weekend primary on‑call every 6 weeks).
Requirements
- 6+ years of experience administering Linux system environments and conducting root cause analysis of site impacting issues.
- 6+ years of experience with web applications operations and root cause analysis.
- 4+ years of experience creating unique Splunk or Kibana search queries to identify, resolve, and prevent incidents and outages, and owning all impacting events until resolution.
- 6+ years of experience developing policies and procedures that improve overall production stability.
- Good communication, consulting, and collaboration skills interfacing with senior leadership teams.
- Experience with one or more programming languages (Python, Ruby, Java).
- A plus if you have a BS, MS, or PhD in Computer Sciences or related fields.
- A plus if you have designed and created tools to manage the site and services.
Additional Company Details
- Purpose: Groupon's purpose is to build strong communities through thriving small businesses.
- About Groupon: To learn more about the world's largest local e‑commerce marketplace, visit the latest Groupon news.
- Recruitment: Groupon follows a merit‑based recruitment process without charging any fees to job seekers. Be cautious of recruitment fraud and always check the official careers website at grouponcareers.com for legitimate job openings.
Requirements
- 6+ years of experience administering Linux system environments and conducting root cause analysis of site impacting issues.
- 6+ years of experience with web applications operations and root cause analysis.
- 4+ years of experience creating unique Splunk or Kibana search queries to identify, resolve, and prevent incidents and outages, and owning all impacting events until resolution.
- 6+ years of experience developing policies and procedures that improve overall production stability.
- Good communication, consulting, and collaboration skills interfacing with senior leadership teams.
- Experience with one or more programming languages (Python, Ruby, Java).
Responsibilities
- Acting as Incident Commander, change manager, and a senior technical resource to prevent, identify, triage, document, investigate, mitigate, and recover from site/service impacting incidents across Groupon's ~300+ globally dispersed services.
- Facilitating the coordination and resolution of Post Mortems through best practices and overseeing Problem Management.
- Dedicated project time to work on a number of interesting and engaging projects.
- Working as part of the Incident Management team (Shift Monday-Friday with one weekend primary on-call every 6 weeks)
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free