JV
Site Reliability Engineering - Incident Response Manager
Jobs via Dice
Cleveland · On-site Full-time Lead Today
About the role
Description
- 8+ years in IT Operations / SRE / Technical Operations
- 3+ years in leadership managing 24x7 teams
- Strong hands-on experience in Incident Management (ITIL framework)
- Expertise in:
- Linux environments
- Monitoring tools (Prometheus, Grafana, Datadog, Zabbix)
- Kubernetes
- Networking (TCP/IP, DNS, BGP)
- Ticketing tools (ServiceNow / Jira Service Management)
- Strong leadership, decision-making, and communication skills under pressure
Skills
DatadogGrafanaITILJira Service ManagementKubernetesLinuxPrometheusServiceNowTCP/IPZabbixBGPDNS
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free