Site Reliability Engineering - Incident Response Manager

Jobs via Dice

Cleveland · On-site · Full-time · Lead · Today

About the role

Description

  • 8+ years in IT Operations / SRE / Technical Operations
  • 3+ years in leadership managing 24x7 teams
  • Strong hands-on experience in Incident Management (ITIL framework)
  • Expertise in:
    • Linux environments
    • Monitoring tools (Prometheus, Grafana, Datadog, Zabbix)
    • Kubernetes
    • Networking (TCP/IP, DNS, BGP)
    • Ticketing tools (ServiceNow / Jira Service Management)
  • Strong leadership, decision-making, and communication skills under pressure

Skills

Datadog, Grafana, ITIL, Jira Service Management, Kubernetes, Linux, Prometheus, ServiceNow, TCP/IP, Zabbix, BGP, DNS
