L
SRE - Tools & Telemetry (SME)
Hubballi · On-site Full-time Senior Today
About the role
About
Looking for a Senior SRE – Tools & Telemetry with strong hands-on experience in observability platforms to act as a technical SME across our monitoring and telemetry stack.
- This role requires someone who can identify gaps, recommend improvements, and personally implement solutions across complex production environments.
- Observability Stack: Dynatrace, Prometheus, Splunk, Grafana, Loki, OpManager.
Key Responsibilities
- Serve as a hands-on SME for observability tools across applications and platforms.
- Design, implement, and improve metrics, logs, dashboards, alerts, and SLOs.
- Proactively identify observability gaps and drive improvements end-to-end. Partner with SRE, DevOps, and application teams to reduce MTTR and improve reliability.
- Optimize alerting, telemetry quality, and platform cost/performance.
Required Experience
- 5+ years in SRE, Observability roles.
- Strong hands-on experience with Dynatrace, Prometheus, Splunk, OpManager, Grafana, and/or Loki.
- Experience supporting production, business-critical systems.
- Proven ability to recommend and implement improvements, not just operate tools.
What We’re Looking For
- Deeply hands-on engineers with a strong ownership mindset.
- Ability to act as an SME and raise observability maturity across teams.
- Bias toward execution, automation, and continuous improvement.
Requirements
- Strong hands-on experience with Dynatrace, Prometheus, Splunk, OpManager, Grafana, and/or Loki.
- Experience supporting production, business-critical systems.
- Proven ability to recommend and implement improvements, not just operate tools.
Responsibilities
- Serve as a hands-on SME for observability tools across applications and platforms.
- Design, implement, and improve metrics, logs, dashboards, alerts, and SLOs.
- Proactively identify observability gaps and drive improvements end-to-end.
- Partner with SRE, DevOps, and application teams to reduce MTTR and improve reliability.
- Optimize alerting, telemetry quality, and platform cost/performance.
Skills
DynatraceGrafanaLokiOpManagerPrometheusSplunk
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free