SRE – Tools

Diligente Technologies

Vellore · On-site · Full-time · Senior · Today

About the role

We are looking for a Senior SRE (Tools & Telemetry) with strong hands-on experience in observability platforms to act as a technical SME across our monitoring and telemetry stack.

  • This role requires someone who can identify gaps, recommend improvements, and personally implement solutions across complex production environments.
  • Observability Stack: Dynatrace, Prometheus, Splunk, Grafana, Loki, OpManager.

Key Responsibilities

  • Serve as a hands-on SME for observability tools across applications and platforms.
  • Design, implement, and improve metrics, logs, dashboards, alerts, and SLOs.
  • Proactively identify observability gaps and drive improvements end-to-end.
  • Partner with SRE, DevOps, and application teams to reduce MTTR and improve reliability.
  • Optimize alerting, telemetry quality, and platform cost/performance.

Required Experience

  • 5+ years in SRE or observability roles.
  • Strong hands-on experience with Dynatrace, Prometheus, Splunk, OpManager, Grafana, and/or Loki.
  • Experience supporting production, business-critical systems.
  • Proven ability to recommend and implement improvements, not just operate tools.

What We’re Looking For

  • Deeply hands-on engineers with a strong ownership mindset.
  • Ability to act as an SME and raise observability maturity across teams.
  • Bias toward execution, automation, and continuous improvement.

Skills

Dynatrace, Grafana, Loki, OpManager, Prometheus, Splunk
