Skip to content
mimi

Azure Site Reliability Engineer - Datadog Observability

Droisys

Glen Allen · On-site Contract Mid Level 3w ago

About the role

Droisys is an innovation technology company focused on helping companies accelerate their digital initiatives from strategy and planning through execution. We leverage deep technical expertise, Agile methodologies, and data-driven intelligence to modernize systems of engagement and simplify human/tech interaction.

At Droisys, we invest in our talent and support career growth, and we are always on the lookout for amazing talent who can contribute to our growth by delivering top results for our clients. Join us to challenge yourself and accomplish work that matters.

Role Summary

  • We are seeking Site Reliability Engineers SREs with strong Datadog observability experience to help build and scale a single pane of glass monitoring and observability platform across applications and infrastructure
  • This role will focus on designing actionable dashboards APM synthetic monitoring and ing standards while driving observability as code and automation in partnership with the CloudOps team
  • The ideal candidate has 2-3 years of hands on experience with Datadog and enjoys combining engineering discipline automation and reliability practices to improve system visibility and operational outcomes

Required Qualifications

  • 2-3 years hands on experience in SRE Observability or Production Operations roles
  • Strong practical experience with Datadog including dashboards monitors APM logs and synthetics
  • Experience automating infrastructure or observability using Terraform
  • Experience scripting or automating operational tasks using PowerShell especially for agent installation on VMs
  • Working knowledge of Azure cloud services and cloud native architectures
  • Strong troubleshooting skills and a mindset focused on reliability and prevention

Preferred Qualifications

  • Experience with observability platform rollouts or migration from tools such as App Insights or LogicMonitor to Datadog
  • Experience working with GitHub/ GitHub Actions or similar CICD tools for automation workflows
  • Familiarity with SRE concepts such as SLIs SLOs error budgets and incident response frameworks
  • Exposure to containerized environments AKS Kubernetes and distributed systems observability

Droisys is an equal opportunity employer. We do not discriminate based on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. Droisys believes in diversity, inclusion, and belonging, and we are committed to fostering a diverse work environment.

Skills

AKSAPMAzureDatadogDevOpsGitHub ActionsKubernetesLogicMonitorPowerShellSRETerraform

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free