I
Observability Architect/ Engineer - Dynatrace
idexcel
Remote (Global) Contract Senior Today
About the role
About
We are seeking a highly experienced Senior Observability Architect/ Engineer with deep, hands‑on expertise in Dynatrace SaaS to lead enterprise‑scale observability deployments across AWS and Azure. This role will drive the design, automation, and rollout of Dynatrace capabilities for large, complex environments while partnering closely with DevOps, SRE, Cloud, and Application teams. The ideal candidate has 9+ years of Dynatrace implementation experience, strong DevOps and automation skills, and the ability to mentor engineering teams.
Dynatrace Expertise
- Lead large enterprise-scale deployments of Dynatrace observability across distributed microservices, serverless workloads, and multi‑region multi-cloud environments.
- Maintain Dynatrace governance and best practices, support multi-tenants, fine grained access controls, and logical segmentation of teams, apps, and environments.
- Configure and optimize APM instrumentation, Deep code‑level visibility, PurePath distributed tracing, Smartscape topology mapping, and other advanced Dynatrace features to ensure full‑stack observability.
- Build and maintain custom dashboards, management zones, tagging rules and entity metadata strategies.
- Develop and tune alerting profiles, anomaly detection rules, Davis AI configurations, and auto-remediation workflows.
- Leverage Davis AI to automatically identify Root Cause using causal analysis, correlate metrics, logs, traces, and events to reduce noise and eliminate false positives.
- Build HTTP, and Browser Synthetic Monitoring and performance baselines.
- Configure Real User Monitoring (RUM) for web and mobile applications, including User journey analysis, User experience insights, and performance KPIs.
- Implement and manage log ingest pipelines, log processing rules, retention policies, and Dynatrace Grail/Log Management features.
- Integrate with GitHub Actions, Jenkins, ServiceNow, PagerDuty, and Teams.
- Build OTel integrations and custom plugins.
DevOps Automation
- Implement CI/CD pipelines using tools such as GitHub Actions, AWS CodePipeline, and Jenkins.
- Automate infrastructure provisioning through Infrastructure-as-Code (IaC) using Terraform, CloudFormation, or AWS CDK.
- Develop self-service automation tools using Python or other scripting languages.
Incident Management & Response
- Proficient in ITIL framework and ITSM tools such as ServiceNow.
- Production on-call responder with strong troubleshooting capabilities.
- Develop RCA documentation, and Knowledge articles.
- Apply SRE principles, including SLIs, SLOs, and error budgets.
Security & Compliance Implementation
- Manage service accounts and access permissions.
- Create, deploy, and manage digital certificates.
- Respond to security incidents and execute remediation tasks effectively.
Education & Additional Experience
- Bachelor’s degree in Computer Science, Engineering, or related field
- 9+ years of Dynatrace implementation experience
- 5+ years of experience in DevOps, SRE, or infrastructure roles
- Knowledge of Linux systems and networking
- Working in a SAFe Agile delivery environment
- Excellent written and verbal communication skills.
- Demonstrated ability to work independently and manage priorities.
- Availability to work outside of standard business hours as required.
Skills
AWSAWS CDKAWS CodePipelineAzureCloudFormationCI/CDDockerDynatraceGitHub ActionsGrafanaGraphiteHTTPInfrastructure-as-CodeITILJenkinsLinuxMicroservicesNetworkingPagerDutyPythonRUMSAFeSREServiceNowTerraformTeamsTraceTypeScriptVue
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free