Senior Observability Engineer
PRI Technology
Holmdel · Hybrid · Contract · Senior · $80 – $85/hr · 2w ago
About the role
This role is responsible for the administration, configuration, implementation, and ongoing optimization of observability platforms that enable end-to-end visibility across applications, infrastructure, and cloud-native workloads.
Responsibilities
- Continuously monitor the health, availability, and performance of observability platforms.
- Ensure data integrity, retention, and availability across metrics, logs, and traces.
- Proactively identify and remediate platform performance, scalability, and reliability issues.
- Administer, configure, and support Splunk (OpenTelemetry or AppDynamics) platforms to meet enterprise monitoring and observability needs.
- Design and implement observability solutions aligned to MELT (Metrics, Events, Logs, Traces) best practices.
- Perform regular upgrades, patching, and security hardening of observability platforms.
- Implement and support observability for AWS services, including EKS, ECS, Lambda, SNS/SQS, S3, and CloudWatch.
- Deliver full-stack observability across applications, infrastructure, and cloud-native workloads.
- Create and maintain dashboards, reports, and alerts in AppDynamics and Splunk.
- Collaborate with application, platform, and DevOps teams to define meaningful monitoring and alerting standards.
- Reduce noise through alert tuning and promote actionable signal over raw data.
- Integrate observability into CI/CD pipelines using GitHub, Jenkins, ArgoCD, and automation frameworks.
- Develop scripts and automation using Python, JavaScript, or Bash to streamline onboarding, configuration, and maintenance activities.
- Partner with IT, SRE, and DevOps teams to ensure comprehensive monitoring coverage.
- Participate in incident response efforts, leveraging observability data to accelerate detection, diagnosis, and resolution.
Qualifications
- Bachelor’s degree in Computer Science, Information Technology, or a related field.
- 5–7+ years of experience in Observability, Monitoring, SRE, or Platform Engineering roles.
- Proven hands-on experience implementing, managing, and maintaining AppDynamics, Splunk, and OpenTelemetry in enterprise environments.
Observability Platforms:
- AppDynamics (APM, dashboards, alerts)
- Splunk (configuration, administration, data onboarding)
- OpenTelemetry (instrumentation, collectors, sampling)
- AWS services: EKS, ECS, Lambda, SNS/SQS, S3, CloudWatch
- GitHub, Jenkins, or ArgoCD
- CI/CD pipelines and GitOps practices
- Strong expertise in Metrics, Events, Logs, and Traces (MELT)
- Full-stack and cloud-native observability
- Scripting in Python, JavaScript, or Bash
- Strong understanding of IT infrastructure, applications, and networking
Skills
AppDynamics, AWS CloudWatch, AWS ECS, AWS EKS, AWS Lambda, AWS S3, AWS SNS, AWS SQS, ArgoCD, Bash, CI/CD, Docker, GitOps, GitHub, Jenkins, JavaScript, OpenTelemetry, Python, Splunk