Site Reliability Engineer (SRE) – Synthetic Monitoring & Observability
Hire Orbitt
About the role
Job Title
Site Reliability Engineer (SRE) / Application Support
Location
Hybrid: 3 days a week on-site in Midtown Manhattan
Company
Financial Services
Salary range
$120K - $140K
About the Role
We’re looking for a Site Reliability Engineer (SRE) with strong experience in synthetic monitoring, observability, and application performance to support large‑scale, business‑critical systems within a major financial services organization.
This role is ideal for someone who enjoys building proactive monitoring, improving reliability, and partnering closely with engineering teams to prevent issues before they impact customers.
What You’ll Do
- Build, maintain, and optimize synthetic monitoring scripts for key applications and customer journeys
- Configure and tune Splunk and Dynatrace dashboards, alerts, and performance insights
- Partner with application and engineering teams to troubleshoot availability and latency issues
- Develop automation for monitoring, reporting, and health checks using Python or Shell
- Drive observability best practices across SLIs/SLOs, metrics, logs, and tracing
- Participate in incident triage, root‑cause analysis, and reliability improvement initiatives
- Deliver clear visibility into system health through dashboards, analytics, and reporting
What You Bring
- Hands‑on experience with synthetic monitoring tools (Dynatrace Synthetic, Selenium‑based frameworks, etc.)
- Experience supporting front-office trading desks or real-time financial applications
- Strong proficiency with Splunk (search queries, dashboards, alerting)
- Solid scripting skills in Python or Shell
- Understanding of APM, distributed tracing, performance tuning, and observability concepts
- Experience with CI/CD pipelines, cloud platforms, and containerized environments (Docker/Kubernetes)
- Strong communication skills and ability to collaborate across engineering, SRE, and application teams
Nice to Have
- Experience with AppDynamics, Datadog, New Relic, or ELK
- Background in financial services or large enterprise environments
- Exposure to SRE/DevOps practices and reliability engineering principles
- Familiarity with Unix/Linux and shell scripting
Job Type
Full-time
Pay
$120,000.00 - $170,000.00 per year
Benefits
- 401(k)
- Dental insurance
- Health insurance
- Paid time off
- Vision insurance
Work Location
Hybrid remote in New York, NY 10036