P
Site Reliability Engineer (SRE)
Peraton
US · On-site Full-time Mid Level $135k – $216k/yr 1w ago
About the role
About
Peraton seeks a Site Reliability Engineer (SRE) to support Department of Defense Cyber Defense Command (DCDC) with DCO AI Cyber Security Support. Location: Fort Meade, MD.
In this role, you will ensure reliability, performance, and availability of analytic platforms through monitoring, optimization, and incident response. You will operate as part of the Enablers Team.
Primary Responsibilities
- Monitor system resources using Grafana/CloudWatch
- Implement and maintain monitoring solutions
- Perform capacity planning and optimization
- Conduct root cause analysis for incidents
- Develop automation for reliability
- Define and track SLIs/SLOs
Required Qualifications
- Minimum of 8 years with BS/BA; Minimum of 6 years with MS/MA; Minimum of 3 years with PhD
- Must have/maintain a current Security+ or a comparable IA Cybersecurity certification
- Experience with monitoring tools (Grafana, Prometheus)
- Knowledge of SRE principles and practices
- Scripting and automation skills
- Incident management experience
- Performance analysis capabilities
- U.S Citizenship required
- DoD TS with ability to obtain/maintain SCI clearance
Preferred Qualifications
- Any of the SRE certifications:
- SRE Foundation (DevOps Institute/PeopleCert): Focuses on the core principles, practices, and terminology of SRE, ideal for beginners.
- SRE Practitioner (GSDC): Validates deeper knowledge, focusing on implementing SRE, managing large-scale systems, and operational excellence.
- SRE Fundamentals (APMG International): Covers foundational concepts, including incident management and automation
- 8 years of specific experience
- Bachelor's degree in Computer Science or related field
Skills
CloudWatchGrafanaPrometheus
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free