OS
Site Reliability Engineer - Junior/ Jounerman
OSAAVA Services LLC
Leesburg · On-site Contract Entry Level $100k – $125k/yr 3d ago
About the role
POSITION OVERVIEW
OSAAVA Services is seeking a Junior to Journeyman Site Reliability Engineer (SRE) to support at Langley AFB, Virginia. In this role, you will apply software engineering and systems operations disciplines to maintain the reliability, availability, and performance of mission‑critical infrastructure — automating processes, building observability pipelines, and responding to incidents in a classified DoD environment. This is a Monday–Friday on‑site position.
Occasional schedule adjustments or on‑call responsibilities may be required to support operational needs with advance notice provided.
WHAT YOU WILL DO
System Reliability & Operations
- Monitor production system health, availability, and performance across classified DoD infrastructure
- Define, implement, and track Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to maintain operational standards
- Respond to and triage system incidents; conduct root cause analysis and implement corrective actions to prevent recurrence
- Build and maintain monitoring, logging, and alerting pipelines using tools such as Prometheus, Grafana, Splunk, or equivalent
Automation & Infrastructure as Code
- Develop and maintain automation scripts in Python, Bash, or Power Shell to reduce manual operational toil
- Build and manage CI/CD pipelines (Jenkins, Git Lab CI, or equivalent) to support continuous delivery workflows
- Implement and maintain infrastructure‑as‑code (IaC) solutions using Terraform or Ansible
- Automate system patching, configuration management, and deployment processes across the environment
Security & Compliance
- Operate and harden Linux (RHEL preferred) and Windows Server environments in compliance with DoD STIGs
- Support RMF accreditation activities and maintain compliance with NIST 800‑53 security controls
- Perform STIG compliance scans and assist with remediation in classified environments
- Enforce security policies and configuration baselines across managed systems
Ticketing, Collaboration & Documentation
- Resolve Incident, Work Order, Task, and Change requests via the Service Now ticketing system
- Collaborate with Tier III engineering, network, and cybersecurity teams to resolve escalated issues
- Maintain technical documentation including runbooks, SOPs, and system architecture diagrams
- Participate in post‑incident reviews and contribute to continuous improvement initiatives
REQUIRED QUALIFICATIONS
- Active DoD Secret clearance with CV enrollment current in DISS
- 1–5 years of experience in site reliability engineering, systems administration, Dev Ops, or IT operations
- Demonstrated experience with Linux system administration (RHEL, CentOS, or equivalent strongly preferred)
- Hands‑on experience writing automation scripts in Python, Bash, or Power Shell
- Working knowledge of monitoring and observability tools (Prometheus, Grafana, Nagios, Splunk, or equivalent)
- Experience with Git‑based version control workflows
- Basic understanding of TCP/IP networking, DNS, VLANs, and firewall concepts
- Ability to operate in classified environments and access secured facilities at Langley AFB (JBLE)
- DoD 8570.01‑M / DoD 8140 IAT Level II certification required at start date — Security+ CE minimum
- U.S. Citizenship required
PREFERRED QUALIFICATIONS
- Bachelor’s degree in Computer Science, Information Technology, Systems Engineering, or related field (equivalent experience accepted in lieu of degree)
- Prior DoD or USAF IT experience, particularly supporting Air Force network or infrastructure programs
- Experience with containerization and orchestration: Docker, Kubernetes, or equivalent
- Familiarity with VMware or other hypervisor‑based virtualization environments
- Experience with ITSM platforms in a DoD environment (Service Now, Remedy)
- Hands‑on experience with configuration management tools: Ansible, Puppet, or Chef
- Cloud platform exposure: AWS Gov Cloud, Azure Government, or hybrid on‑premise/cloud environments
- Familiarity with DoD Risk Management Framework (RMF) and…
Skills
AnsibleBashCI/CDCentOSDockerGitGit Lab CIGrafanaJenkinsKubernetesLinuxNagiosNIST 800-53Power ShellPrometheusPythonRHELRemedyService NowSplunkTerraformVMwareWindows Server
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free