Skip to content
mimi

Cloud Engineer SRE (Site Reliability Engineer)

General Dynamics Information Technology

US · On-site Full-time Senior Today

About the role

Clearance Level

Secret

Category

Cloud

Location

Fort Liberty, North Carolina – Onsite Workplace

Requisition Type

Regular

Your Impact

Own your opportunity to work with the largest government agency in the nation. Make an impact by advancing the Department of Defense's mission to keep our country safe and secure.

Job Title

Cloud Developer Sr Advisor – Cloud Site Reliability Engineer (SRE)

How a Cloud Developer Sr Advisor Will Make an Impact

  • Run the production environment by monitoring availability and taking a holistic view of system health.
  • Build software and systems to manage platform infrastructure and applications.
  • Improve reliability, quality, performance for cloud-hosted applications.
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve.
  • Provide primary operational support and engineering for multiple large, distributed software applications.
  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding.
  • Partner with development teams to improve services through rigorous testing and release procedures.
  • Participate in system design consulting, platform management, and capacity planning.
  • Create sustainable systems and services through automation and uplifts.
  • Balance feature development speed and reliability with well-defined service level objectives.

What You'll Need to Succeed

Education

  • Bachelor's Degree in a STEM field.
  • DoD 8570 Level II (Security +)

Required Experience

  • 8+ years of related experience

Required Technical Skills

  • Ability to program (structured and OO) with one or more high‑level languages, such as Python, Java, C/C++, Ruby, and JavaScript.
  • Adept Shell/BASH scripter
  • Experience with distributed storage technologies like NFS, HDFS, Ceph, and S3.
  • 2+ years of experience working with container orchestration technologies, specifically Kubernetes.
  • Security Clearance Level: Secret to start, must be able to obtain TS/SCI

Required Skills and Abilities

  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks along with an ability to offer and implement solutions to address these.
  • Experience creating dashboards to track service health that appeal to both technical and non‑technical audiences preferably with Splunk.
  • Excellent written and verbal communication skills, with a strong attention to detail and a head for problem solving.
  • Skilled at working in tandem with a team, or unsupervised as required

Preferred Skills

  • Experience working with identity and access management technologies and solutions.
  • Experience with Agile development methodologies; using collaboration tools such as Jira and Confluence.
  • Experience with monitoring and logging solutions, specifically Splunk
  • Any of the following: AWS Certified SysOps Administrator Associate or AWS Certified Solutions Architect Associate or any Professional level of the above‑mentioned certs where applicable
  • 1+ years' experience working with Gitlab
  • Skilled at creating Ansible playbooks, working with AWX/Ansible Tower

Additional Requirements

  • Location: On Customer Site
  • US Citizenship Required

GDIT Is Your Place

  • 401K with company match
  • Comprehensive health and wellness packages
  • Internal mobility team dedicated to helping you own your career
  • Professional growth opportunities including paid education and certifications
  • Cutting‑edge technology you can learn from
  • Rest and recharge with paid vacation and holidays

Requirements

  • Bachelor's Degree in a STEM field
  • DoD 8570 Level II (Security +)
  • Required Experience: 8+ years of related experience
  • Required Technical Skills:
  • Ability to program (structured and OO) with one or more high level languages, such as Python, Java, C/C++, Ruby, and JavaScript
  • Adept Shell/BASH scripter
  • Experience with distributed storage technologies like NFS, HDFS, Ceph, and S3
  • 2+ years of experience working with container orchestration technologies, specifically Kubernetes
  • Security Clearance Level: Secret to start, must be able to obtain TS/SCI
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks along with an ability to offer and implement solutions to address these
  • Experience creating dashboards to track service health that appeal to both technical and non-technical audiences preferably with Splunk
  • Excellent written and verbal communication skills, with a strong attention to detail and a head for problem solving
  • Skilled at working in tandem with a team, or unsupervised as required
  • Experience working with identity and access management technologies and solutions
  • Experience with Agile development methodologies; using collaboration tools such as Jira and Confluence
  • Experience with monitoring and logging solutions, specifically Splunk
  • Any of the following: AWS Certified SysOps Administrator Associate or AWS Certified Solutions Architect Associate or any Professional level of the above-mentioned certs where applicable
  • 1+ years' experience working with Gitlab
  • Skilled at creating Ansible playbooks, working with AWX/Ansible Tower
  • Location: On Customer Site
  • US Citizenship Required

Responsibilities

  • Make an impact by advancing the Department of Defense's mission to keep our country safe and secure
  • Cloud Developer Sr Advisor Own the opportunity as a Cloud Site Reliability Engineer (SRE) and help ensure the mission is never interrupted
  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Build software and systems to manage platform infrastructure and applications
  • Improve reliability, quality, performance for cloud-hosted applications
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
  • Provide primary operational support and engineering for multiple large, distributed software applications
  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
  • Partner with development teams to improve services through rigorous testing and release procedures
  • Participate in system design consulting, platform management, and capacity planning
  • Create sustainable systems and services through automation and uplifts
  • Balance feature development speed and reliability with well-defined service level objectives

Benefits

paid_time_offhealth_insurance

Skills

AnsibleAWSAWXBashC++CephGitlabHDFSJavaJavaScriptJiraKubernetesNFSPythonRubyS3Security+Splunk

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free