Skip to content
mimi

Site Reliability Engineer

Elite Technical

Reston · Hybrid Contract Senior Today

About the role

About

Elite Technical is seeking a Site Reliability Engineer in the Washington DC, Maryland and/or Virginia area for a long term contract position with our customer in Reston VA.

Roles & Responsibilities

  • Communicates Architectural decisions, plans, goals, and strategies, while highlighting short-term trade-offs vs. long-term commitments and costs
  • Engage in and improve the end-to-end Lifecycle of services, starting from Inception & design, deployment, and operations.
  • Establish automation capabilities leveraging Cloud native solutions, to improve the Developer experience.
  • Support activities, including System design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
  • Willingness to roll up the sleeves and troubleshoot difficult issues and engage the Customer.
  • Willingness to learn new AWS Services and other technologies as required.
  • Systems Scalability and sustainability leveraging automation and strive to improve our systems with changes that improve reliability and velocity.
  • Experience with Enterprise Cloud transformation and migration efforts.
  • Actively participate and help guide customers on using Cloud-native design and architecture patterns.
  • Provide Consultation on Technology infrastructure planning and engineering for assigned systems; Assesses the implications of technology strategies on infrastructure capabilities.
  • Establish strategies to migrate Legacy applications by conversion to multiple Microservices and hosting on AWS Cloud platform.
  • Leverage Cloud-native architecture components including Containers, immutable infrastructure, Microservices, Service Mesh etc., to build highly available and Fault tolerant applications.
  • Conduct research on the global technology trends and their applicability to FEPOC products in support of our internal development teams and business initiatives.
  • Promotes and ensures Modern application design, applies engineering best practices in the development and operations life cycle and mitigates vulnerabilities.
  • Monitors and manages the Stability, Availability, and Performance of enterprise systems and platforms across IT domains.
    • (e.g., Systems, Network, Storage, Security) by analyzing systems to identify problems, trends, and opportunities for improvement.
  • Automate end to end process to maintain (patches and upgrades) of our AWS Cloud ecosystem.
  • Makes data-driven recommendations and decisions and continuously improves the overall efficacy and efficiency of our software delivery capabilities.
  • Mentoring peers as well as engaging with others across teams and socializing solutions.

Required Skills

  • Minimum of One AWS certification is required.
  • Minimum of 10 years of IT experience of which at least 5 years must be in AWS Cloud
  • Platform engineering and Administration.
  • Strong Leadership experience with driving Transformation initiatives
  • 3-5 years of experience in a Site Reliability Engineering role
  • Experience with SRE principles and transformation
  • 3+ years of experience with Containerization (Kubernetes), Cloud technologies (AWS, Azure etc.), DevOps tool chain (Ansible, Jenkins, Artifactory, bitbucket, etc.), and technical patterns (IaC, Automated Provisioning/Release, CI/CD, etc.)
  • Solid understanding of Software coding techniques and experience with full spectrum of Software engineering (Build, Integration, Test, Releasing and Deployment) leveraging Python.
  • Experience in Developing and/or challenging engineering solutions/practices and collaborating with peers within and outside of immediate team, including customers (Dev, Architects, Engineers)
  • Platform Engineering Lead with Hands -on Experience: Building robust Middleware Environments, previous Linux System administration is required.
  • Must have strong hands-on knowledge of AWS platform and services but not limited to VPC, Networking, Direct Connect, Subnets, NACLs, Security Groups, EC2, S3, IAM, ELBs, Lambda, CloudWatch, CloudTrail, EKS etc.
  • Must Have Hands on current Implementation and Production level experience in AWS Cloud.
  • Hands on experience with Automation and Infrastructure Provisioning is a must
  • Our goal is to only provision infrastructure with Code, and Policy As Code.
  • Must be familiar with Terraform automation, Ansible playbooks, and Python code.
  • Experience with AWS Cloud Formation and CDK is required.
  • Must have hands on experience in writing Lambda functions preferably in Python (Boto3).
  • Must be well versed in writing Linux Bash scripts.
  • Hands-on experience with Containerization and Amazon EKS is a big plus.
  • A great understanding of various DevOps toolchains, including Git/repo, Crucible, Jenkins etc.
  • Solid understanding and experience with a CI/CD tool chain.

Skills

AnsibleAWSAWS CDKAWS Cloud FormationAWS EKSAzureBashBitbucketCDCICloudWatchContainerizationCrucibleDevOpsEC2ELBIAMIaCJenkinsKubernetesLambdaLinuxMicroservicesNetworkingNACLsPythonS3Security GroupsService MeshSite Reliability EngineeringSubnetsTerraformVPC

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free