Site Reliability Engineer (SRE)

Comcast

Reston · Flexible · Full-time · Mid Level · 5d ago

About the role

About Comcast

Make your mark at Comcast -- a Fortune 30 global media and technology company. From the connectivity and platforms we provide, to the content and experiences we create, we reach hundreds of millions of customers, viewers, and guests worldwide. Become part of our award-winning technology team that turns big ideas into cutting-edge products, platforms, and solutions that our customers love.

We create space to innovate, and we recognize, reward, and invest in your ideas, while ensuring you can proudly bring your authentic self to the workplace. Join us. You'll do the best work of your career right here at Comcast.

Job Summary

As a Site Reliability Engineer (SRE), you will be part of the SRE team within the CONNECT OpTek team. Our team develops and supports multiple tools and applications that Comcast field technicians use to diagnose and troubleshoot issues within the Comcast nationwide network.

The SRE team is responsible for maintaining the existing systems, supporting our development teams, and implementing innovative solutions. You will work alongside software developers, testers, and project managers.

Job Description

  • This position is unable to provide work authorization sponsorship or immigration support now or in the future.
  • Comcast's Technology, Product & Experience (TPX) organization works at the intersection of media and technology. Our innovative teams are continually developing and delivering products that transform the customer experience.

Your responsibilities will span the entire product life cycle, from requirements gathering through development and deployment to operations support. They include:

  • Providing Infrastructure as Code solutions for a small, cohesive group within Comcast
  • Using Terraform to configure AWS infrastructure and to provision Kubernetes clusters and applications
  • Working with and supporting developers to help maintain/define best practices
  • Configuring, watching, tuning and responding to monitoring events
  • Supporting an on-call rotation with the SRE team
  • Maintaining and improving CI/CD pipelines using Concourse and GoCD
  • Supporting corporate initiatives (e.g., security hardening)
  • Having a good time learning and working with the people on the team

What You'll Do:

  • Develop solutions for a wide range of difficult applications, problems or procedures.
  • Interpret internal/external business issues and recommend complete solutions based on best practices and proven technologies.
  • Work with members of cross-functional teams, third-party vendors, company product managers, and marketing teams to deliver quality products that meet defined requirements in a timely fashion.
  • Provide technical leadership and mentorship.
  • Diligently record and document development and production support activities and tasks in our ticketing tool.
  • Ensure that project requests are properly accepted into the SRE engineering team, are worked in a timely and efficient manner, are of high quality, and smoothly follow the DevOps life cycle: continuous innovation, feedback, and improvement.
  • Deploy new systems and software and conduct appropriate testing to ensure successful deployment. Determine the necessary test coverage and plans as part of the deployment strategy.
  • Other duties and responsibilities as assigned.
  • Occasional on-call support is required.

Preferred Requirements:

  • Experience with Cloud Providers and configuring Infrastructure
  • AWS
  • Kubernetes
  • Experience with CM Tools, such as Terraform and Ansible
  • Docker
  • Monitoring systems (Prometheus/Alertmanager/Grafana)
  • Git
  • Experience with CI/CD Tools
  • ECS/ECR

Additional "Nice to Have" Qualifications:

  • Scripting experience with Bash and Python 3
  • Experience troubleshooting applications and networking (Java, Angular, VPCs, firewalls, etc.)
  • Concourse/GoCD
  • Understanding distributed systems and how the pieces fit together.

About Our Perks & Benefits:

We are determined to create an environment where our employees feel valued, understand our business goals, and are motivated.

Here's a…

Skills

AWS, Ansible, Bash, Concourse, Docker, GoCD, Grafana, Git, Java, Kubernetes, Prometheus, Python, Terraform
