Skip to content
mimi

Site Reliability Engineer Specializing in Advanced Infrastructure Solutions

Pythian

On-site Full-time Yesterday

About the role

About

Transform cloud infrastructure as a Site Reliability Engineer. Design and automate resilient systems while focusing on performance optimization and system integrity.

In this role, you'll contribute to a pioneering Site Reliability Engineering team, responsible for building and maintaining high-performing infrastructures. You'll leverage your skills to operate Kubernetes clusters and integrate advanced observability tools. Your expertise in cloud technologies will ensure infrastructure readiness and support innovative AI/ML projects.

Key Responsibilities

  • Optimize and operate Kubernetes clusters and Linux systems
  • Automate workflows using Go, Python, and Shell scripts
  • Construct monitoring solutions with Prometheus and Grafana
  • Resolve complex networking and performance issues
  • Work with AI/ML teams to prepare infrastructure for data tasks

Requirements

  • Hands-on experience with Google Cloud and Terraform tools
  • Strong understanding of microservices and containers
  • Experienced in Linux systems administration and PKI
  • SRE mentality with a focus on scalability and reliability
  • Proven ability to manage distributed cloud systems

Additional Information

Shape the future of cloud technology by enhancing infrastructure performance and ensuring robust automation.
#J-18808-Ljbffr

Requirements

  • Hands-on experience with Google Cloud and Terraform tools
  • Strong understanding of microservices and containers
  • Experienced in Linux systems administration and PKI
  • SRE mentality with a focus on scalability and reliability
  • Proven ability to manage distributed cloud systems

Responsibilities

  • Optimize and operate Kubernetes clusters and Linux systems
  • Automate workflows using Go, Python, and Shell scripts
  • Construct monitoring solutions with Prometheus and Grafana
  • Resolve complex networking and performance issues
  • Work with AI/ML teams to prepare infrastructure for data tasks

Skills

DockerGoGrafanaGoogle CloudLinuxMicroservicesObservabilityPrometheusPythonShell scriptingSRETerraform

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free