Site Reliability Engineer

Alliance Sourcing Network

Parsippany-Troy Hills · On-site · Full-time · Mid Level · 1w ago

About the role

We are looking for a talented Site Reliability Engineer (SRE) with a strong background in Google Cloud Platform (GCP) and Kubernetes. The ideal candidate will be responsible for ensuring the reliability, performance, and scalability of our on-premises and cloud-based systems, with a focus on reducing Google Cloud costs.

Responsibilities

  • System Reliability: Ensure the reliability and uptime of critical services and infrastructure.
  • Google Cloud Expertise: Design, implement, and manage cloud infrastructure using Google Cloud services.
  • Automation: Develop and maintain automation scripts and tools to improve system efficiency and reduce manual intervention.
  • Monitoring and Incident Response: Implement monitoring solutions and respond to incidents to minimize downtime and ensure quick recovery.
  • Collaboration: Work closely with development and operations teams to improve system reliability and performance.
  • Capacity Planning: Conduct capacity planning and performance tuning to ensure systems can handle future growth.
  • Documentation: Create and maintain comprehensive documentation for system configurations, processes, and procedures.

Qualifications

  • Education: Bachelor's degree in Computer Science, Engineering, or a related field.
  • Experience: 4+ years of experience in site reliability engineering or a similar role.
  • Skills:
    • Proficiency in Google Cloud services (Compute Engine, Kubernetes Engine, Cloud Storage, BigQuery, Pub/Sub, etc.).
    • Familiarity with Google BI and AI/ML tools (Looker, BigQuery ML, Vertex AI, etc.).
    • Experience with automation tools (Terraform, Ansible, Puppet).
    • Familiarity with CI/CD pipelines and tools (Azure Pipelines, Jenkins, GitLab CI, etc.).
    • Strong scripting skills (Python, Bash, etc.).
    • Knowledge of networking concepts and protocols (service mesh experience is a plus).
    • Experience with monitoring tools (Prometheus, Grafana, etc.).

Preferred Certifications

  • Google Cloud Professional DevOps Engineer
  • Google Cloud Professional Cloud Architect

Skills

Ansible, Azure Pipelines, Bash, BigQuery, BigQuery ML, Compute Engine, Docker, GitLab CI, Google Cloud, Google Kubernetes Engine, Grafana, Jenkins, Looker, Networking, Prometheus, Pub/Sub, Puppet, Python, Service Mesh, Terraform, Vertex AI
