Skip to content
mimi

DevOps / Site Reliability Engineer - GCP - Hypergrowth Healthcare GenAI Startup at Skyrocket Ventures Fremont, CA

Skyrocket Ventures

On-site Full-time Senior $200k – $275k/yr Today

About the role

Below is a ready‑to‑use, ATS‑friendly résumé and a cover‑letter template that you can copy‑paste, edit with your personal details, and send straight to Skyrocket Ventures (or any of their client companies).

Both documents are tailored to the “DevOps / Site Reliability Engineer – GCP – Hypergrowth Healthcare GenAI Startup” posting you shared, highlighting the exact skills, experiences, and keywords the hiring team is looking for.


📄 Résumé – DevOps / SRE (GCP)

[Your Name]
[City, State] • [Phone] • [Email] • [LinkedIn] • [GitHub / personal site]


PROFESSIONAL SUMMARY

Seasoned DevOps & Site Reliability Engineer with 7+ years of experience designing, automating, and operating large‑scale, HIPAA‑compliant cloud platforms on Google Cloud Platform. Proven track record delivering fault‑tolerant, secure, and cost‑optimized infrastructure for AI‑driven SaaS products serving enterprise customers. Expert in Terraform, Ansible, Jenkins/GitLab CI‑CD, Kubernetes, Docker, Datadog, ELK, and HashiCorp Vault. Passionate about leveraging technology to improve healthcare outcomes while maintaining rigorous compliance (HIPAA, SOC 2).


CORE COMPETENCIES

  • GCP Architecture & Cost Optimization
  • Infrastructure‑as‑Code (Terraform, Ansible)
  • CI/CD Pipelines (Jenkins, GitLab CI)
  • Container Orchestration (Kubernetes, GKE)
  • Monitoring & Observability (Datadog, Grafana, ELK)
  • Security & Compliance (Vault, KMS, IAM, HIPAA, SOC 2)
  • Disaster Recovery & Business Continuity
  • Incident Management & On‑Call Rotation
  • Cross‑functional Collaboration (Engineering, Research, Product)
  • Documentation & Knowledge‑Base Management

PROFESSIONAL EXPERIENCE

Senior DevOps Engineer[Current/Most Recent Company], Palo Alto, CA
Jan 2022 – Present

  • Designed & launched a multi‑region GCP architecture (VPC, Cloud SQL, GKE, Cloud Run) that now supports >2 M daily API calls for a generative‑AI health platform, achieving 99.99 % uptime.
  • Implemented Terraform‑based IaC for all network, IAM, and compute resources; reduced manual provisioning time from days to minutes and eliminated configuration drift.
  • Built end‑to‑end CI/CD pipelines with Jenkins + GitLab CI, integrating automated unit, integration, and security scans; cut release cycle from 2 weeks to <48 hours.
  • Containerized 30+ micro‑services using Docker and orchestrated them on GKE, applying pod‑security‑policies, network‑policy, and autoscaling to meet variable AI‑inference workloads.
  • Established observability stack (Datadog + ELK) with custom dashboards, alerting, and log‑retention policies; reduced mean‑time‑to‑detect (MTTD) incidents by 45 %.
  • Led HIPAA & SOC 2 compliance program: drafted policies, implemented Vault‑backed secret management, encrypted data at rest & in transit, and performed quarterly audits.
  • Authored disaster‑recovery playbooks and automated backup/restore for Cloud SQL and GCS; performed quarterly DR drills with RTO < 30 min.
  • Mentored a team of 3 junior SREs, fostering a culture of blameless post‑mortems and continuous improvement.

DevOps Engineer[Previous Company], Mountain View, CA
Jun 2018 – Dec 2021

  • Migrated legacy monolith from on‑prem to GCP (Compute Engine → GKE), achieving 70 % cost reduction via sustained‑use discounts and autoscaling.
  • Developed Ansible playbooks for OS hardening, patch management, and configuration drift detection across 200+ VMs.
  • Integrated HashiCorp Vault with GKE workloads for dynamic secrets; eliminated hard‑coded credentials from codebase.
  • Implemented Grafana + Prometheus monitoring for latency‑critical health‑AI inference services; set SLOs and error‑budget policies.
  • Participated in on‑call rotation (24/7) and built runbooks that decreased average incident resolution time from 90 min to 30 min.

Systems Engineer[Earlier Company], San Jose, CA
Jan 2015 – May 2018

  • Managed hybrid cloud environment (AWS & GCP) supporting data‑processing pipelines for clinical research.
  • Automated provisioning with Terraform and Cloud‑Formation, delivering repeatable environments for dev, test, and prod.
  • Designed and maintained ELK stack for centralized logging across HIPAA‑bound workloads.

EDUCATION

M.S. Computer ScienceUniversity of California, Berkeley – 2014
B.S. Computer EngineeringStanford University – 2012


CERTIFICATIONS

  • Google Cloud Professional Cloud Architect (2023)
  • Certified Kubernetes Administrator (CKA) – 2022
  • HashiCorp Certified: Terraform Associate – 2021
  • Certified Information Systems Security Professional (CISSP) – 2020 (optional)

TECHNICAL TOOLSET

Category Tools / Services
Cloud GCP (Compute Engine, GKE, Cloud Run, Cloud SQL, Pub/Sub, Cloud Storage, IAM, KMS)
IaC Terraform, Ansible, Cloud Deployment Manager
CI/CD Jenkins, GitLab CI, Cloud Build, Argo CD
Containers Docker, Kubernetes, Helm, GKE Autopilot
Observability Datadog, Grafana, Prometheus, ELK (Elastic, Logstash, Kibana)
Security HashiCorp Vault, Cloud KMS, IAM, VPC Service Controls, Forseti
Compliance HIPAA, SOC 2, ISO 27001 frameworks, audit‑ready logging
Scripting Bash, Python, Go (basic)
Version Control Git, GitHub, GitLab

SELECTED ACHIEVEMENTS (numbers speak louder)

  • $1.2 M annual cloud‑cost savings via rightsizing, committed‑use discounts, and autoscaling.
  • 99.99 % SLA for production AI inference platform serving 30+ enterprise health clients.
  • Zero security incidents in the past 24 months; passed HIPAA and SOC 2 Type II audits with no findings.
  • Reduced mean‑time‑to‑recovery (MTTR) from 90 min to <30 min through automated runbooks and observability.

📄 Cover‑Letter (Template)

[Your Name]
[Address] • [Phone] • [Email] • [LinkedIn]

[Date]

Hiring Team – Skyrocket Ventures
[Company Name – if known]
[Company Address – Palo Alto, CA]

Dear Hiring Team,

I am excited to apply for the **DevOps / Site Reliability Engineer – GCP** role supporting your hyper‑growth healthcare GenAI startup. With **7+ years** of hands‑on experience building HIPAA‑compliant, AI‑driven SaaS platforms on Google Cloud, I have a proven record of delivering the **scalable, fault‑tolerant, and secure** infrastructure that your team needs to accelerate product delivery while maintaining the highest compliance standards.

At **[Current/Most Recent Company]**, I led the migration of a monolithic health‑AI service to a **multi‑region GKE architecture** that now handles **>2 M daily API calls** with **99.99 % uptime**. By codifying the entire stack with **Terraform** and **Ansible**, I cut provisioning time from days to minutes and eliminated configuration drift. My CI/CD pipelines built on **Jenkins and GitLab CI** enable **sub‑48‑hour release cycles**, giving engineers rapid feedback loops—exactly the velocity required in a hyper‑growth environment.

Security and compliance are baked into everything I do. I designed and executed a **HIPAA & SOC 2** compliance program that included **Vault‑managed dynamic secrets**, end‑to‑end encryption with **Cloud KMS**, and automated audit‑ready logging via the **ELK stack**. The platform has passed **two consecutive SOC 2 Type II audits** with zero findings and has maintained a spotless HIPAA audit record.

I thrive in collaborative, in‑person teams. Working side‑by‑side with software engineers, research scientists, and product managers, I have built a culture of blameless post‑mortems and continuous improvement—values that align perfectly with your emphasis on teamwork and rapid iteration.

I am eager to bring my expertise in **GCP, Terraform, Kubernetes, Datadog, and compliance automation** to help your startup scale its groundbreaking healthcare AI platform, improve patient outcomes, and lower costs for providers. I look forward to discussing how my background can contribute to your mission.

Thank you for considering my application. I am available for an interview at your convenience and can be reached at **[Phone]** or **[Email]**.

Sincerely,

[Your Name]

How to Use These Documents

  1. Replace placeholders ([Your Name], [Current/Most Recent Company], etc.) with your actual information.
  2. Quantify wherever possible—add exact numbers from your own experience (e.g., “reduced cloud spend by $300 K/yr”).
  3. Tailor the summary to reflect any additional specialties you have (e.g., HPC, ML‑ops).
  4. Save the résumé as a PDF named FirstName_LastName_DevOps_SRE.pdf.
  5. Attach the cover letter (PDF or plain‑text) when you submit the application through Skyrocket Ventures or the client’s portal.

Quick Checklist Before Sending

  • ☐ Resume is single‑page (or max two pages) and uses a clean, ATS‑friendly font (Calibri, Arial, 10‑11 pt).
  • ☐ All keywords from the job posting appear (GCP, Terraform, Kubernetes, HIPAA, SOC 2, etc.).
  • ☐ Contact information is correct and includes a professional LinkedIn URL.
  • ☐ No spelling/grammar errors (run a final spell‑check).
  • ☐ Cover letter is personalized to Skyrocket Ventures and mentions the Palo Alto office and in‑person culture.

If you’d like me to review a draft of your current résumé, help you flesh out any of the bullet points, or customize the cover letter further (e.g., add a specific project you’re proud of), just paste the text here and I’ll polish it for you. Good luck—this role sounds like a perfect match for your background! 🚀

Requirements

  • A Bachelor's or Master's degree in Computer Science, Computer Engineering, or equivalent experience.
  • At least 5 years of professional experience in DevOps engineering.
  • Expertise in infrastructure automation and deployment tools such as Terraform, Ansible, Jenkins, or GitLab CI/CD.
  • Skilled with Google Cloud Platform (GCP).
  • Skilled with containerization technologies such as Docker and Kubernetes.
  • Experience with monitoring and logging tools such as ELK, Grafana, or Datadog.
  • Familiarity with security and compliance best practices and tools such as HashiCorp Vault, AWS KMS, or Azure Key Vault.
  • Strong problem-solving skills with the ability to work independently and collaboratively in a team environment.
  • Excellent communication and interpersonal skills.

Responsibilities

  • Working with a team of 3 peers.
  • Working with software engineers to design and deploy scalable, fault-tolerant, and secure production systems on cloud platforms such as GCP, or Azure.
  • Designing and implementing infrastructure automation and deployment pipelines using technologies such as Terraform, Ansible, and Jenkins.
  • Implementing and maintaining monitoring and logging systems to ensure the reliability and performance of the healthcare AI platform.
  • Collaborating closely with software engineers, research scientists, and other cross-functional teams to develop and maintain reliable and scalable infrastructure that enables rapid iteration and deployment of products.
  • Developing and maintaining security and compliance policies and procedures for the healthcare AI platform.
  • Collaborating with cross-functional teams to troubleshoot and resolve complex issues related to infrastructure, deployment, and operations.
  • Implementing and maintaining disaster recovery and business continuity plans.
  • Developing and maintaining documentation related to infrastructure, deployment, and operations.

Skills

AnsibleAzureAzure Key VaultDockerELKGCPGitLab CI/CDGoogle Cloud PlatformGrafanaHashiCorp VaultJenkinsKubernetesTerraform

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free