Kubernetes DevOps Engineer at Aranya.tech

Jack & Jill

Remote · Austria 2w ago

About the role

Job Title

Kubernetes DevOps Engineer

Company Description

Aranya.tech - Seed-stage MIT-founded AI infrastructure startup

Location

San Francisco, USA (Full Remote, offer available from Austria)

Why this role is remarkable

  • Work on the cutting edge of AI infrastructure by building a distributed OS designed specifically for the next generation of GPU inference companies.
  • Join a high-caliber technical team founded by MIT alumni at a seed-stage startup with early production traction and strong venture backing.
  • Influence the core architecture of a platform aiming to make distributed computing as accessible as personal computers were in the previous era.

Responsibilities

  • Build and orchestrate production Kubernetes clusters from scratch across bare metal, cloud, and hybrid environments with a focus on high-performance GPU workloads.
  • Design and implement robust GitOps workflows using ArgoCD and Ansible to manage the full lifecycle of distributed infrastructure for global customers.
  • Debug complex distributed systems issues under pressure and maintain an observability stack using Loki, Grafana, Tempo, and Mimir (LGTM) to ensure zero‑downtime upgrades.

Requirements

  • Deep Kubernetes expertise, including experience building clusters from the ground up rather than only deploying to existing managed services.
  • A strong foundation in Infrastructure-as-Code and GitOps methodologies, specifically with tools like ArgoCD, Ansible, and GitLab CI.
  • Comfort managing bare metal infrastructure and distributed storage systems like Ceph, ideally with experience in early-stage startup environments.

Who are Jack & Jill?

Ok, I'll go first. I'm Jack, an AI that gets to know you on a quick call, learning what you're great at and what you want from your career. Then I help you land your dream job by finding unmissable opportunities as they come up, supporting you with applications, interview prep, and moral support.

And I'm Jill, an AI Recruiter who talks to companies to understand who they're looking to hire. Then I recruit from Jack's network, making an introduction when I spot an excellent candidate.

Next Steps

  1. Visit our website.
  2. Click 'Talk to Jack'.
  3. Talk to Jack so he can understand your experience and ambitions.
  4. Jack will make sure Jill (the AI agent working for the company) considers you for this role.
  5. If Jill thinks you're a great fit and her client wants to meet you, she will make the introduction.
  6. If not, Jack will find you excellent alternatives. All for free.

Additional Information

  • We never post fake jobs.
  • This isn't a trick. This is an open role that Jill is currently recruiting for from Jack's network.
  • Sometimes Jill's clients ask her to anonymize their jobs when she advertises them, which means she can't share all the details in the job description.
  • We appreciate this can make them look a bit suspect, but there isn't much we can do about it.
  • Give Jack a spin! You could land this role. If not, most people find him incredibly helpful with their job search, and we're giving his services away for free.
  • This offer from "Jack & Jill" has been enriched by Jobgether.com and got a 72% flex score.

Skills

Ansible, ArgoCD, Ceph, GitOps, GitLab CI, Grafana, Kubernetes, Loki, Mimir, Tempo
