T
0D Capital: Senior Devops/SRE Engineer
The10minutecareersolution
Remote (Global) Senior 5d ago
About the role
About the Role
You'll personal the reliability, scalability, and efficiency of the infrastructure powering our buying and selling methods. This contains Kubernetes operations, CI/CD methods, observability stack, and networking for high-frequency buying and selling and DeFi methods. Your work immediately impacts latency, uptime, and developer velocity.
What You will Do
- Design and evolve our Kubernetes platform: operators, workload orchestration, environment friendly deployments (blue/inexperienced, canary).
- Enhance CI/CD pipelines: GitHub Actions, Cloud Construct, automated assessments, safe picture supply.
- Construct and keep monitoring and incident response: OpenTelemetry, Prometheus/Alertmanager, Loki, Tempo, Thanos/Mimir.
- Preserve community & ingress: NGINX Ingress, Kong Gateway (auth, rate-limit, plugins).
- Handle infrastructure as code: Terraform/Ansible for GCP sources.
- Drive FinOps: optimize value of compute, storage, and networking.
- Assist growth groups: debug efficiency points, enhance reliability, automate workflows.
- Lead incident response & post-mortems: guarantee methods are observable and resilient.
Stack & Infra You will Contact
- Networking: NGINX Ingress, Kong Gateway
- Tooling: Terraform, Ansible, Python/Rust for automation
What We're Wanting For
Expertise
- 5+ years working manufacturing infrastructure at scale.
- Sturdy expertise with Kubernetes (operators, controllers, upgrades).
- Strong background in CI/CD, GitOps, infrastructure as code.
- Confirmed report of constructing dependable, observable methods.
Technical Abilities
- Sturdy in at the least one methods language (Golang or Rust most well-liked).
- Cloud (GCP/AWS/Azure) networking and IAM.
- Terraform / Ansible or related for infra automation.
- Monitoring and tracing (Prometheus, OpenTelemetry).
- Incident administration and on-call practices.
Good to Have
- Expertise in low-latency buying and selling or crypto infra.
- Safety hardening (community insurance policies, secrets and techniques administration, Vault/KMS).
- Efficiency tuning of Kubernetes and containerized workloads.
- Value optimization (FinOps) at scale.
Why Be a part of Us
- Aggressive comp with fairness/token upside.
- Possession of the platform core to buying and selling.
- Distant (±4h CET), lean sharp crew, offsites.
Mindset
- Finish-to-end possession. Bias to ship with reliability and excessive requirements.
- Calm beneath stress, capable of debug complicated distributed methods.
- Quick learner, inquisitive about infra and buying and selling methods.
Requirements
- 5+ years working manufacturing infrastructure at scale.
- Sturdy expertise with Kubernetes (operators, controllers, upgrades).
- Strong background in CI/CD, GitOps, infrastructure as code.
- Confirmed report of constructing dependable, observable methods.
- Sturdy in at the least one methods language (Golang or Rust most well-liked).
- Cloud (GCP/AWS/Azure) networking and IAM.
- Terraform / Ansible or related for infra automation.
- Monitoring and tracing (Prometheus, OpenTelemetry).
- Incident administration and on-call practices.
Responsibilities
- Design and evolve our Kubernetes platform: operators, workload orchestration, environment friendly deployments (blue/inexperienced, canary).
- Enhance CI/CD pipelines: GitHub Actions, Cloud Construct, automated assessments, safe picture supply.
- Construct and keep monitoring and incident response: OpenTelemetry, Prometheus/Alertmanager, Loki, Tempo, Thanos/Mimir.
- Preserve community & ingress: NGINX Ingress, Kong Gateway (auth, rate-limit, plugins).
- Handle infrastructure as code: Terraform/Ansible for GCP sources.
- Drive FinOps: optimize value of compute, storage, and networking.
- Assist growth groups: debug efficiency points, enhance reliability, automate workflows.
- Lead incident response & post-mortems: guarantee methods are observable and resilient.
Benefits
equity upsidetoken upside
Skills
AnsibleCloud ConstructDockerGCPGitHub ActionsGolangGrafanaIAMIngressKong GatewayKubernetesLokiMimirNGINX IngressOpenTelemetryPrometheusPythonRustTerraformThanosTempo
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free