Infrastructure Team

Remote (Global) Full-time Senior 3mo ago

About the role

About Us

Close is a bootstrapped, profitable, 100% remote, ~100‑person team of thoughtful individuals who prioritize taking ownership and making a meaningful impact. We love small scaling businesses. Since 2013, we’ve been building a CRM that focuses on better communication, without the hassle of manual data entry or a complex UI. We are out to supercharge sales productivity with the most modern, thoughtfully designed, all‑in‑one, communication‑focused CRM.

Our backend tech stack consists primarily of Python Flask web apps with our TaskTiger scheduler handling many of the backend asynchronous task processing chores. Our data stores include MongoDB, PostgreSQL, Elasticsearch, and Redis. The underlying infrastructure runs on AWS using a combination of managed services like EKS, MSK, RDS and ElasticCache and non‑managed services running on EC2 instances. We have CI/CD pipelines that build Docker images, run automated tests and deploy to Kubernetes clusters. We also use these images in our local development environment allowing coding locally against all of our services. We have a well‑documented public API that is consumed by our front‑end JavaScript app as well as numerous integrations. Our infrastructure is heavily automated using Terraform, Ansible and other AWS tools.

We love open sourcing our code and ideas on our GitHub and on The Making of Close, our behind‑the‑scenes Product & Engineering blog. Check out our open source projects like close‑mongo‑ops‑manager, SocketShark, TaskTiger, LimitLion and ciso8601.

About the Role

You will be joining the Infrastructure Team at Close. This team builds and maintains the platform that runs all Close systems. You will work with:

Multi‑terrabyte MongoDB, PostgreSQL, and Elasticsearch clusters
Telemetry systems built on Grafana’s LGTM stack and ClickHouse processing over 130 TB per month
Multiple Kubernetes clusters running tens of thousands of pods
GitHub Actions & ArgoCD powered CI/CD that can go from merged, to production, to rolled back in 10 minutes
A system that is stable, up‑to‑date, and hasn’t needed scheduled downtime in 4 years

About You

You are a rock in the storm. With hard‑won expertise, you consistently build robust systems from quality components fit to underpin mission‑critical applications. You value simplicity over familiarity and resilience over speed.
You’ve worked with a diverse array of infrastructure tools and systems, including:
- CI/CD (CircleCI, GitHub Actions, ArgoCD)
- Configuration Management (Ansible, Terraform)
- Databases (Elasticsearch, MongoDB, PostgreSQL, ClickHouse)
- Cloud Computing (Kubernetes, AWS)
- Telemetry (Loki, Tempo, Grafana, Mimir/Prometheus)
You’re comfortable working in a fast‑paced environment with a small, talented, fully distributed team. You manage time well, communicate effectively, and collaborate across time zones.

Responsibilities / Projects

Fully automate our database lifecycles with Argo Workflow
Eliminate all static credentials where they may be used
Reduce downtime and disruption due to maintenance or disaster to new lows
Improve our multi‑region disaster recovery system

Requirements

Senior 1 & 2 level candidates: 5+ years building modern infrastructure systems
Staff level candidates: 8+ years building modern infrastructure systems
Recognized as an expert on the systems you run; the buck stops with you
Final point of escalation for mission‑critical production systems
Familiarity with: AWS, Terraform, Kubernetes, Ansible, MongoDB, PostgreSQL, Elasticsearch
Strong grasp of common networking and data‑transfer protocols (DNS, HTTP, TCP)
Ability to speak and write in English
Located in the USA (ET, CT, MT, PT)

Bonus Points

Contributed open‑source code related to our tech stack
Experience maintaining very large databases
Successful disaster‑response experience
Experience with multi‑region architectures
Experience running MLOps systems
Experience scaling Temporal

Benefits

Competitive compensation including an organization‑wide goal‑based bonus
Paid Time Off: ~5 weeks PTO upon joining + Winter and Summer Holiday Breaks; 2 additional PTO days each year
80% Work Option: choose between a 5‑day week (standard full‑time) or a 4‑day week at 80% pay
Paid Parental Leave for primary and secondary caregivers
Sabbatical: after 5 years, eligible for a 1‑month paid sabbatical
Healthcare (US residents): Medical, Dental, Vision with HSA option; Dependent care FSA
401(k) (US residents): 6% match with immediate vesting

Our Values

Build a house you want to live in – Examine long‑term thinking and action
No BS – Practice transparency and honesty, especially when it’s hard
Invest in each other – Build successful relationships with coworkers and customers
Discipline equals freedom – Keep your word to yourself and others
Strive for greatness – Constantly challenge yourself and others

Skills

AnsibleArgoCDAWSCircleCIClickHouseDockerElasticsearchFlaskGrafanaGitHub ActionsKubernetesLokiMimirMongoDBPrometheusPostgreSQLPythonRedisTerraformTemporalTempoTigris

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free