Infrastructure Team
Close
About Us
Close is a bootstrapped, profitable, 100% remote, ~100‑person team of thoughtful individuals who prioritize taking ownership and making a meaningful impact. We love small, scaling businesses. Since 2013, we’ve been building a CRM that focuses on better communication, without the hassle of manual data entry or a complex UI. We are out to supercharge sales productivity with the most modern, thoughtfully designed, all‑in‑one, communication‑focused CRM.
Our backend tech stack consists primarily of Python Flask web apps, with our TaskTiger scheduler handling much of the asynchronous background task processing. Our data stores include MongoDB, PostgreSQL, Elasticsearch, and Redis. The underlying infrastructure runs on AWS, using a combination of managed services such as EKS, MSK, RDS, and ElastiCache, and self‑managed services running on EC2 instances. We have CI/CD pipelines that build Docker images, run automated tests, and deploy to Kubernetes clusters. We also use these images in our local development environment, which lets us develop locally against all of our services. We have a well‑documented public API that is consumed by our front‑end JavaScript app as well as numerous integrations. Our infrastructure is heavily automated using Terraform, Ansible, and other AWS tools.
We love open sourcing our code and ideas on our GitHub and on The Making of Close, our behind‑the‑scenes Product & Engineering blog. Check out our open source projects like close‑mongo‑ops‑manager, SocketShark, TaskTiger, LimitLion and ciso8601.
About the Role
You will be joining the Infrastructure Team at Close. This team builds and maintains the platform that runs all Close systems. You will work with:
- Multi‑terabyte MongoDB, PostgreSQL, and Elasticsearch clusters
- Telemetry systems built on Grafana’s LGTM stack and ClickHouse processing over 130 TB per month
- Multiple Kubernetes clusters running tens of thousands of pods
- GitHub Actions & ArgoCD powered CI/CD that can go from merged, to production, to rolled back in 10 minutes
- A system that is stable, up‑to‑date, and hasn’t needed scheduled downtime in 4 years
About You
- You are a rock in the storm. With hard‑won expertise, you consistently build robust systems from quality components fit to underpin mission‑critical applications. You value simplicity over familiarity and resilience over speed.
- You’ve worked with a diverse array of infrastructure tools and systems, including:
- CI/CD (CircleCI, GitHub Actions, ArgoCD)
- Configuration Management (Ansible, Terraform)
- Databases (Elasticsearch, MongoDB, PostgreSQL, ClickHouse)
- Cloud Computing (Kubernetes, AWS)
- Telemetry (Loki, Tempo, Grafana, Mimir/Prometheus)
- You’re comfortable working in a fast‑paced environment with a small, talented, fully distributed team. You manage time well, communicate effectively, and collaborate across time zones.
Responsibilities / Projects
- Fully automate our database lifecycles with Argo Workflows
- Eliminate static credentials wherever they are still in use
- Reduce downtime and disruption due to maintenance or disaster to new lows
- Improve our multi‑region disaster recovery system
Requirements
- Senior 1 & 2 level candidates: 5+ years building modern infrastructure systems
- Staff level candidates: 8+ years building modern infrastructure systems
- Recognized as an expert on the systems you run; the buck stops with you
- Final point of escalation for mission‑critical production systems
- Familiarity with: AWS, Terraform, Kubernetes, Ansible, MongoDB, PostgreSQL, Elasticsearch
- Strong grasp of common networking and data‑transfer protocols (DNS, HTTP, TCP)
- Ability to speak and write in English
- Located in the USA (ET, CT, MT, PT)
Bonus Points
- Contributed open‑source code related to our tech stack
- Experience maintaining very large databases
- Successful disaster‑response experience
- Experience with multi‑region architectures
- Experience running MLOps systems
- Experience scaling Temporal
Benefits
- Competitive compensation including an organization‑wide goal‑based bonus
- Paid Time Off: ~5 weeks PTO upon joining + Winter and Summer Holiday Breaks; 2 additional PTO days each year
- 80% Work Option: choose between a 5‑day week (standard full‑time) or a 4‑day week at 80% pay
- Paid Parental Leave for primary and secondary caregivers
- Sabbatical: after 5 years, eligible for a 1‑month paid sabbatical
- Healthcare (US residents): Medical, Dental, Vision with HSA option; Dependent care FSA
- 401(k) (US residents): 6% match with immediate vesting
Our Values
- Build a house you want to live in – Examine long‑term thinking and action
- No BS – Practice transparency and honesty, especially when it’s hard
- Invest in each other – Build successful relationships with coworkers and customers
- Discipline equals freedom – Keep your word to yourself and others
- Strive for greatness – Constantly challenge yourself and others