Senior Site Reliability Engineer (SRE)
JobForgeNow
About the role
Who We Are
Provision IAM is a custom digital solutions agency specializing in financial and regulatory technology. For more than 28 years, we’ve helped organizations optimize digital operations and build secure, scalable identity and access management solutions for financial institutions and other regulated industries. We are a distributed U.S.-based team of thoughtful technologists and problem‑solvers who value ownership, continuous learning, and meaningful impact. Our environment supports autonomy, professional growth, and collaboration across engineering and client teams.
About the Role
We are seeking an experienced Senior Site Reliability Engineer (SRE) to own infrastructure initiatives from concept through completion. This is a highly autonomous individual contributor role working closely with the VP of Infrastructure and development teams. You will be trusted to take ownership of projects, drive execution, and maintain reliable, secure environments across multiple client infrastructures. As you ramp up and build context, you’ll gain increasing independence and technical influence. This is a full‑time, fully remote position open to applicants across the continental United States who are authorized to work in the U.S.
Salary Range: $115,000 – $140,000 annually, commensurate with experience.
What You’ll Be Doing
- Own and execute infrastructure projects, including migrations, automation, and tooling improvements
- Manage and troubleshoot Kubernetes clusters across multiple environments
- Maintain and improve GitOps deployment pipelines
- Build and maintain CI/CD pipelines
- Manage Google Cloud Platform infrastructure (GKE, IAM, networking, storage)
- Implement and maintain secrets and configuration management systems
- Write and maintain automation (infrastructure as code, configuration management, scripting)
- Participate in an on‑call rotation supporting production infrastructure as needed
- Communicate with internal teams and occasionally with clients when infrastructure matters impact delivery
- Collaborate with developers on deployment, reliability, and performance
- Use AI tools appropriately to enhance engineering productivity and workflow
What We Offer
- Competitive base salary ($115K–$140K)
- Company‑paid health insurance (employee and family coverage)
- Generous paid time off
- SIMPLE IRA retirement plan (IRS‑compliant eligibility and company participation)
- Fully remote work environment
- Meaningful technical ownership and growth opportunities
How We Evaluate Candidates
We focus on demonstrated skills and practical capability. We utilize structured technical assessments and interviews to evaluate problem‑solving ability, technical reasoning, and real‑world experience. We value:
- Strong technical fundamentals
- Sound judgment
- Intellectual honesty
- Ability to research and solve complex problems
- Clear communication and documentation
Required Technical Skills
Infrastructure & Cloud
- Hands‑on Kubernetes production experience
- Experience with GitOps workflows (ArgoCD, Flux, or similar)
- Strong cloud infrastructure experience (Google Cloud preferred; AWS/Azure transferable)
- CI/CD pipeline design and maintenance (GitLab CI/CD or equivalent)
- Infrastructure as Code (Terraform, OpenTofu, Pulumi, or similar)
- Enterprise secrets management tools (HashiCorp Vault or equivalent)
Systems & Operations
- Advanced Linux command‑line and system administration
- Monitoring and observability tools (Prometheus, Grafana, Datadog, etc.)
- Understanding of SLIs/SLOs and incident response practices
- Automation and scripting (Bash, Python, or similar)
Working Style & Professional Expectations
- Self‑directed and accountable
- Clear written and verbal communicator
- Honest about knowledge gaps and proactive in resolving them
- Professional and client‑aware
- Able to manage time and responsibilities effectively in a remote environment
Nice‑to‑Have Experience
- Configuration management tools (Ansible, Puppet, Chef)
- Programming beyond scripting (Python, Go, TypeScript)
- Database operations (PostgreSQL)
- Security tooling and practices
- DevOps automation and CI/CD optimization
Employment Requirements
- Must be authorized to work in the United States
- Position is open to applicants residing in the continental United States
- Provision IAM does not sponsor employment visas
- Employment is contingent upon successful completion of background screening and drug testing, where permitted by law
- All hires must complete Form I‑9 and verify identity and employment authorization
- The role may include participation in an on‑call rotation as business needs require
Equal Employment Opportunity
Provision IAM is an equal opportunity employer and makes employment decisions based on business needs, job requirements, and individual qualifications. We are committed to providing equal employment opportunities in accordance with applicable federal, state, and local laws.
Requirements
- Hands-on Kubernetes production experience
- Experience with GitOps workflows (ArgoCD, Flux, or similar)
- Strong cloud infrastructure experience (Google Cloud preferred; AWS/Azure transferable)
- CI/CD pipeline design and maintenance (GitLab CI/CD or equivalent)
- Infrastructure as Code (Terraform, OpenTofu, Pulumi, or similar)
- Enterprise secrets management tools (HashiCorp Vault or equivalent)
- Advanced Linux command-line and system administration
- Monitoring and observability tools (Prometheus, Grafana, Datadog, etc.)
- Understanding of SLIs/SLOs and incident response practices
- Automation and scripting (Bash, Python, or similar)
- Self-directed and accountable
- Clear written and verbal communicator
- Honest about knowledge gaps and proactive in resolving them
- Professional and client-aware
- Able to manage time and responsibilities effectively in a remote environment
- Candidates should be comfortable leveraging AI responsibly while maintaining strong engineering fundamentals and independent problem-solving ability
- Must be authorized to work in the United States
- Position is open to applicants residing in the continental United States
Responsibilities
- Own and execute infrastructure projects, including migrations, automation, and tooling improvements
- Manage and troubleshoot Kubernetes clusters across multiple environments
- Maintain and improve GitOps deployment pipelines
- Build and maintain CI/CD pipelines
- Manage Google Cloud Platform infrastructure (GKE, IAM, networking, storage)
- Implement and maintain secrets and configuration management systems
- Write and maintain automation (infrastructure as code, configuration management, scripting)
- Participate in an on-call rotation supporting production infrastructure as needed
- Communicate with internal teams and occasionally with clients when infrastructure matters impact delivery
- Collaborate with developers on deployment, reliability, and performance
- Use AI tools appropriately to enhance engineering productivity and workflow
Benefits
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free