Site Reliability Engineer - DevOps Focus
Software Guidance & Assistance
About the role
At Software Guidance & Assistance, we are looking for a Site Reliability Engineer with a DevOps focus!
Our tech stack:
AWS, Lambda, Architect, Azure, CI/CD, Cloud, CloudWatch, Datadog, DevOps, Django, Docker, Dynatrace, EC2, ELK, GitLab, IAM, Jenkins, Kubernetes, Python, SQL, Security, Serverless, Terraform, microservices, numpy, pandas
Requirements:
- Over 8 years of hands-on experience in Site Reliability, DevOps, or Platform Engineering roles.
- Extensive knowledge of AWS Cloud Services (ECS, EC2, Lambda, IAM, CloudWatch, etc.).
- Proficiency in Python for automation, scripting, and infrastructure integration.
- Strong familiarity with CI/CD pipelines using GitLab or Jenkins.
- Practical experience with Infrastructure as Code (CDK, Terraform, or CloudFormation).
- Expertise in monitoring and observability tools (Datadog, Dynatrace, ELK).
- Understanding of microservices, serverless architectures, and containerization (Docker, ECS, Kubernetes).
- Excellent analytical, troubleshooting, and problem-resolution abilities.
- Strong communication and collaboration skills within cross-functional teams.
- Bachelor's degree in Computer Science or a related field (or equivalent experience).
Your responsibilities are:
- Architect, deploy, and sustain AWS and Azure infrastructures with a focus on reliability, scalability, and cost-effectiveness.
- Design and manage monitoring, logging, and alerting systems to ensure high availability and swift incident response.
- Develop and maintain CI/CD pipelines (GitLab, Jenkins) to facilitate continuous software delivery and automation.
- Implement and uphold Infrastructure as Code (IaC) using CDK, Terraform, or CloudFormation.
- Collaborate with development teams to enhance deployment processes and improve production reliability.
- Contribute to the application codebase for resilience, performance optimization, and observability best practices.
- Maintain comprehensive documentation for architectures, design patterns, and configurations.
- Partner with Dev, QA, and AppSecOps teams to advance automation, consistency, and security enhancements.
- Conduct incident triage, root cause analysis, and develop lasting solutions to production challenges.
- Continuously refine standards, tools, and processes for enhanced platform reliability and efficiency.
Software Guidance & Assistance - More about us and the role:
At Software Guidance & Assistance, Inc. (SGA), we are seeking a Site Reliability Engineer (SRE) for a direct placement assignment with one of our esteemed financial services clients located in Midtown New York City. This position offers a hybrid work schedule, with 2-3 days onsite each week. Our firm is dedicated to solving complex IT challenges through a personalized, boutique approach, and we pride ourselves on matching consultants like you with over 1,000 engagements annually. Join a diverse team built on core values such as exceptional service, employee growth, quality, and integrity. We are a women-owned business committed to fostering a culture where you can bring your authentic self, pursue your passions, and thrive in your career.