Site Reliability Engineer
Rapinno Tech
About the role
Description
As a member of the Platform as a Service team, you will be responsible for the design and development of medium to highly complex systems. This includes the design and implementation of infrastructure from specifications, configuration and deployment of applications, connecting to back-end resources, and advanced troubleshooting of moderately complex software applications. Deployment, middleware administration and operational support of (production, staging, test and development) environments for multiple projects using WebSphere, Weblogic, and Tomcat Application Server. Monitors systems capacity and performance, plans and executes disaster recovery procedures, and provides Tier 2 technical support.
In addition, this role requires the candidate to be highly flexible in hours of work because of its customer-facing, highly available infrastructure requirements. Work closely with Dev, QA and production support team members to align and orchestrate resolutions on open issues/defects. Provides high level written communications to upper management regarding production issues.
Required Skills
- 3-5 years managing and administrating middleware technologies(Weblogic, Websphere, Tomcat).
- 3+ years hands-on experience with Solaris, Linux (RHEL, CentOS, Ubuntu), in bare-metal and Cloud-based infrastructure (AWS, OpenStack)
- Experience with cloud platforms AWS( Auto scaling , AVI, security, EC2 , EFS , EBS , S3 , KMS)
- Strong experience with Installing IBM WebSphere MQ and creating multi instance Queue manager in AWS by using EBS/EFS volumes, creating MQ objects, clusters, channels etc.
- Experience with configuring the clustered Queue managers for HA and load-balancing as well troubleshooting in clustered environment
- Installing open source Rabbit MQ on AWS EC2 instances with the use of CFTs/ansible and automating it by using Jenkins. Also creating Classic Load balancer to distribute traffic among those Rabbit MQ instances
- Experience with migrating applications from monolithic to kubernetes container platform
- Experience with APIGEE Proxy configurations and troubleshooting
- Hands on experience with CI/CD tools such as Jenkins, Ansible
- Working knowledge of monitoring tools like CA Wily, New Relic, and Datadog
- Experience with Elasticsearch, Kibana, and Logstash
- Execution on all release engineering aspects of DevOps including the configuration management , Build and Deployment Management, Continuous Integration and Delivery
- Ansible based deployment and configuration automation solutions.
- Experience with web based services and protocols ( HTTP , HTTPS, REST , Apache , Tomcat)
- Experience with micro-service architectures and deployment.
- Knowledge on L2/L3 protocols , IPv4/IPv6 and TCP/IP stack .
- Proficiency in high level script languages (Python preferred) as well as script environments like bash
- Experience with DevOps workflow automation (Jenkins, Ansible, Puppet)
- Strong analytical & troubleshooting skills.
- Experience with tools like JIRA, Confluence, Stash
- M.S. or relevant experience required.
Preferred to have:
- AWS Certification
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free