Site Reliability Engineer

Rapinno Tech

Piscataway · flexible Contract Mid Level 2mo ago

About the role

Description

As a member of the Platform as a Service team, you will be responsible for the design and development of medium to highly complex systems. This includes the design and implementation of infrastructure from specifications, configuration and deployment of applications, connecting to back-end resources, and advanced troubleshooting of moderately complex software applications. Deployment, middleware administration and operational support of (production, staging, test and development) environments for multiple projects using WebSphere, Weblogic, and Tomcat Application Server. Monitors systems capacity and performance, plans and executes disaster recovery procedures, and provides Tier 2 technical support.

In addition, this role requires the candidate to be highly flexible in hours of work because of its customer-facing, highly available infrastructure requirements. Work closely with Dev, QA and production support team members to align and orchestrate resolutions on open issues/defects. Provides high level written communications to upper management regarding production issues.

Required Skills

3-5 years managing and administrating middleware technologies(Weblogic, Websphere, Tomcat).
3+ years hands-on experience with Solaris, Linux (RHEL, CentOS, Ubuntu), in bare-metal and Cloud-based infrastructure (AWS, OpenStack)
Experience with cloud platforms AWS( Auto scaling , AVI, security, EC2 , EFS , EBS , S3 , KMS)
Strong experience with Installing IBM WebSphere MQ and creating multi instance Queue manager in AWS by using EBS/EFS volumes, creating MQ objects, clusters, channels etc.
Experience with configuring the clustered Queue managers for HA and load-balancing as well troubleshooting in clustered environment
Installing open source Rabbit MQ on AWS EC2 instances with the use of CFTs/ansible and automating it by using Jenkins. Also creating Classic Load balancer to distribute traffic among those Rabbit MQ instances
Experience with migrating applications from monolithic to kubernetes container platform
Experience with APIGEE Proxy configurations and troubleshooting
Hands on experience with CI/CD tools such as Jenkins, Ansible
Working knowledge of monitoring tools like CA Wily, New Relic, and Datadog
Experience with Elasticsearch, Kibana, and Logstash
Execution on all release engineering aspects of DevOps including the configuration management , Build and Deployment Management, Continuous Integration and Delivery
Ansible based deployment and configuration automation solutions.
Experience with web based services and protocols ( HTTP , HTTPS, REST , Apache , Tomcat)
Experience with micro-service architectures and deployment.
Knowledge on L2/L3 protocols , IPv4/IPv6 and TCP/IP stack .
Proficiency in high level script languages (Python preferred) as well as script environments like bash
Experience with DevOps workflow automation (Jenkins, Ansible, Puppet)
Strong analytical & troubleshooting skills.
Experience with tools like JIRA, Confluence, Stash
M.S. or relevant experience required.

Preferred to have:

AWS Certification

Skills

AnsibleAPIGEEApacheAWSBashCA WilyCentOSCI/CDConfluenceDatadogDevOpsDockerElasticsearchEBSEC2EFSHTTPHTTPSIBM WebSphere MQJIRAJenkinsKibanaKubernetesLinuxLogstashMicroservicesNew RelicOpenStackPythonPuppetRabbitMQRESTRHELS3Shell scriptingSolarisStashTCP/IPTomcatUbuntuWeblogicWebSphereWebSphere MQ

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free

Site Reliability Engineer

About the role

Description

Required Skills

Preferred to have:

Skills

Similar roles

Fullstack Software Architect / Lead Engineer

Java Backend Engineer (all gender)

Senior Mobile Developer (w/m/d) - iOS (ab 32h/Woche)

Don't send a generic resume