T
Java Software
Trantor
Thane · On-site Full-time Mid Level Yesterday
About the role
Software Engineer – SRE (Java Support)
Experience: 5–7 Years
Role Overview
We are looking for a Software Engineer – Site Reliability Engineering (SRE) with strong Java production support experience to ensure the stability, reliability, and performance of enterprise applications. The role involves monitoring production systems, troubleshooting incidents, performing root cause analysis, and supporting Java-based applications running on AWS infrastructure.
Responsibilities
- Provide L2/L3 production support for Java-based applications.
- Monitor application health, system performance, and service availability.
- Investigate and resolve production incidents, bugs, and performance issues.
- Perform root cause analysis (RCA) and implement preventive solutions.
- Work closely with development teams to resolve application defects.
- Analyze application logs and troubleshoot issues across distributed systems.
- Support deployment activities and production releases.
- Ensure system reliability, availability, and uptime as per SLA requirements.
- Participate in on-call support rotations and incident management.
- Maintain documentation for operational procedures and known issues.
Must-Have Skills
- Strong experience in Java production support / application support.
- Experience troubleshooting Java-based microservices applications.
- Hands‑on experience with Spring Boot applications.
- Experience with AWS cloud environment.
- Knowledge of SQL and database troubleshooting.
- Experience with log analysis tools (Splunk, ELK, or similar).
- Experience in incident management and root cause analysis.
- Understanding of Linux/Unix environments.
Good-to-Have Skills
- Experience with Docker and Kubernetes.
- Knowledge of CI/CD pipelines.
- Experience with monitoring tools (CloudWatch, Prometheus, Grafana).
- Familiarity with message queues such as Kafka or RabbitMQ.
- Understanding of site reliability and observ
Requirements
- Strong experience in Java production support / application support.
- Experience troubleshooting Java-based microservices applications.
- Hands-on experience with Spring Boot applications.
- Experience with AWS cloud environment.
- Knowledge of SQL and database troubleshooting.
- Experience with log analysis tools (Splunk, ELK, or similar).
- Experience in incident management and root cause analysis.
- Understanding of Linux/Unix environments.
Responsibilities
- Provide L2/L3 production support for Java-based applications.
- Monitor application health, system performance, and service availability.
- Investigate and resolve production incidents, bugs, and performance issues.
- Perform root cause analysis (RCA) and implement preventive solutions.
- Work closely with development teams to resolve application defects.
- Analyze application logs and troubleshoot issues across distributed systems.
- Support deployment activities and production releases.
- Ensure system reliability, availability, and uptime as per SLA requirements.
- Participate in on-call support rotations and incident management.
- Maintain documentation for operational procedures and known issues.
Skills
AWSELKJavaKubernetesLinuxMicroservicesRabbitMQSQLSpring BootSplunkUnixDockerGrafanaKafkaPrometheus
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free