Skip to content
mimi

Java Software

Trantor

Thane · On-site Full-time Mid Level Yesterday

About the role

Software Engineer – SRE (Java Support)

Experience: 5–7 Years

Role Overview

We are looking for a Software Engineer – Site Reliability Engineering (SRE) with strong Java production support experience to ensure the stability, reliability, and performance of enterprise applications. The role involves monitoring production systems, troubleshooting incidents, performing root cause analysis, and supporting Java-based applications running on AWS infrastructure.

Responsibilities

  • Provide L2/L3 production support for Java-based applications.
  • Monitor application health, system performance, and service availability.
  • Investigate and resolve production incidents, bugs, and performance issues.
  • Perform root cause analysis (RCA) and implement preventive solutions.
  • Work closely with development teams to resolve application defects.
  • Analyze application logs and troubleshoot issues across distributed systems.
  • Support deployment activities and production releases.
  • Ensure system reliability, availability, and uptime as per SLA requirements.
  • Participate in on-call support rotations and incident management.
  • Maintain documentation for operational procedures and known issues.

Must-Have Skills

  • Strong experience in Java production support / application support.
  • Experience troubleshooting Java-based microservices applications.
  • Hands‑on experience with Spring Boot applications.
  • Experience with AWS cloud environment.
  • Knowledge of SQL and database troubleshooting.
  • Experience with log analysis tools (Splunk, ELK, or similar).
  • Experience in incident management and root cause analysis.
  • Understanding of Linux/Unix environments.

Good-to-Have Skills

  • Experience with Docker and Kubernetes.
  • Knowledge of CI/CD pipelines.
  • Experience with monitoring tools (CloudWatch, Prometheus, Grafana).
  • Familiarity with message queues such as Kafka or RabbitMQ.
  • Understanding of site reliability and observ

Requirements

  • Strong experience in Java production support / application support.
  • Experience troubleshooting Java-based microservices applications.
  • Hands-on experience with Spring Boot applications.
  • Experience with AWS cloud environment.
  • Knowledge of SQL and database troubleshooting.
  • Experience with log analysis tools (Splunk, ELK, or similar).
  • Experience in incident management and root cause analysis.
  • Understanding of Linux/Unix environments.

Responsibilities

  • Provide L2/L3 production support for Java-based applications.
  • Monitor application health, system performance, and service availability.
  • Investigate and resolve production incidents, bugs, and performance issues.
  • Perform root cause analysis (RCA) and implement preventive solutions.
  • Work closely with development teams to resolve application defects.
  • Analyze application logs and troubleshoot issues across distributed systems.
  • Support deployment activities and production releases.
  • Ensure system reliability, availability, and uptime as per SLA requirements.
  • Participate in on-call support rotations and incident management.
  • Maintain documentation for operational procedures and known issues.

Skills

AWSELKJavaKubernetesLinuxMicroservicesRabbitMQSQLSpring BootSplunkUnixDockerGrafanaKafkaPrometheus

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free