Skip to content
mimi

Site Reliability Engineer

HCLTech

Caraquet · flexible Full-time Mid Level 2w ago

About the role

About

Join our SRE L2 squad supporting ~1000 AWS-hosted services. You’ll own operational reliability, rapid triage, and proactive maintenance across production and non-prod, partnering closely with Cloud Engineering, SOC, and application teams.

Key Responsibilities

  • Deliver 24×7 monitoring, incident response, and problem management; drive MTTA/MTTR reduction and SLO/SLI adherence.
  • Perform preventive health checks; analyze ticket trends to implement continual service improvements and automation to reduce toil.
  • Execute blameless postmortems and high-quality RCA; maintain SOPs/runbooks and reliability dashboards.
  • Configure/tune observability (Dynatrace, Cloud Watch, ELK); enable self-healing workflows and workload optimizations.
  • Support change/service requests within agreed SLAs; collaborate during transitions and onboard new AWS services.

Core Skills & Tools

  • AWS: Lambda, ECS/Fargate/EC2, API Gateway, SNS/SQS, Kinesis, RDS; IAM/KMS foundations.
  • Observability & ITSM: Dynatrace, Cloud Watch, ELK; Service Now for incidents/changes; SLI/SLO dashboards.
  • Toil Reduction
  • Reliability Practices: Error budgets, capacity/performance benchmarking, automation/runbook execution, Fin Ops awareness.

Qualifications

  • 5+ years SRE/Dev Ops or L2 operations for cloud-native stacks; strong AWS production experience.
  • Proven incident/change/problem management in 24×7 environments; adept at RCA and postmortems.
  • Hands‑on with observability tooling and operational automation; excellent collaboration and documentation skills.

Shift Coverage & Locations

Follow-the-sun model with overlapping handoffs across Canada/India to ensure continuous support. Success is measured by uptime, MTTR/MTTD, change failure rate, error‑budget consumption, SLO adherence, RCA quality, and CSI throughput.

Skills

API GatewayAWSCloud WatchECSELKFargateIAMKinesisKMSLambdaRDSService NowSNSSQSDynatrace

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free