I
Site Reliability Engineer-Onsite@GTA
IH
Canada · On-site Contract 2w ago
About the role
Key Responsibilities
- Monitor and maintain production systems and infrastructure
- Improve system reliability, availability, and performance
- Automate operational tasks and deployment processes
- Troubleshoot application, server, and network issues
- Implement monitoring, alerting, and incident management solutions
- Support CI/CD pipelines and DevOps practices
Required Skills
- Strong knowledge of Linux/Unix administration
- Experience with AWS, Azure, or GCP cloud platforms
- Knowledge of Docker and Kubernetes
- Experience with Terraform, Ansible, or Infrastructure as Code tools
- Familiarity with monitoring tools like Prometheus, Grafana, Splunk, or ELK
- Scripting skills in Python, Bash, or Shell scripting
Skills
AnsibleAWSAzureBashDockerELKGCPGrafanaInfrastructure as CodeKubernetesLinuxPrometheusPythonShellSplunkTerraformUnix
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free