Senior Site Reliability Engineer- EN
RBC
About the role
What is the opportunity?
This role will be responsible for the development, implementation, and support of Site Reliability Engineering (SRE) solutions for applications supported by the Digital Branch SRE organization. As the Engineering arm of the Digital Branch SRE organization, this team will work collaboratively with the Delivery arm of the same organization and any other IT partners required to succeed in its mandate. The incumbent will need intermediate knowledge and experience working in an application development and/or technology operations organization. Perform production support role and partner with the SRE Delivery team in incident management and problem management. Job Description
This role will be responsible for the development, implementation, and support of Site Reliability Engineering (SRE) solutions for applications supported by the Digital Branch SRE organization. As the Engineering arm of the Digital Branch SRE organization, this team will work collaboratively with the Delivery arm of the same organization and any other IT partners required to succeed in its mandate. The incumbent will need intermediate knowledge and experience working in an application development and/or technology operations organization. Perform production support role and partner with the SRE Delivery team in incident management and problem management. What will you do? • Participate in code and non-functional (performance, security, maintainability, compliance, change management) reviews of all production-bound SRE solutions • Ensure problems are quickly identified and solved through review of Zeke / Splunk / Dynatrace / Salesforce monitoring, inbound calls, email or ServiceNow tickets while providing the highest possible level of production support • Drive transformation by continuously looking for ways to automate existing processes • Track, audit, monitor, and implement technical work streams • Act as portfolio SME (Subject Matter Expert) – understand & document common components, core functionalities, and infrastructure of supported applications • Be an escalation point in the on‑call rotation, and support our maintenance, scheduled work, support and release deployment requirements • Drive in incident management and problem management for applications in scope and RCA Action items fulfillment/ownership • Focus on Continuous improvement and technical standards – Drive improvements in productivity, monitoring, tooling, and best practices • Manage technology currency (server patching, certificate renewal, compliance, etc.) with a keen eye on automating opportunities • In this role, you will communicate and interact frequently with RBC partners and/or employees located across Canada and/or worldwide. What do you need to succeed? Must have • 5+ years of working experience in Site Reliability Engineering (SRE) and best practices for running and maintaining critical systems, including monitoring, alerting, and incident management • Intermediate experience in a variety of environments (Cloud, Linux/Unix/Windows and services/APIs, databases) • Working experience with scripting ideally in Java/.NET and SQL • Strong expertise in major incident handling and communication. Issue investigation skills. • Effective negotiation skills, stakeholder management • Ability to influence the squad at an SRE level • Hands‑on experience in a variety of SRE languages and tools (Ansible, Dynatrace Managed, Moog, PagerDuty, ServiceNow, GitHub, Slack, Elastic, Logstash, Kibana, Blue Prism, Catch Point) • Ability to work in a 7x24x365 work environment NICE‑TO‑HAVE • Knowledge of KAFKA, OCP, SCON infrastructure & processes • Knowledge of cloud platform applications and processes What’s in it for you?
We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper. We care about each other, reaching our potential, making a difference to our communities, and achieving success that is mutual. • A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable • Leaders who support your development through coaching and managing opportunities • Flexible work/life balance options • Opportunities to do challenging work • Opportunities to take on progressively greater accountabilities
#TechPJ Job Skills
Agile Methodology, Application Infrastructure, Group Problem Solving, IT Automation, IT Monitoring, Operations Support, Production Support, Software Development Life Cycle (SDLC), Software Engineering, Software Product Technical Knowledge, System Applications, Systems Software Additional Job Details
Address: 1 PLACE VILLE MARIE: MONTRÉAL
City: Montréal
Country: Canada
Work hours/week: 37.5
Employment Type: Full time
Platform: TECHNOLOGY AND OPERATIONS
Job Type: Regular
Pay Type: Salaried
Posted Date:
Application Deadline:
Note: Applications will be accepted until 11:59 PM on the day prior to the application deadline date above Inclusion and Equal Opportunity Employment
At RBC, we believe an inclusive workplace that has diverse perspectives is core to our continued growth as one of the largest and most successful banks in the world. Maintaining a workplace where our employees feel supported to perform at their best, effectively collaborate, drive innovation, and grow professionally helps to bring our Purpose to life and create value for our clients and communities. RBC strives to deliver this through policies and programs intended to foster a workplace based on respect, belonging and opportunity for all. Join our Talent Community
Stay in-the-know about great career opportunities at RBC. Sign up and get customized info on our latest jobs, career tips and Recruitment events that matter to you.
Expand your limits and create a new future together at RBC. Find out how we use our passion and drive to enhance the well‑being of our clients and communities at jobs.rbc.com
RBC is presently inviting candidates to apply for this existing vacancy. Applying to this posting allows you to express your interest in this current career opportunity at RBC. Qualified applicants may be contacted to review their resume in more detail.
#J-18808-Ljbffr
Requirements
- 5+ years of working experience in Site Reliability Engineering (SRE) and best practices for running and maintaining critical systems
- Intermediate experience in a variety of environments (Cloud, Linux/Unix/Windows and services/APIs, databases)
- Working experience with scripting ideally in Java/.NET and SQL
- Strong expertise in major incident handling and communication
- Effective negotiation skills, stakeholder management
- Ability to influence the squad at an SRE level
- Hands-on experience in a variety of SRE languages and tools
Responsibilities
- Participate in code and non-functional reviews of all production-bound SRE solutions
- Ensure problems are quickly identified and solved through review of monitoring tools
- Drive transformation by continuously looking for ways to automate existing processes
- Track, audit, monitor, and implement technical work streams
- Act as portfolio SME and understand & document common components, core functionalities, and infrastructure of supported applications
- Be an escalation point in the on-call rotation, and support maintenance, scheduled work, support and release deployment requirements
- Drive incident management and problem management for applications in scope and RCA Action items fulfillment/ownership
- Focus on Continuous improvement and technical standards
- Manage technology currency with a keen eye on automating opportunities
Benefits
Skills
Don't send a generic resume
Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.
Get started free