Skip to content
mimi

Site Reliability Manager

PNC

Pittsburgh · On-site Full-time Lead $65k – $194k/yr Yesterday

About the role

About

At PNC, our people are our greatest differentiator and competitive advantage in the markets we serve. We are all united in delivering the best experience for our customers. We work together each day to foster an inclusive workplace culture where all of our employees feel respected, valued and have an opportunity to contribute to the company’s success. As a Site Reliability Manager within PNC’s Site Reliability Center (SRC), you will be based in Dallas, TX, Pittsburgh, PA, Strongsville, OH or Phoenix, AZ.

We’re looking for a highly skilled and proactive Technical Manager to lead a dynamic team supporting PNC’s enterprise‑grade middleware platforms. As a key player in IT Production Support, you’ll drive incident resolution, platform stability, and cross‑functional collaboration across RND, Test, QA, and Production environments. A key aspect to this position is enabling teams in utilizing the principals and practices of Site Reliability Engineering.

Middleware & Tooling Supported

  • WebSphere
  • Tomcat
  • IIS
  • uDeploy
  • GIT / Bitbucket
  • Jenkins / CI Pipeline

Daily Responsibilities

  • Provide support for critical applications across all environments and lines of business running on PNC's middleware platform.
  • Lead incident troubleshooting, resolution, and root cause analysis for middleware‑related issues.
  • Partner with vendors, platform engineers, and cross‑functional tech teams to resolve complex problems efficiently.
  • Analyze application and server configurations, identify performance/stability risks, and drive remediation efforts.
  • Detect recurring incident patterns and propose long‑term solutions to reduce operational toil.
  • Guide and validate change requests, including technical impact analysis and risk mitigation aligned with PNC’s framework.
  • Proactively monitor logs and events using Big Panda and other dashboards to identify potential issues before they impact operations.
  • Develop, document, and improve workflows, policies, and procedures for streamlined incident response and knowledge transfer.
  • Support patching cycles by identifying middleware and OS‑related issues (WebSphere, Tomcat, Linux) and working with relevant teams to implement fixes.
  • Actively participate in incident bridges and technical war rooms to ensure timely recovery and escalation.

Additional Responsibilities

  • Leads a team of Site Reliability Engineers in implementing, maintaining, and improving robust monitoring response sites and infrastructure applications.
  • Recommends and facilitates the implementation of infrastructure enhancements as required to maintain the performance of sites in response to business growth and strategy.
  • Streamlines the deployment process by introducing automated configuration management tools, resulting in a reduction in deployment time and increased efficiency.
  • Oversees robust technical solutions for complex business and application challenges, while helping to define and communicate technical standards and best practices. Manages and oversees proactive reviews and audits of production sites, issue triage and follow up.
  • Leads in the collaboration with cross‑functional teams to design and implement scalable and highly available infrastructure.
  • Maximizes staff contribution through professional growth and development, to increase teamwork and more effectively meet business needs.

Qualifications

Preferred Skills

  • Application Development
  • Business Management
  • Customer Solutions
  • Design
  • Group Problem Solving
  • Middleware
  • Process Improvements
  • Release Management
  • Software Solutions
  • Time Management
  • User Experience (UX) Design

Competencies

  • Analytical Thinking
  • Application Design
  • Architecture
  • Application Maintenance
  • Application Testing
  • Emerging Technologies
  • Innovation
  • IT Industry: Trends & Directions
  • IT Standards, Procedures & Policies
  • Software Process Improvement (SPI)
  • Software Reliability Management
  • Technical Troubleshooting

Work Experience

Roles at this level typically require a university / college degree, with 5+ years of industry‑relevant experience. At least 3 years of prior management experience is typically required. In lieu of a degree, a comparable combination of education, job specific certification(s), and experience (including military service) may be considered.

Education

  • Bachelors

Certifications

  • No Required Certification(s)

Licenses

  • No Required License(s)

Pay Transparency

  • Base Salary: $65,000.00 – $194,350.00
  • Salaries may vary based on geographic location, market data and on individual skills, experience, and education. This role is incentive eligible with the payment based upon company, business and/or individual performance.

Benefits

PNC offers a comprehensive range of benefits to help meet your needs now and in the future. Depending on your eligibility, options for full‑time employees include: medical/prescription drug coverage (with a Health Savings Account feature), dental and vision options; employee and spouse/child life insurance; short and long‑term disability protection; 401(k) with PNC match, pension and stock purchase plans; dependent care reimbursement account; back‑up child/elder care; adoption, surrogacy, and doula reimbursement; educational assistance, including select programs fully paid; a robust wellness program with financial incentives.

In addition, PNC generally provides the following paid time off, depending on your eligibility: maternity and/or parental leave; up to 11 paid holidays each year; 9 occasional absence days each year, unless otherwise required by law; between 15 to 25 vacation days each year, depending on career level; and years of service.

Disability Accommodations Statement

If an accommodation is required to participate in the application process, please contact us via email at AccommodationRequest@pnc.com. Please include “accommodation request” in the subject line title and be sure to include your name, the job ID, and your preferred method of contact in the body of the email. Emails not related to accommodation requests will not receive responses. Applicants may also call 877‑968‑7762 and say “Workday” for accommodation assistance. All information provided will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations.

At PNC we foster an inclusive and accessible workplace. We provide reasonable accommodations to employment applicants and qualified individuals with a disability who need an accommodation to perform the essential functions of their positions.

Equal Employment Opportunity (EEO)

PNC provides equal employment opportunity to qualified persons regardless of race, color, sex, religion, national origin, age, sexual orientation, gender identity, disability, veteran status, or other categories protected by law.

California Residents

Refer to the California Consumer Privacy Act Privacy Notice to gain understanding of how PNC may use or disclose your personal information in our hiring practices.

Requirements

  • 5+ years of industry-relevant experience.
  • At least 3 years of prior management experience is typically required.

Responsibilities

  • Provide support for critical applications across all environments and lines of business running on PNC's middleware platform.
  • Lead incident troubleshooting, resolution, and root cause analysis for middleware-related issues.
  • Partner with vendors, platform engineers, and cross-functional tech teams to resolve complex problems efficiently.
  • Analyze application and server configurations, identify performance/stability risks, and drive remediation efforts.
  • Detect recurring incident patterns and propose long-term solutions to reduce operational toil.
  • Guide and validate change requests, including technical impact analysis and risk mitigation aligned with PNC’s framework.
  • Proactively monitor logs and events using Big Panda and other dashboards to identify potential issues before they impact operations.
  • Develop, document, and improve workflows, policies, and procedures for streamlined incident response and knowledge transfer.
  • Support patching cycles by identifying middleware and OS-related issues (WebSphere, Tomcat, Linux) and working with relevant teams to implement fixes.
  • Actively participate in incident bridges and technical war rooms to ensure timely recovery and escalation.
  • Leads a team of Site Reliability Engineers in implementing, maintaining, and improving robust monitoring response sites and infrastructure applications.
  • Recommends and facilitates the implementation of infrastructure enhancements as required to maintain the performance of sites in response to business growth and strategy.
  • Streamlines the deployment process by introducing automated configuration management tools, resulting in a reduction in deployment time and increased efficiency.
  • Oversees robust technical solutions for complex business and application challenges, while helping to define and communicate technical standards and best practices.
  • Manages and oversees proactive reviews and audits of production sites, issue triage and follow up.
  • Leads in the collaboration with cross-functional teams to design and implement scalable and highly available infrastructure.
  • Maximizes staff contribution through professional growth and development, to increase teamwork and more effectively meet business needs.

Benefits

dental_coveragepaid_time_offhealth_insurance

Skills

Big PandaCI PipelineDockerGITIISJenkinsLinuxTomcatuDeployWebSphere

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free