Skip to content
mimi

Lead Site Reliability Developer - CSRE Consulting

Live Nation

Remote · Canada Full-time Lead 3w ago

About the role

THE TEAM

A career at Ticketmaster will challenge and engage you. We support the creators and producers of shows and live performances, while connecting more passionate fans to these events. The pace here is fast, the atmosphere is fun and a passion for live events is a common thread that ties us together. As a global and growing business, we can truly offer a world of opportunities to expand your skills and develop your career. Visit any of our offices and you'll find a diverse mix of passionate employees, helping fans around the globe connect with the artists, teams and events they love. It truly is a unique and rewarding environment.

You will be part of the Central SRE Consulting team, which partners with product and platform engineering teams throughout Ticketmaster to improve reliability, resilience, and sustainable engineering practices. We often deliver through work that combine hands-on delivery with capability building so teams can sustain improvements independently. The team’s remit is to increase adoption and maturity of SRE principles across Ticketmaster and ensure our services are appropriately scaled and reliable.

We support teams across the globe, with many peers in the USA. Most of your teammates operate in UTC/UTC+1, and we are adding people in other time zones.

THE JOB

As a Lead Site Reliability Developer in CSRE Consulting, you will lead reliability consulting work across multiple teams or a domain, aligning stakeholders on priorities and driving delivery of sustained improvements. You will translate reliability goals into sequenced workstreams, align dependencies, and ensure teams can maintain the mechanisms after you move on.

You will mentor other consultants, codify reusable patterns, and influence shared platforms so reliability improvements propagate beyond any single team or engagement.

WHAT YOU WILL BE DOING

  • Lead consulting work from discovery through delivery by aligning stakeholders on priorities, sequencing work, and communicating measurable outcomes.
  • Establish working cadence and facilitate decision forums to surface risks, map dependencies, and drive clear ownership and timelines.
  • Align product, platform, and engineering stakeholders on reliability targets and trade-offs using SLOs and error budgets.
  • Partner regularly with Engineering Managers, product managers, Staff and Principal engineers, and platform leads to keep dependencies, decisions, and delivery aligned.
  • Identify systemic risks across shared dependencies and coordinate remediation across multiple teams to reduce recurring incidents.
  • Drive change adoption by embedding reliability mechanisms into partner team routines such as planning, PRRs, and on-call practices.
  • Design and implement reusable reliability mechanisms, templates, and tools that can be adopted by teams across the organization.
  • Establish and evolve production readiness review practices with partner teams to improve launch quality and change safety.
  • Lead observability strategy for partner domains by improving signal quality, alerting philosophy, and operational dashboards.
  • Conduct complex incident investigations and ensure learnings translate into durable fixes with clear owners and verification.
  • Lead reliability-focused design and code reviews and guide teams toward simpler, safer architectures.
  • Mentor senior developers and other consultants through coaching, reviews, and structured coaching to multiply impact.
  • Partner with internal platform engineering to influence roadmaps and deliver shared capabilities that accelerate SRE adoption.
  • Improve CSRE consulting guides and operational practices based on repeated patterns observed across teams.

WHAT YOU NEED TO KNOW (or TECHNICAL SKILLS)

  • Deep practical understanding of SRE principles, including governance of SLOs and error budget policy in practice.
  • Demonstrated ability to lead cross-team technical work and influence without authority.
  • Strong experience designing and troubleshooting distributed systems with inter-service failure modes.
  • Experience defining observability and alerting strategy and improving operational signal quality.
  • Solid experience with Kubernetes and AWS, including governance and cost trade-offs.
  • Ability to design reliability automations and tooling that are reusable and adopted by multiple teams.
  • Experience leading production readiness and resilience practices, including business continuity validation and controlled testing.
  • Excellent software development fundamentals with the ability to deliver and review high-quality changes to enterprise codebases.
  • Advanced incident analysis skills focused on reducing systemic risk and organizational learning.
  • Excellent communication skills, including executive-ready summaries and clear technical diagrams.

YOURSELF (BEHAVIORAL SKILLS)

  • Lead with service and humility, creating clarity and momentum without relying on authority.
  • Build relationships across teams and functions, and set clear expectations for how you collaborate and deliver.
  • Facilitate alignment by framing problems, surfacing trade-offs, and running working sessions that result in decisions.
  • Persuade based on evidence and empathy, adapting your message to developers, product teams, and key decision-makers.
  • Act as a mentor and coach intentionally, helping others develop their reliability thinking and consulting skills.
  • Maintain psychological safety while raising the bar, giving direct feedback respectfully.
  • Remain persistent and patient in complex organizations, keeping work moving forward despite slow dependencies.
  • Learn to manage ambiguity with ease and translate fuzzy information into clear plans, options, and next steps.
  • Foster simple mechanisms that enable extension, not bespoke solutions that require you to maintain them.
  • Operate at a sustainable pace and discourage hero culture by designing systems that don’t require it.
  • Take pride in quality, including documentation and decision records that help teams sustain work.
  • Demonstrate adaptability by switching between hands-on debugging, stakeholder management, and planning as needed.

LIFE AT TICKETMASTER

We are proud to be part of Live Nation Entertainment, the world’s largest live entertainment company.

At Ticketmaster, our vision is to connect people around the world to the live events they love. As the world’s largest ticket marketplace and the premier provider of global event ticketing technologies, we’re uniquely positioned to deliver on this vision.

We do all of this with an intense passion for live events and an inspiring, diverse culture that’s fueled by accessible leaders, caring managers, and enthusiastic teams. If you’re passionate about live events like we are and want to work for a company dedicated to helping millions of fans experience them, we want to hear from you.

Our work is guided by our values:

  • Reliability – We understand that fans and clients count on us to power their event experiences, and we count on each other to deliver.
  • Teamwork – We believe that individual accomplishments pale in comparison to the level of success that a team can achieve.
  • Integrity – We are committed to the highest ethical and moral standards on behalf of the countless partners and stakeholders we represent.
  • Belonging – We are committed to building a culture where all individuals can be their authentic selves, have an equal voice, and opportunities to thrive.

EQUAL OPPORTUNITY EMPLOYER

We are passionate and committed to our employees and go beyond the rhetoric of diversity and inclusion. You will work in an inclusive environment and be encouraged to bring your full self to work. We will do our best to help you find a work-life balance. As a growing company, we will encourage you to develop your professional and personal aspirations, take on new experiences and learn from the talented people you will be working with. It is the talents that matter to us and we encourage applications from people regardless of gender, race, sexual orientation, religion, age, disability or family responsibilities.

Ticketmaster Canada will provide reasonable accommodations to qualified individuals with disabilities in the job application process. If you are having difficulty using our online application system and need an accommodation due to a disability, please email accessibilitycanada@ticketmaster.ca. Please note that this inbox is limited to accommodation requests. Resumes submitted to this inbox will not be considered.

LANGUAGE REQUIREMENTS

Due to the nature of the role, this position involves interacting with English-speaking clients, colleagues, stakeholders, vendors, etc. As such, a high level of English proficiency, both spoken and written, is required. While strong English proficiency is essential for this role, candidates with a working knowledge of French are also encouraged to apply.

Skills

AWSKubernetes

Don't send a generic resume

Paste this job description into Mimi and get a resume tailored to exactly what the hiring team is looking for.

Get started free