T

Senior Site Reliability Champion

icon briefcase Job Type : Full Time

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - Senior Site Reliability Champion

Core Responsibilities:

  • Evaluate applications, platforms, and vendors to assess resiliency, reliability, and operational risk.

  • Design and implement processes that enforce enterprise resiliency and reliability standards.

  • Lead blameless post‑incident reviews for high‑severity incidents or incidents spanning multiple complex product families.

  • Partner with product and platform teams to proactively identify and remediate reliability risks before they impact clients.

  • Develop, communicate, and evangelize new standards, tools, and frameworks across subdivisions, ensuring consistent adoption.

  • Troubleshoot complex production issues and implement durable solutions that prevent recurrence.

  • Participate in a periodic on‑call rotation to support production stability.

  • Evaluate and onboard resiliency and reliability tooling.

  • Actively participate in reliability engineering and resilience communities of practice, contributing to shared learning and enterprise consistency.

  • Contribute to strategic initiatives that advance Vanguard’s operational maturity and resiliency posture.

 

Qualifications | Technical Skills:

  • Observability Platforms: Experience with modern observability and monitoring tools, such as Splunk, Honeycomb, CloudWatch, Dynatrace, or AppDynamics.

  • Reliability Metrics: Strong understanding of SLIs, SLOs, and SLAs, including dashboarding and reporting practices.

  • Monitoring & Alerting: Experience with alert design, anomaly detection, predictive alerting, and synthetic monitoring using structured methodologies.

  • Automation & Resilience Engineering: Experience with automation and resilience practices such as Python-based automation, RPA platforms (e.g., Blue Prism, UiPath), chaos engineering, and failure analysis techniques (e.g., FMEA).

Special Factors

Sponsorship

Vanguard is not offering visa sponsorship for this position.

About Vanguard

At Vanguard, we don't just have a mission—we're on a mission.

To work for the long-term financial wellbeing of our clients. To lead through product and services that transform our clients' lives. To learn and develop our skills as individuals and as a team. From Malvern to Melbourne, our mission drives us forward and inspires us to be our best.

How We Work

Vanguard has implemented a hybrid working model for the majority of our crew members, designed to capture the benefits of enhanced flexibility while enabling in-person learning, collaboration, and connection. We believe our mission-driven and highly collaborative culture is a critical enabler to support long-term client outcomes and enrich the employee experience.

Original job Senior Site Reliability Champion posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

Auto-Apply to Senior Site Reliability Champion Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Senior Site Reliability Champion Jobs in the US

GrabJobs is the no1 job portal in the US, connecting you to thousands of jobs fast! Find the best jobs in the US, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.