T

Lead Site Reliability Engineer

icon briefcase Job Type : Full Time

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
icon loader

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - Lead Site Reliability Engineer

About Ticketek Entertainment Group

Ticketek Entertainment Group is a global fan experience Company that tickets, promotes and delivers incredible live experiences that are impossible to forget.  In a distracted world where nothing beats real human moments, We make life better live!

Our Group includes; our Fan Experience Platform (Ticketek) that sells tickets and provides value added services, Event promoting with businesses across Touring (TEG Touring), Sport (TEG Sport), and Family Experiences (TEG Experiences) and our digital business (Ovation) which focuses on delivering seamless data-driven outcomes for our fans and partners

About the Role: 

We are hiring a Lead Site Reliability Engineers to join TEG's Technology department. Responsible in leading a team of SRE's, you will play a pivotal role in ensuring the exceptional health, stability, performance and cost-effective scalability of our global live entertainment platforms. In this role, you will apply software engineering principles to operations, proactively enhancing system reliability and preventing outages to deliver seamless experiences for our customers across our global ticketing platform.

(Please note: Candidates can apply from our office locations in Adelaide, Sydney, Melbourne, or Brisbane)

What does a day in the life look like?

Team Leadership & Mentorship: Lead, mentor, and develop a high-performing SRE team. Manage workload, set clear objectives, conduct performance reviews, and foster a collaborative, excellent engineering culture.

Strategic Ownership & Optimisation: Define and execute the SRE roadmap. Own the end-to-end availability, performance, scalability, and cost-efficiency of the production environment and all critical systems.

Observability & Cost Management: Drive the continuous enhancement and optimisation of the observability stack (monitoring, logging, tracing, alerting). Lead cost optimisation and reduce MTTD & MTTR through platform observability improvements.

Post-Incident Ownership & Improvement: Promote a continuous improvement culture. Lead post-incident reviews to ensure underlying causes are identified, corrective actions are prioritised, and lessons learned are applied to measurably reduce MTTD and MTTR via long-term strategic improvements.

Automation & Tooling: Collaborate with the SRE team to develop and implement automation and tooling (e.g., CloudFormation, Ansible, Terraform) to improve cloud management processes.

Cross-Functional Influence: Act as the primary SRE liaison, building strong partnerships with Development, Platform, and Systems Engineering leaders to influence architecture and foster a culture of shared responsibility and operational excellence.

About You 

Essential experience & skills

  • Mastery of highly available, fault-tolerant AWS system design and management.
  • Strong foundation in AWS networking (VPC, Route 53) and security best practices.
  • Proficiency in key scripting languages (Python, Bash, PowerShell) for automation.
  • Proven ability to perform effectively under pressure, managing high-volume tasks and meeting tight deadlines
  • Minimum of 3 years of prior SRE or DevOps experience.
  • Expert knowledge of fundamental infrastructure concepts (Networking, Containerisation, Virtualisation, DNS)
  • Working familiarity with key CI/CD and Infrastructure-as-Code tools (e.g., Terraform, Ansible, Jenkins)
  • Excellent verbal and written communication skills

Desirable Experience & Skills

  • Hands-on experience with the ELK Stack or advanced monitoring tools (Datadog/Grafana)
  • Relevant AWS certifications (e.g., AWS Certified SysOps Administrator or DevOps Engineer – Professional).
  • Demonstrated ability to optimise AWS costs while maintaining performance and reliability

Here’s a taste of what TEG offers: 

  • Complimentary event tickets
  • Birthday and volunteering leave
  • Wellbeing discounts & flu vaccinations
  • Paid parental leave & free employee support (EAP)
  • Global rewards and recognition
  • Learning, development & career pathways
  • A diverse, inclusive, and passionate team

Equal Opportunities

TEG is an equal opportunity employer committed to embrace diversity, respect, and care for our people and communities. 

If there are any adjustments that need to be made to ensure you have a fair and equitable experience in our recruitment process, please advise us when scheduling your interview. 

*Only direct applications will be considered. No recruiters please* 

Original job Lead Site Reliability Engineer posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Share Job
Share Job

Auto-Apply to Site Reliability Engineer Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Site Reliability Engineer Jobs in Australia

GrabJobs is the no1 job portal in Australia, connecting you to thousands of jobs fast! Find the best jobs in Australia, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.