Lead Site Reliability Engineer

Job Description - Lead Site Reliability Engineer

About Ticketek Entertainment Group

Ticketek Entertainment Group is a global fan experience Company that tickets, promotes and delivers incredible live experiences that are impossible to forget. In a distracted world where nothing beats real human moments, We make life better live!

Our Group includes; our Fan Experience Platform (Ticketek) that sells tickets and provides value added services, Event promoting with businesses across Touring (TEG Touring), Sport (TEG Sport), and Family Experiences (TEG Experiences) and our digital business (Ovation) which focuses on delivering seamless data-driven outcomes for our fans and partners

About the Role:

We are hiring a Lead Site Reliability Engineers to join TEG's Technology department. Responsible in leading a team of SRE's, you will play a pivotal role in ensuring the exceptional health, stability, performance and cost-effective scalability of our global live entertainment platforms. In this role, you will apply software engineering principles to operations, proactively enhancing system reliability and preventing outages to deliver seamless experiences for our customers across our global ticketing platform.

(Please note: Candidates can apply from our office locations in Adelaide, Sydney, Melbourne, or Brisbane)

What does a day in the life look like?

Team Leadership & Mentorship: Lead, mentor, and develop a high-performing SRE team. Manage workload, set clear objectives, conduct performance reviews, and foster a collaborative, excellent engineering culture.

Strategic Ownership & Optimisation: Define and execute the SRE roadmap. Own the end-to-end availability, performance, scalability, and cost-efficiency of the production environment and all critical systems.

Observability & Cost Management: Drive the continuous enhancement and optimisation of the observability stack (monitoring, logging, tracing, alerting). Lead cost optimisation and reduce MTTD & MTTR through platform observability improvements.

Post-Incident Ownership & Improvement: Promote a continuous improvement culture. Lead post-incident reviews to ensure underlying causes are identified, corrective actions are prioritised, and lessons learned are applied to measurably reduce MTTD and MTTR via long-term strategic improvements.

Automation & Tooling: Collaborate with the SRE team to develop and implement automation and tooling (e.g., CloudFormation, Ansible, Terraform) to improve cloud management processes.

Cross-Functional Influence: Act as the primary SRE liaison, building strong partnerships with Development, Platform, and Systems Engineering leaders to influence architecture and foster a culture of shared responsibility and operational excellence.

About You

Essential experience & skills

Mastery of highly available, fault-tolerant AWS system design and management.
Strong foundation in AWS networking (VPC, Route 53) and security best practices.
Proficiency in key scripting languages (Python, Bash, PowerShell) for automation.
Proven ability to perform effectively under pressure, managing high-volume tasks and meeting tight deadlines
Minimum of 3 years of prior SRE or DevOps experience.
Expert knowledge of fundamental infrastructure concepts (Networking, Containerisation, Virtualisation, DNS)
Working familiarity with key CI/CD and Infrastructure-as-Code tools (e.g., Terraform, Ansible, Jenkins)
Excellent verbal and written communication skills

Desirable Experience & Skills

Hands-on experience with the ELK Stack or advanced monitoring tools (Datadog/Grafana)
Relevant AWS certifications (e.g., AWS Certified SysOps Administrator or DevOps Engineer – Professional).
Demonstrated ability to optimise AWS costs while maintaining performance and reliability

Here’s a taste of what TEG offers:

Complimentary event tickets
Birthday and volunteering leave
Wellbeing discounts & flu vaccinations
Paid parental leave & free employee support (EAP)
Global rewards and recognition
Learning, development & career pathways
A diverse, inclusive, and passionate team

Equal Opportunities

TEG is an equal opportunity employer committed to embrace diversity, respect, and care for our people and communities.

If there are any adjustments that need to be made to ensure you have a fair and equitable experience in our recruitment process, please advise us when scheduling your interview.

*Only direct applications will be considered. No recruiters please*

Original job Lead Site Reliability Engineer posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.

Share Job

Get your Resume Reviewed for Free

Similar Site Reliability Engineer Jobs in Australia

Get your Resume Reviewed for Free

Email address

Why are you reporting this job?

I think it’s a discriminatory or offensive

I think it’s fraudulent or a scam

I think it’s trying to sell something unrelated to the job / it’s asking for money

I think it contains incorrect or broken information

Other

All Job Ads are subject to GrabJobs’s Terms of Service. We allow users to flag postings that may be in violation of those terms. Job Ads may also be flagged by GrabJobs moderation team. However, no moderation system is perfect, and flagging a posting does not ensure that it will be removed.

Setup your job alert:

Frequency

By activating job alerts, I agree to GrabJobs Terms & Privacy Policy. I can unsubscribe to job alerts anytime. Skip

Lead Site Reliability Engineer

Job Description - Lead Site Reliability Engineer

Similar Site Reliability Engineer Jobs in Australia

Mobile Apps