Logo-of-Walt-Disney-Co.-hiring-for-jobs-in-India-on-GrabJobs

Manager Systems Reliability Engineering

icon briefcase Job Type : Full Time

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - Manager Systems Reliability Engineering

Job Posting Title:

Manager Systems Reliability Engineering

Req ID:

10153208

Job Description:

Job Summary

As the Manager of Site Reliability Engineering on the Infrastructure Reliability team, you will be responsible for building and leading a high-performing team dedicated to ensuring our infrastructure is reliable, scalable, and efficient. Your primary focus will be on people management, strategic planning, and technical leadership. You will mentor and guide your team members, fostering their professional growth and creating a culture of ownership and operational excellence. You will define the team's vision and roadmap, aligning it with the company's broader goals, and work with cross-functional partners to prioritize and execute projects. You will oversee the development of SRE solutions across our globally distributed environments and empowering your team to improve service resiliency, automate processes, and conduct effective incident response and capacity planning to guarantee the highest level of uptime and Quality of Service (QoS) for our internal customers.

Responsibilities and Duties of the Role:

  • Lead, mentor, and grow a team of software and infrastructure automation engineers.
  • Develop and execute the roadmap for the Infrastructure Reliability Engineering team.
  • Collaborate with engineering and operations teams to identify and prioritize reliability improvements.
  • Drive the design and implementation of tools and automation for infrastructure testing and self-healing.
  • Establish and monitor key performance indicators (KPIs) for infrastructure reliability.

Required Education, Experience/Skills/Training:

Basic Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience.
  • 12+ years of experience in a software engineering or infrastructure role.
  • 5+ years of experience in a leadership or management role.
  • Lead a team of Infrastructure Reliability Engineers on projects for users and be directly responsible for uptime.
  • Own end-to-end availability and performance of key services and build automation to prevent problem recurrence. Automate response to all non-exceptional service conditions.
  • Design, write and deliver software to improve the availability and efficiency of Disney's streaming infrastructure.
  • Set the standard for excellence by mentoring team members and establishing trust through superior technical delivery.
  • Proficiency in Kubernetes administration and modern CI/CD techniques and Infrastructure as Code (IaC).
  • Deep understanding of Linux operating systems and TCP/IP fundamentals.
  • Experience with monitoring, metrics gathering, APM, container management, and log collection tools.
  • Creative problem solver with excellent debugging skills and great documentation abilities.
  • Strong understanding of networking, storage, security, and compute technologies.

Preferred Qualifications

  • Experience building and leading a Site Reliability Engineering (SRE) or Infrastructure Reliability team.
  • Expertise with complex system architectures and infrastructures.
  • Proficiency in one or more programming languages (e.g., Python, Go, Java).
  • Passion for automation, scalability, and building reliable systems from the ground up.

Job Posting Segment:

Disney Entertainment and ESPN Product & Technology

Job Posting Primary Business:

Media Engineering

Primary Job Posting Category:

Site/System Reliability Engineer

Employment Type:

Full time

Primary City, State, Region, Postal Code:

Bangalore, India

Alternate City, State, Region, Postal Code:

Date Posted:

2026-06-11
Original job Manager Systems Reliability Engineering posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

Auto-Apply to Manager Systems Reliability Engineering Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Manager Systems Reliability Engineering Jobs in India

GrabJobs is the no1 job portal in India, connecting you to thousands of jobs fast! Find the best jobs in India, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.