Staples India Business Innovation Hub

Site Reliability Manager

Job Type: Full Time

Job Description - Site Reliability Manager

Role Summary
As a Manager of Information Technology at Staples, you will collaborate with a business-critical team of engineers responsible for the performance and availability of the B2B and B2C sites of one of the top eCommerce companies in the United States. You will be a key contributor to the success of our Public Cloud Adoption initiative, a program that will drive critical technology and tangible business value using the latest cloud technologies. We are looking for a highly motivated and experienced Site Reliability and Performance Engineering leader who wants to grow their career and work with cutting-edge tools and technologies. The candidate must have a proven track record of supporting B2B and B2C sites and their integrations, both on-premises and in the public cloud, with demonstrated expertise in related technologies.

Duties & Responsibilities
Oversee the day-to-day operations of the Site Reliability and Performance Engineering team.
Set clear team goals, supervise, and manage the team.
Provide technical leadership and mentoring to team members.
Engage and collaborate with cross-functional Product, Engineering, Security, Operations, and Infrastructure teams and vendors to improve MTTD and MTTR.
Design, develop, and implement infrastructure and application monitoring to ensure optimal platform availability and performance.
Design and execute performance testing strategies, including load testing, stress testing, and capacity planning, using tools such as JMeter, Locust, and LoadRunner.
Automate performance testing within CI/CD pipelines to ensure continuous validation.
Research, analyze, and recommend approaches for solving challenging operational issues.
Develop and maintain robust knowledge documentation for the Site Reliability Engineering team and its partners.
Proactively perform analysis and identify opportunities to innovate, automate, improve efficiency, and achieve cost savings.
Foster innovation by encouraging new ideas and technologies within the team.
Ensure compliance with company standards and industry best practices.
Periodically review and assess the team's performance, providing feedback and facilitating professional growth.


Requirements

Basic Qualifications
Bachelor’s degree in Computer Science or related field with continuous and progressive experience
Minimum of 8 years of related experience working with these technologies:
Application Performance Management and Monitoring tools such as New Relic, AppDynamics, SiteSpect, and Datadog
Content Delivery: Akamai
Infrastructure monitoring tools like Zabbix and Prometheus
Databases, e.g., MongoDB, Oracle, Couchbase, Redis, MySQL
Frameworks such as Dust/Angular, Node.js, Spring Boot
Log analytics tools like Splunk and ELK/Elastic
Digital experience tools like FullStory
Performance testing tools such as JMeter, LoadRunner, etc.
Performance tuning experience with Tomcat, Node.js and Spring Boot.
Strong understanding of non-functional requirements, performance testing processes, and defect tracking.
8+ years of experience with cloud technologies, at least half of which should be on the Microsoft Azure platform.
Strong hands-on experience with infrastructure and services (systems, network, cloud technology, provisioning, storage, etc.).
Strong experience programming in one or more scripting languages or tools (Python, Azure CLI, or PowerShell).
Hands-on experience with tool sets for automation, orchestration, and infrastructure management (Terraform, Puppet, Ansible, or Jenkins).
Experience with configuring, deploying, and administering infrastructure and application monitoring tools that assist in troubleshooting performance and stability issues in a cloud environment.

Preferred Qualifications
Master’s degree in Computer Science, Software Engineering, or a related field.
Certifications in project management or specific software development methodologies. 
Experience working with cross-functional teams and stakeholders at high organizational levels.

Original job Site Reliability Manager posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.