Job Title
Manager, Site Reliability Engineering
Job Description
Job Summary:
We are seeking a highly skilled and technically hands-on Principal Engineer / Technical Lead to architect, implement, and scale our Cloud Platform, DevOps pipelines, and Infrastructure Quality Engineering initiatives. This role is ideal for a senior technologist with deep expertise in cloud infrastructure engineering, CI/CD automation, Kubernetes orchestration, and infrastructure-as-code. You will play a critical role in designing and building scalable, secure, and highly available cloud-native systems, while directly contributing to the implementation and operationalization of DevOps best practices.
Primary responsibilities:
Implement cloud infrastructure solutions (AWS/GCP) with a focus on reliability, scalability, and cost-efficiency.
Design and maintain robust CI/CD pipelines using Jenkins, Groovy, and Squadron, enabling streamlined deployments and faster release cycles.
Apply Infrastructure as Code (IaC) principles using Terraform, Ansible, and GitOps workflows.
Manage and optimize Kubernetes clusters, including upgrades, autoscaling, liveness/readiness probes, and namespace policies.
Drive the adoption of DevOps and SRE principles: observability, monitoring, alerting, and automated recovery.
Lead technical deep-dives and collaborate with development and QA teams to ensure platform readiness, secure infrastructure, and fast feedback loops.
Identify opportunities to automate, simplify, and strengthen deployment and infrastructure operations.
Contribute to architectural reviews, RFCs, and cloud-native modernization efforts across teams.
Act as the technical authority on DevOps and platform engineering decisions, enabling high-impact delivery.
Required Skills:
Mandatory Skills:
12+ years of deep technical experience in cloud infrastructure, DevOps, or platform engineering roles.
Proven hands-on expertise in Jenkins pipeline development (Groovy) and CI/CD systems.
Extensive experience with Terraform, Ansible, and managing infrastructure as code at scale.
Strong Kubernetes experience, including managing clusters, Helm, and autoscaling strategies.
Solid understanding of AWS or GCP services, including compute, storage, networking, and IAM.
Experience implementing observability stacks (e.g., Prometheus, Grafana, ELK, Datadog).
Familiarity with container security, deployment strategies (blue/green, canary), and cost optimization.
Ability to influence cross-functional teams through technical leadership—not just through hierarchy.
Bonus: Exposure to Infra QA, automated testing pipelines, or AI/LLM-based automation.
2+ years of experience leading technical teams.
Good to Have:
Prior QA knowledge.
What will you learn on this job?
You will have the opportunity to lead and shape the future of infrastructure in a rapidly scaling environment, working with a cutting-edge technology stack and an automation-first mindset.
Designation: Manager
Working Hours: 1:00 PM to 10:00 PM IST.
Work Location: Ecoworld, Bengaluru
It is the policy of People Inc. to provide equal employment opportunity (EEO) to all persons regardless of age, color, national origin, citizenship status, physical or mental disability, race, religion, creed, gender, sex, sexual orientation, gender identity and/or expression, genetic information, marital status, status with regard to public assistance, veteran status, or any other characteristic protected by federal, state or local law. In addition, the Company will provide reasonable accommodations for qualified individuals with disabilities.