Job Description - System Reliability Engineer / T2 Support Engineer
About the Role
We are looking for an engineer who enjoys understanding how systems behave in production, not just writing features. This role is responsible for maintaining the reliability, stability, and smooth functioning of our live platform running on Google Cloud.
You will act as the first technical owner of production systems — monitoring services, investigating alerts, resolving issues, and performing controlled configuration and operational changes. This role works closely with backend developers, QA, and infrastructure teams to prevent incidents and reduce downtime.
This is not a call-center support role and not a pure development role — it is a hands-on technical position focused on debugging, incident handling, and system operations.
Tech Stack
Google Cloud Platform (Compute, Logging, Monitoring)
Practical experience with MongoDB (indexes, connections, slow queries)
Understanding of Kafka concepts (consumer, offset, lag, partitions)
Basic Redis knowledge (caching behavior, TTL)
Cloud & Tools
Hands-on experience with any cloud platform (GCP preferred / AWS acceptable)
Experience using monitoring tools (GCP Monitoring, Prometheus, Grafana, ELK, or similar)
Understanding of REST APIs and HTTP status codes
What We Expect From You
Ability to investigate problems logically rather than randomly restarting services
Comfort working with live production systems
Willingness to participate in on-call support
Strong ownership mindset and attention to detail
Good communication during incidents
Good to Have
Experience in e-commerce, fintech, logistics, or high-traffic systems
Exposure to CI/CD pipelines and deployments
Basic scripting (Shell or Python)
Experience writing RCA documents
Experience
3 to 6 years of relevant experience in production support, application support, SRE, DevOps operations, or similar roles.
Why Join Us
Direct exposure to real distributed systems
Hands-on production debugging experience
Opportunity to learn system architecture deeply
Close interaction with development and platform teams
Important Note
This role involves handling live production systems and occasional on-call responsibilities. Candidates interested only in feature development or pure infrastructure automation may not find this role suitable.