We are seeking an experienced Production Support Engineer to manage and support critical production environments, ensuring high availability, performance, and reliability of applications. This role combines strong technical expertise with hands-on experience in L1 and L2 production support, along with working knowledge of cloud platforms and backend technologies.
You will be responsible for monitoring live systems, troubleshooting incidents, and ensuring minimal downtime. The role also involves collaborating with development, DevOps, and infrastructure teams to identify root causes, implement fixes, and continuously improve system stability. A strong understanding of AWS environments and programming experience in Java, Python, or Node.js will be valuable in diagnosing and resolving complex issues efficiently.
Key Responsibilities
Provide L1 and L2 production support for applications, ensuring timely resolution of incidents and service requests
Monitor production systems, applications, and infrastructure to proactively identify and address issues
Perform root cause analysis (RCA) for production incidents and implement preventive measures
Troubleshoot application, API, and infrastructure-related issues across environments
Work closely with development and DevOps teams to deploy fixes and enhancements
Support and maintain applications hosted on AWS, ensuring optimal performance and scalability
Manage incident tickets, track SLAs, and ensure adherence to support processes
Automate repetitive support tasks to improve efficiency and reduce manual intervention
Participate in release activities, deployment support, and post-release validations
Maintain system documentation, runbooks, and knowledge base articles
Collaborate with cross-functional teams to improve monitoring, alerting, and logging mechanisms
Ensure compliance with operational standards and best practices
What Makes You a Great Fit
5+ years of experience in production support with strong exposure to L1 and L2 support models
Solid understanding of incident management, problem management, and SLA-driven environments
Hands-on experience with AWS services and cloud-based application support
Working knowledge of programming languages such as Java, Python, or Node.js for debugging and issue resolution
Experience with monitoring tools, logging frameworks, and alerting systems
Strong troubleshooting skills across application, database, and infrastructure layers
Familiarity with CI/CD pipelines and deployment processes
Ability to perform root cause analysis and implement long-term fixes
Excellent communication skills and the ability to work in a fast-paced, high-pressure environment
Proactive mindset with a focus on continuous improvement and automation