Job Description - Site Reliability Engineer (Cloud Network Automation)
Location: Singapore
Industry: Cloud Networking, Enterprise Connectivity, IoT and Smart Technology Solutions
About the Role
We are seeking a Senior Site Reliability Engineer to support and optimize our client's cloud-managed networking and connectivity platform serving SMB and enterprise customers across the region. The successful candidate will be responsible for ensuring the reliability, scalability, security, and operational excellence of cloud-based networking services, including wireless networking, switching, routing, SD-WAN, surveillance management platforms, and cloud-native applications.
Key Responsibilities
Cloud Platform Operations
Manage and maintain cloud-based networking platforms deployed across AWS, Azure, GCP, and private cloud environments.
Ensure high availability, performance, and reliability of cloud-managed networking services.
Monitor and optimize system health, resource utilization, latency, and customer service performance.
Site Reliability Engineering
Define and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs).
Develop proactive monitoring, alerting, and observability solutions.
Conduct root cause analysis and drive permanent corrective actions following incidents.
Participate in an on-call rotation schedule and support critical incident management when required.
Cloud Infrastructure & Automation
Design and implement automation solutions using Python, Bash, or Go.
Improve deployment pipelines and operational processes through Infrastructure-as-Code and CI/CD methodologies.
Support Kubernetes-based containerized applications and cloud-native services.
Customer Platform Support
Work closely with enterprise and SMB customers to troubleshoot cloud platform issues.
Support large-scale deployments involving cloud-managed Wi-Fi, enterprise switching, SD-WAN, cloud security, video surveillance platforms, and IoT device management.
Security & Compliance
Ensure platform compliance with security standards and best practices.
Support IAM, network security, vulnerability management, and cloud security initiatives.
Participate in disaster recovery planning and business continuity testing.
Requirements
Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field from a recognized university or institution.
Proven experience in Site Reliability Engineering, Cloud Operations, Platform Engineering, or DevOps, preferably in enterprise cloud or networking environments.
Hands-on experience with: Kubernetes; Docker; Linux administration; AWS, Azure, or GCP; CI/CD pipelines; infrastructure automation; and networking.
Experience supporting enterprise networking, managed services, cloud platforms, telecommunications, or related technology environments would be advantageous.
How to Apply
If you're enthusiastic about this role, send us your resume and a note sharing your relevant experience to [email protected].