Number of Applicants
:000+
Let AI Supercharge Your Job Hunt!
JobCopilot scans 500,000+ company career sites daily to find jobs for you
We are seeking a highly skilled Network Reliability Engineer (NRE) with strong hands-on experience across hybrid and cloud network environments, including on‑premises data centers and AWS/Azure cloud platforms. The role focuses on reliability, availability, scalability, automation, and observability of network and security platforms using NRE/SRE principles, CI/CD practices, and deep API-driven integrations. Also has performed Network domain automation using Ansible Automation Platform
• Own end‑to‑end reliability and performance of hybrid and cloud-connected network services.
• Apply Network Reliability Engineering(NRE) principles to reduce operational toil and improve service resilience.
• Design, implement, and continuously improve highly available hybrid and cloud network architectures.
• Perform deep technical troubleshooting and root cause analysis for complex network and security incidents.
• Build automation and self‑healing workflows supporting Day‑1, Day‑2, and Day‑N operations.
• Integrate infrastructure changes into CI/CD pipelines with proper testing, validation, and rollback.
• Define, monitor, and improve SLIs, SLOs, and error budgets for network services.
• Partner with cloud, security, application, and platform teams to meet business KPIs and KRAs.
Ansible :
· Develop, maintain, and optimize Ansible playbooks, roles, and inventories to automate network device configuration, deployment, and operational activities across multi-vendor environments.
· Automate routine network tasks such as configuration management, backups, compliance checks, firmware upgrades, and provisioning to reduce manual effort and improve reliability.
· Translate network SOPs, runbooks, and BAU activities into reusable Ansible-based automation workflows(Infrastructure-as-Code) for scalable and consistent execution.
· Implement event-driven and self-healing automation by integrating Ansible playbooks with monitoring systems, APIs, and ITSM tools to enable automated remediation and faster incident resolution.
· Ensure network consistency and compliance by using Ansible to validate device state, detect configuration drift, and enforce standardized configurations across network infrastructure.
• Hands‑on experience supporting Cisco ACI fabrics integrated with hybrid connectivity models.
• Strong working knowledge of Cisco IOS‑XE and Cisco NX‑OS in enterprise and data center environments.
• Experience managing Cisco Wireless LAN Controllers (WLC).
• Practical experience with AWS networking including VPC design, routing, security groups, NACLs, Transit Gateway, VPN, and Direct Connect.
• Practical experience with Azure networking including VNets, UDRs, NSGs, VPN Gateway, ExpressRoute, and Azure Firewall.
• Understanding of hybrid connectivity patterns such as internet, MPLS, site‑to‑site VPN, client VPN, and secure cloud access.
• Hands‑on experience with enterprise firewalls: Check Point, Palo Alto, and Cisco Firepower (FTD).
• Experience using Algo Sec for firewall policy lifecycle management and automation across hybrid and cloud environments.
• Strong expertise with proxy and secure web gateway solutions including Bluecoat ProxySG (SGOS), BMC, and CAS.
• Cloud‑delivered security experience with Zscaler ZIA, ZPA, and ZTNA supporting hybrid and remote access models.
• Hands‑on experience with F5 VELOSchassis.
• Strong expertise in F5 LTM, GTM, and APM modules.
• Experience designing resilient application delivery for on‑prem and cloud workloads.
• Hands‑on experience with Datadog for monitoring hybrid and cloud network services.
• Experience with Cisco DNA Center (DNAC)for assurance and network automation.
• Experience with Forescout for network visibility, device profiling, and compliance.
• Strong experience building dashboards and alerts using Prometheus and Grafana.
• Strong experience with Ansible for network and security automation.
• Hands‑on experience integrating GitHub for version control, peer reviews, and change traceability.
• Experience building CI/CD pipelines for infrastructure and network changes.
• Experience using REST APIs, SDKs, CLI tools, and GUIs to integrate network, security, and cloud platforms.
• Experience developing tools, services, or dashboards using Python and Django.
• Ability to implement Infrastructure as Code (IaC) and automated validation workflows.
• Strong reliability engineering mindset with focus on automation and scalability.
• Proven experience operating production hybrid and cloud network environments.
• Excellent troubleshooting and analytical abilities.
• Ability to translate business KRAs into measurable technical reliability objectives.
• Strong documentation, collaboration, and communication skills.
• Bachelor’s degree in Computer Science, Information Technology, or equivalent experience.
• 7–12 years of experience across Network Engineering, Network Security, Cloud Networking, or NRE/SRE roles.
• Relevant certifications (CCNP/CCIE,AWS/Azure Networking, PCNSE, CCSA/CCSE, F5) are a plus.
This role includes on‑call rotations, participation in major incident response, CI/CD-driven change management, and collaboration with global teams supporting enterprise hybrid and cloud network platforms.
Auto-Apply to Similar Jobs with your AI JobCopilot
Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.