Software: Operations & Reliability Lead

Job Type: Full Time

Job Description - Software: Operations & Reliability Lead


Role Overview
We’re looking for an experienced Operations & Reliability Lead to strengthen our monitoring, security, automation, and cloud operations. This role drives reliability, resilience, and a security‑first posture across all systems and environments.
What You’ll Do
  • Build and maintain application and infrastructure monitoring, dashboards, and automated alerts.
  • Implement cloud and on‑premises resource provisioning and enforce standardized configuration baselines.
  • Manage backup, recovery, and resilience workflows with regular testing cycles.
  • Conduct AI‑assisted performance testing, security audits, and penetration testing.
  • Coordinate with NOC and SOC to support continuous monitoring and threat detection.
  • Lead incident response, root‑cause analysis, and operational readiness activities.
  • Implement cost optimization and resource governance across cloud environments.
  • Automate operational tasks and integrate AI‑Ops capabilities.
What You Bring
  • Strong experience with monitoring tools (New Relic, Datadog, Prometheus, Azure Monitor, etc.).
  • Hands‑on expertise with cloud platforms, IaC, CI/CD, and configuration management.
  • Solid understanding of security frameworks, threat detection, and compliance.
  • Experience with backup/DR strategies and resilience best practices.
  • Strong troubleshooting, documentation, and cross‑team collaboration skills.
Valuable Extras
  • Cloud or security certifications (Azure/AWS Architect, Security+, CISSP, ITIL, SRE).
  • Experience with AI‑Ops platforms or ML‑based operational tooling.
  • Background in regulated industries.
Education & Experience
  • Bachelor's degree in Computer Science or related field.
  • At least 2 years of experience working with systems.
Original job "Software: Operations & Reliability Lead" posted on GrabJobs ©.
Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.