We are looking for a BCDR Resilience Specialist to support the assessment, design, and assurance of business continuity and disaster recovery capabilities for solution platforms, including AI-enabled services.
The role ensures that solutions demonstrate resilience under failure scenarios and meet governance, audit, and regulatory expectations prior to production deployment.
Assess and provide assurance that availability and resilience requirements are met across solution platforms.
Validate RTO/RPO definitions and ensure alignment with business criticality and service tiering.
Support solution teams in defining and validating appropriate recovery strategies.
Identify and address single points of failure (SPOFs) across:
application and platform components
AI agents and orchestration layers
data dependencies
third-party services
Assess architectural resilience and recommend mitigation strategies to improve fault tolerance and recovery.
Review BC/DR plans and operational runbooks and ensure they are fit for purpose.
Validate recovery procedures, including AI-specific failure modes, model dependencies, and recovery actions.
Support and participate in resilience testing, tabletop exercises, and recovery simulations.
Provide credible resilience evidence to support:
governance approvals
audit readiness
regulatory requirements
Ensure resilience controls and recovery capabilities are documented, traceable, and defensible prior to production release.
Proven experience in BCDR, resilience engineering, or service continuity roles
Strong understanding of:
business impact analysis (BIA)
RTO/RPO definitions
high availability and disaster recovery strategies
Experience validating BC/DR readiness for production environments
Ability to assess end-to-end solution architectures, including:
application and platform layers
data flows and dependencies
third-party integrations
Understanding of modern, distributed, and cloud-native architectures
Awareness of AI-specific failure modes, such as:
model availability and dependency failures
orchestration and pipeline failures
data drift or data dependency issues
Ability to assess recovery strategies for AI-enabled components
Experience producing decision-ready documentation for governance, audit, and regulatory stakeholders
Strong communication skills, with the ability to articulate resilience risks and recovery capabilities clearly
Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.