BMO

Senior Disaster Recovery Manager

Toronto, ON, CAN Full time

Application Deadline:

12/25/2025

Address:

100 King Street West

Job Family Group:

Business Management

At Bank of Montreal, we don’t just prepare for the unexpected — we lead through it.

As our Senior Disaster Recovery Manager, you’ll be at the forefront of safeguarding mission-critical infrastructure across global enterprise environments. Your expertise within disaster recovery specifically for IT infrastructure, and cloud will ensure that our systems not only withstand disruption but recover with precision and speed. The ideal candidate will have strong expertise in planning and executing disaster recovery exercises. This role ensures the organization's critical IT systems can recover rapidly and effectively in the event of an outage or disaster.

Why You’ll Thrive Here:

  • Strategic Impact: Drive the development and execution of robust disaster recovery strategies for IT Infrastructure and Cloud which protects our customers and critical business operations.

  • Cutting-Edge Technology: Work hands-on with leading cloud platforms (AWS, Azure, GCP), virtualization technologies, and advanced backup and replication technologies

  • Collaboration: Partner with cross-functional teams across IT Infrastructure, Cloud, and Security

  • Leadership & Innovation: Lead high-stakes tabletop exercises and full-scale DR drills, shaping the future of resilience in a dynamic, hybrid-cloud environment.

  • Growth & Recognition: Leverage your experience and certifications (CBCP, CDRE, AWS/Azure) to influence enterprise-wide policies, while advancing your career in a culture that values expertise and innovation.

  • Purpose-Driven Work: Be the guardian of continuity, ensuring our organization is always prepared, always protected, and always ready to recover.

Join us to build a resilient digital future — where your leadership in disaster recovery within IT infrastructure, and cloud makes a lasting difference.

****This is a HYBRID role***

There are no direct reports for this role. This role is an Individual Contributor role. Its not a Managerial or people-leadership role.

KEY Skills and Experience :

  • 4-5+ years specifically in Cloud Recovery Strategies (AWS, Azure, GCP)

  • 3-4+ years specifically in Cloud Platforms: AWS, Azure, GCP

  • 3-4+ years in Virtualization: VMware, Hyper-V

  • 8–12 years of hands-on experience in Infrastructure resilience

  • 5+ years in Backup & Replication as well as restore technologies

  • 5-8 years in creating Disaster Recovery Playbooks /runbooks, reports, and executive summaries

  • 5-8 years in leading and executing Tabletop Testing & Exercises including failover testing, and full Disaster Recovery drills

  • 8–12 years of hands-on experience in general IT Disaster Recovery planning and testing

  • Working in large enterprise environments

Additional Skills:

Frameworks & Standards: ISO 22301, NIST SP 800-34

Education and Preferred Certifications:

  • Bachelor’s degree in computer science, Information Systems, or related field.

  • CBCP (Certified Business Continuity Professional)

  • CDRE (Certified Disaster Recovery Engineer)

  • Cloud certifications (AWS, Azure, Microsoft)

MAIN responsibilities:

  • Drives the policies, procedures, and processes to manage and oversee the planning, implementation, and execution of disaster recovery strategies.

  • Provides support to assigned technology business/groups regarding Disaster Recovery (DR) Management Frameworks. This includes enhancing processes and protocols and providing subject matter expertise and guidance to mitigate risk and enhance resilience. 

  • Partner with Infrastructure, Cloud, Application, and Security teams as consultant to give them guidelines and requirements.

  • Support hybrid and cloud-native recovery guidelines.

  • Identify technology resilience gaps thought end to end tech mapping and testing, etc

  • Evaluates DR and resilience capabilities of external partners, cloud vendors (e.g., AWS, Azure), and critical SaaS providers.

  • Ensuring that before Franchise Critical Assets move to production there is DR infrastructure in place to support the asset and that the asset is scheduled for DR testing.

  • Coordinate and lead Disaster Recovery exercises, failover testing, and tabletop exercises.

  • Work closely with application, Cloud engineering, infrastructure, and business teams to validate various recovery time objectives.

  • Monitor compliance with DR policies and industry best practices.

  • Collaborate with vendors and cloud service providers for DR solutions.

  • Strong reporting and documentation skills to help develop materials on the action plans and provide status updates to senior leadership on the risks and dependencies associated with the action plans.

  • Strong stakeholder management skills for international clients primarily based out of US and UK locations including prior experience in managing clients based out of the mentioned locations.

  • Partners with Technology teams to complete application playbooks and maintain completeness in advance of testing. Works with internal stakeholders to manage the business continuity plan review, training and testing requirements.

  • Collaborate with SMEs during DR drills to validate failover and integrity checks.

  • Ensure alignment of database high availability (HA) and DR strategies.

Additional Information:

Participates in the development, implementation, and maintenance of DR Projects and Recovery Capabilities for technology-managed applications and business-managed applications identified as critical.  Works with designated groups to ensure critical processes, plans and playbooks are in place in the event of a significant business interruption.

  • Provides guidance for mid-sized to large enterprise-wide DR initiatives and event management incidents.
  • Provides strategic input into business decisions as a trusted advisor.
  • Makes recommendations to senior leaders on strategy and new initiatives,  based on an in-depth understanding of the DR and resilience activities and structures.
  • Acts as a subject matter expert on relevant regulations and policies.
  • May network with industry contacts to gain competitive insights and best practices.
  • Develops an in-depth understanding of business strategies & challenges to support effective DR resilience.
  • Leads the development, maintenance and management of DR and Recovery Process.
  • Recommends business priorities, advises on resource requirements and develops roadmap for strategic execution.
  • Manages/executes against DR projects in support of DR roadmap e.g. DR program enhancements, remediation of CAD/regulatory findings, support resilience projects that impact DR components.
  • In partnership with technology stakeholders supports enterprise DR recovery activities in the event of a business interruption incident.
  • Participates in Crisis Management and technology Incident Response Team calls, represents DR on crisis response and status conference calls.
  • Builds and manages relationships among different teams to ensure that proper event management protocols are used across various groups.
  • Acts as the prime subject matter expert for internal/external stakeholders.
  • Ensures alignment between stakeholders.
  • Defines business requirements for analytics and reporting to ensure data insights inform business decision making.
  • Develops and applies the framework for databases; oversees database management in adherence with data governance standards.
  • Designs and produces regular and ad-hoc reports, and dashboards.
  • Identifies opportunities to simplify recovery management processes and to minimize business disruptions.
  • Maintains DR Key Risk Indicators/Key Performance indicators (KRI/KPIs) for reporting purposes.
  • Ensures quarterly reporting deliverables and participates in the presentation of results and trend reporting for management review.
  • Leads change management programs of varying scope and type, including readiness assessments, planning, stakeholder management, execution, evaluation and sustainment of initiatives.
  • Builds change management plans of varying scope and type; leads or participates in a variety of change management activities including readiness assessments, planning, stakeholder management, execution, evaluation and sustainment of initiatives.
  • Leads the execution of operational programs and tool requirements; assesses and adapts as needed to ensure quality of execution.
  • Acts as the central point of contact to coordinate the logistics for implementation of projects/initiatives within DR covering Events and Recovery.
  • Leads the implementation of the DR Management Frameworks for designated businesses/groups.
  • Governs any exceptions/deferrals to planned testing.
  • Provides input into the planning and implementation of operational programs.
  • Initiates and manages the event status monitoring and reporting.
  • Coordinates the implementation and facilitation of DR solutions, including developing & maintaining playbooks, ensuring that before Franchise Critical Assets move to production there is DR infrastructure in place to support the asset and that the asset is scheduled for DR testing.
  • Partners with Technology teams to complete application playbooks and maintain completeness in advance of testing. Works with internal stakeholders to manage the business continuity plan review, training and testing requirements.
  • Maintains DR testing schedule to working with partners to confirm or amend test dates.
  • Manages annual DR tests across all Franchise Critical Assets which includes BMO hosted, Third-Party and Cloud Assets, identified as critical.
  • Validates outcomes of testing to confirm the continuity and data integrity of technology assets identified as critical.
  • Facilitates exception process should DR testing not be completed for critical assets.
  • Documents DR testing outcomes and post-mortem reviews.
  • Ensures communication between business units and the business/group event management team in support of crisis management activities; participates in the Major Incident Management Team (MIRT) calls/activities
  • Participates in DR related projects, audits and examinations as appropriate.
  • Keeps abreast and ensures compliance to Disaster Recovery industry standards, best practices, regulatory trends, and regulatory guidelines (e.g. FFIEC).
  • Operates at a group/enterprise-wide level and serves as a specialist resource to senior leaders and stakeholders.
  • Applies expertise and thinks creatively to address unique or ambiguous situations and to find solutions to problems that can be complex and non-routine.
  • Implements changes in response to shifting trends.
  • Broader work or accountabilities may be assigned as needed.

Qualifications:

  • Typically 7+ years of relevant experience and post-secondary degree in related field of study or an equivalent combination of education and experience.
  • Industry certification in Business Continuity, Disaster Recovery, Resilience or Crisis Management is considered an asset.
  • Seasoned professional with a combination of education, experience and industry knowledge.
  • Verbal & written communication skills - In-depth / Expert.
  • Analytical and problem solving skills - In-depth / Expert.
  • Influence skills - In-depth / Expert.
  • Collaboration & team skills; with a focus on cross-group collaboration - In-depth / Expert.
  • Able to manage ambiguity.

Data driven decision making - In-depth / Expert.

Salary:

$86,000.00 - $160,000.00

Pay Type:

Salaried

The above represents BMO Financial Group’s pay range and type.

Salaries will vary based on factors such as location, skills, experience, education, and qualifications for the role, and may include a commission structure. Salaries for part-time roles will be pro-rated based on number of hours regularly worked. For commission roles, the salary listed above represents BMO Financial Group’s expected target for the first year in this position.

BMO Financial Group’s total compensation package will vary based on the pay type of the position and may include performance-based incentives, discretionary bonuses, as well as other perks and rewards. BMO also offers health insurance, tuition reimbursement, accident and life insurance, and retirement savings plans. To view more details of our benefits, please visit: https://jobs.bmo.com/global/en/Total-Rewards

About Us

At BMO we are driven by a shared Purpose: Boldly Grow the Good in business and life. It calls on us to create lasting, positive change for our customers, our communities and our people. By working together, innovating and pushing boundaries, we transform lives and businesses, and power economic growth around the world.

As a member of the BMO team you are valued, respected and heard, and you have more ways to grow and make an impact. We strive to help you make an impact from day one – for yourself and our customers. We’ll support you with the tools and resources you need to reach new milestones, as you help our customers reach theirs. From in-depth training and coaching, to manager support and network-building opportunities, we’ll help you gain valuable experience, and broaden your skillset.

To find out more visit us at https://jobs.bmo.com/ca/en.

BMO is committed to an inclusive, equitable and accessible workplace. By learning from each other’s differences, we gain strength through our people and our perspectives. Accommodations are available on request for candidates taking part in all aspects of the selection process. To request accommodation, please contact your recruiter.

Note to Recruiters: BMO does not accept unsolicited resumes from any source other than directly from a candidate. Any unsolicited resumes sent to BMO, directly or indirectly, will be considered BMO property. BMO will not pay a fee for any placement resulting from the receipt of an unsolicited resume. A recruiting agency must first have a valid, written and fully executed agency agreement contract for service to submit resumes.