The New Face Of Disaster Recovery In A Cloud-Enabled Environment
The advancement of Cloud technologies and cloud-native design patterns has significantly altered the Information Technology (IT) landscape, however one would argue that no area of IT has been more influenced by Cloud than Disaster Recovery (DR) and the emergence of DRaaS (disaster recovery as a service).
Traditionally speaking, legacy DR has entailed significant capital investment in often-times underutilized technology assets and facilities. This activity typically involving technology engineers planning and building production equivalent (or resembling) environments in a geographically diverse location with various forms of data synchronization and circuitry linking the two (or in some instances multiple sites beyond two (2)).
“Traditionally speaking, legacy DR has entailed significant capital investment in oftentimes underutilized technology assets and facilities”
So, in what ways has Cloud influenced DR? Provided below we will dive into this in more detail by reviewing it through the prism of the three (3) primary public cloud providers, those being Amazon Web Services (AWS), Microsoft Azure, and Google Cloud Platform (GCP).
For those companies willing to make the plunge into public cloud adoption (specifically AWS, Azure, and/or GCP (Google Cloud)) there are a number of CSP (cloud service provider) specific services available to help improve an organization’s recovery posture while doing so in a more cost-effectivemanner.
• AWS –Back in January of 2019 AWS announced the acquisition of the Israeli start-up CloudEndure. Prior to its acquisition, CloudEndure was an AWS Advanced Technology Partnerwith both disaster recovery and backup-as-a-service capabilities for AWS, Azure and GCP clouds alike. AWS has since integrated CloudEndure as part of its core platform services thereby enabling customers who are either migrating to AWS, leveraging AWS in a hybrid cloud model, and/or supporting a multi-region cloud estate to have access to a managed service for backup and recoveryneeds. CloudEndure enables this using secureprotocols for data transport and ongoing synchronization.
The CloudEndure service also offers means by which to keep traditional cold site (or in this case region) costs down by not requiring parity between the two from a compute perspective. In essence you pay for the data storage in your secondary site and should the time come to leverage the site/region, automated runbooks created within the CloudEndure platform can be used to spin-up the required EC2 compute to support the fail-over effort(s).
• Azure –Microsoft customers interested in and/or currently leveraging Azure have use of the ASR (Azure Site Recovery) managed service which was created in-house at Microsoft but really began to take form after the June 2014 acquisition and later integration of InMage. InMage was a US/ India based company specializing in the continuous data protection space through their Scout service. Similar to CloudEndure, ASR checks all the requisite boxes for a complete backup and recovery-as-a-service solution chief among those being:
o Heterogeneous Workload Support for various different Windows and Linux editions
o Automated Discovery of VMware vCenter Server managed virtual machines for replication and recovery
o Continuous Data Protection with software-based replication to support stringent Recovery Point Objectives (RPO)
o On the Fly Conversion of source Virtual Machine Disk (VMDK) files to bootable target Virtual Hard Disk (VHD) files
o Active-Passive Replication that does not require running target virtual machines at the time of replication
o Single-Click Failovers with Recovery Plans to provide workload-aware disaster recovery and orchestration
o Health Monitoring for replication, failover and failback with events and notifications
NOTE:Azure Migrate is a similarly-aligned service to ASR, however it is focused exclusively as a cloud migration tool and is not intended for ongoing recovery and backup.
• GCP –Google unlike AWS and Azure, does not provide a formal DRaaS offering native to the platform. However,Google does providecomprehensive documentation on how to approach DR in GCPby explaining the key building blocks needed (e.g. use of GCP’s Deployment Manager, use of Terraform/ Ansible/Chef for automation,as well as recommendations related to 3rd party partners).
Multi-Cloud Approach (Considerations for leveraging 3rd party migration/recovery toolsets)
For more complex hybrid (public/ private) and/or multi cloud (use of multiple CSPs) use cases, there are a number of 3rd party solutions in the DR-as-a-S space.
Each of the above mentioned 3rd party solutions behave similarly to the CSP specific solutions we touched upon earlier albeit with the option to leverage not only to a specific cloud (e.g. Azure) across cloud providers (e.g. AWS, GCP). This allows for a level of flexibility for those who may need to move certain on-premise workloads to a given CSP based on supportability and/or function.
Hybrid-Cloud Approach (For workloads that support cloud but entail on-premise dependencies)
For organizations that host workloads that support public cloud but however be it for compliance and/or physical infrastructure requirement dependencies cannot persist entirely in cloud. There are a number of solutions to help solve for this situation based upon your hardware support needs.
In effect what these solutions provide is a means to administer their systems on-premise and in the cloud from a single management plane. As well as provide the ability to shift workloads hosted within them as needed between on-premise and cloud without fear of having to re-platform/re-engineer in order to do so.
In conclusion the overall maturity for back-up and recovery in the cloud has grown exponentially and has to a large extent become table stakes where it is now becoming difficult to judge one solution from the other unless there is an underlying infrastructure platform dependency that requires it.Cloud in general provides a great option for infrastructure and operations leaders who want to cost-effectively improve IT resilience while being mindful of potential resource deficiencies.
By Leni Kaufman, VP & CIO, Newport News Shipbuilding
By George Evans, CIO, Singing River Health System
By John Kamin, EVP and CIO, Old National Bancorp
By Elliot Garbus, VP-IoT Solutions Group & GM-Automotive...
By Gregory Morrison, SVP & CIO, Cox Enterprises
By Alberto Ruocco, CIO, American Electric Power
By Sam Lamonica, CIO & VP Information Systems, Rosendin...
By Sergey Cherkasov, CIO, PhosAgro
By Pascal Becotte, MD-Global Supply Chain Practice for the...
By Stephen Caulfield, Executive Director, Global Field...
By Shamim Mohammad, SVP & CIO, CarMax
By Ronald Seymore, Managing Director, Enterprise Performance...
By Brad Bodell, SVP and CIO, CNO Financial Group, Inc.
By Jim Whitehurst, CEO, Red Hat
By Clark Golestani, EVP and CIO, Merck
By Scott Craig, Vice President of Product Marketing, Lexmark...
By Dave Kipe, SVP, Global Operations, Scholastic Inc.
By Meerah Rajavel, CIO, Forcepoint
By Amit Bahree, Executive, Global Technology and Innovation,...
By Greg Tacchetti, CIO, State Auto Insurance