Zerto Platform Architecture Guide: Backup, Disaster Recovery, and Data Mobility
Zerto Platform Architecture Guide: Backup, Disaster Recovery, and Data Mobility
Architecture Guide
Backup, Disaster Recovery, and
Data Mobility
Version 3
Abstract
The purpose of this document is to provide an architecture guide for anyone designing or implementing an all-
encompassing IT resilience solution with Zerto utilizing an on-premises production environment.
It includes a full overview of all design principles, architecture considerations and sizing, as well as an installation
overview for anyone looking to, or already utilizing Zerto. In this guide, we also look at common challenges and uses
cases that are solved using the Zerto Platform.
The architectures in this guide show only a glimpse of what the Zerto Platform can support. Having a true software-
only scale-out architecture and a technology agnostic approach allows you to protect and mobilize your applications
in a mix-and-match nature.
Disaster Recovery
Organizations of all sizes that have a virtualized environment use replication for disaster recovery because the impact
of not being able to recover successfully and quickly from a disaster can be catastrophic and create systemic risk to
the business. Different enterprise-class disaster recovery technologies have been available since the mass adoption
of virtualization, but they were typically designed to protect physical servers using storage-based replication and not
virtual machines (VMs). This adds significant complexity because the replication is configured on a disk/LUN basis,
requiring matching storage and LUN configurations. There is no VM-level granularity and no integration into the
virtualization platform. In addition, separate complex software for VM orchestration and automation is required which
involves multiple skill sets, resources and does not fully align to the benefits of virtualization.
Zerto is built from the ground up to be the simplest, most powerful disaster recovery solution for virtualized
infrastructures. By including all the replication, recovery orchestration, and automation in one simple software
platform, users can recover one, all, or a subset of virtualized applications, from anywhere to anywhere, maximizing
the benefits of virtualization and cloud.
Through native integration into all supported platforms, Zerto not only allows replication and recovery between any
storage, but it also protects across and between multiple hypervisors and public cloud platforms. This market-leading
technology delivers a best of breed business continuity/disaster recovery (BC/DR) solution irrespective of underlying
hypervisor, public cloud or storage.
Backup
Backup has been an essential part of IT infrastructure since inception, and it is unlikely that will ever change. But
with the IT landscape rapidly changing and threats increasing, are we still able to rely on the backup technology we
currently use? Organizations today cannot afford to sustain any data loss or downtime. To avoid the impact of data,
productivity, and revenue loss, organizations need more granularity in recovery, while maintaining the same level of
performance.
To ensure granularity without impacting production performance, the future of backup requires the change from
periodic or scheduled backups to a continuous backup. By using Zerto’s continuous data replication, you can deliver
recovery point objectives (RPOs) of seconds by replicating every change that is being generated in near real-time. All
these replicated changes are then stored in a journal which allows you to not only recover to the latest point in time
but also offers you a granularity of seconds. The outcome of this is the ability to safely rewind to any point in the
past, even up to 30 days ago. Recover files, applications, VMs, or even entire datacenters by merely pressing a virtual
“rewind” button.
Most recovery use cases that require granular recoveries—such as file deletions, database corruption, or
ransomware—only require short-term retention and a 30-day journal supports those requirements. Many
organizations have compliance requirements to store data for longer retention periods going back years. Long-term
retention data has different requirements as it relates to storage and recovery times but needs to be an integral part
of your data protection platform and strategy. As with short-term backups, copies should not come directly from
production systems as this impacts performance and often disrupts user experiences.
• A Zerto Virtual Manager (ZVM): This is a Windows service that hosts the UI and integrates with the native APIs of
Azure/AWS for management and orchestration.
• A Virtual Replication Appliance (VRA): This is a Windows service that performs the replication of data itself from or
to Azure/AWS.
The ZCA integrates natively with the platform it is deployed on, allowing you to utilize S3 buckets for journal storage in
AWS or blob storage within a Storage Account on Microsoft Azure. This ensures the most cost-efficient deployment on
each of these platforms.
Journal
In addition to VPGs, Zerto’s continuous data protection (CDP) stores all replicated data in the journal. The journal stores
all changes for a user-defined period, up to 30 days, and allows you to recover to any point in time within the journal,
ensuring your recovery point objective (RPO) is always as low as possible. Every write to a protected virtual machine
is copied by Zerto. These writes are replicated locally and/or remotely and written to a journal managed by a Virtual
Replication Appliance (VRA). Each protected virtual machine has its own journal. In addition to the writes, every few
seconds all journals within the VPG are updated with a checkpoint timestamp. Checkpoints are used to ensure write
order fidelity and crash-consistency. Recovery can be performed to the last checkpoint or a user-selected checkpoint.
This enables recovering files, VMs, applications, or entire sites, either to the previous crash-consistent point-in-time or,
for example, when the virtual machine is attacked by a virus or ransomware, to a point-in-time before the attack.
Zerto supports the use of disk, object, and cloud storage; for a full list of supported repositories and their versions,
please see our Interoperability Matrix.
Use Cases
All three example configurations support the following use cases. Where there are unique differences, they will be
highlighted under the relevant architectures.
Any disruption on the production site, whether it’s power, network or otherwise, is
Outages & protected with recovery of your files and folders, VMs, applications or site within just a few
Disruptions minutes to a point in time just seconds before the issue occurred. Example: Recovery of your
whole site within minutes after a power outage.
Recovery from ransomware attacks can be from just seconds before encryption occurred,
Ransomware minimizing data loss and business impact. You can recover your files and folders, VMs,
Attacks applications or entire sites. Example: Recover encrypted files from seconds before they were
encrypted.
This same architecture can be used to move your workloads from an end of life platform
Infrastructure to your new infrastructure in just minutes, significantly speeding up infrastructure
Modernization modernization projects. Example: Move your workloads to a new platform in just minutes
with no data loss.
Where multiple sites are to be consolidated or migrated to the same target, this architecture
Consolidations can be used to streamline the process. This enables pre-migration testing and live migration
& Migrations times of just minutes. Example: Consolidate workloads from diverse hardware, hypervisors
and cloud platforms to meet business standards in minutes.
Allows the creation of replicas, at the remote site, of your production environment from any
Testing & point in time in just minutes. This provides greater flexibility for your development teams
DevOps and reduces overhead on DevOps teams, as well as enabling DR testing and validation.
Example: Create exact replicas of production applications from seconds ago in just minutes
for UAT purposes.
A single SaaS based analytics platform providing complete data analysis across all your
Analytics sites, both on-premises and in the cloud. This provides a single view that simplifies
Across Clouds management and monitoring without added cost. Example: Identify bandwidth bottlenecks
across your entire IT infrastructure through a single portal.
The disaster reference architecture depicted in figure 1 shows a set of proven practices for setting up the platform
where a single remote target is being used as the recovery site.
Description
Protected VMs are grouped in VPGs with consistency across all the VMs within each VPG. A remote journal is configured
on the remote target side and used for short-term recovery scenarios where recovery granularity of just seconds can be
achieved. The recommend journal history period for this journal is 8 days as this will cover most recovery scenarios. All
changes on the protected VMs are then kept for 8 days before being promoted to the remote replica disk(s).
The ability to recover files and folders, VMs, applications or site locally in the event of a
Outages &
Disruptions
logical failure is included in this configuration. Example: If just one application has an issue,
recover just this application locally, rather than remotely, to seconds before the issue.
The ability to recover files and folders, VMs, applications or site locally in the event of a
Ransomware
Recovery
ransomware attack is included in this configuration. Example: Recover only impacted Files,
VMs or Applications locally, rather than remotely, to seconds before the issue.
The ability to test recovery or create replicas for DevOps purposes locally is included in this
Test/ DevOps configuration. Example: Create a replica of a production workload locally, for development
purposes.
Zerto’s continuous data protection, with always-on replication and journaling technology,
Converged DR delivers on true convergence of DR and backup providing complete data protection for both
and backup short-term and long-term recovery. Example: One product providing DR and backup with
granularity of seconds and no traditional production impact.
Description
In this configuration, the same VMs exist in two VPGs. The first VPG is for the creation of the journal on the source,
and the second VPG is created to provide a journal capability on the remote public cloud target. This cloud journal is
placed on blob storage in Azure or an S3 bucket in AWS and is configured on the cloud side. This reduces cost footprint
by having only storage costs incurred and the compute requirements only being spun up in a recovery scenario. The
local journal is configured on the source site and used for backup scenarios where a logical failure occurs, providing
recovery granularity of just seconds. The recommended journal history period for this journal is 14 days as this will
cover most logical recovery scenarios and with the deduplication capabilities of modern storage arrays will consume
minimal storage space. A daily retention process will archive points-in-time from the local journal to the long-term
retention repository for compliance needs. With this architecture, it is recommended that the remote journal history
period is 3 days, as this will cover most recovery scenarios where a physical failure has occurred. All changes on the
protected VMs are then kept for 3 day before being promoted to the remote replica which resides on cloud storage.
Cloud adoption, and the challenges associated with it, can be simplified with this
Cloud architecture to move workloads to your chosen cloud platform in just minutes with zero
Integration &
data loss. In this use case, long-term retention is likely not needed during the migration.
Migration
Example: Move complex applications to the cloud in just 3 steps.
With the increasing adoption of hybrid and multi-cloud strategies, this architecture
Multi-Cloud
provides freedom to move workloads around on-demand as requirements change.
Hybrid Cloud
Example: Move workloads to, from and across cloud platforms to gain maximum efficiency.
Zerto Analytics
About Zerto
Zerto helps customers accelerate IT transformation by eliminating the risk and complexity of modernization and cloud
adoption. By replacing multiple legacy solutions with a single IT Resilience Platform, Zerto is changing the way disaster
recovery, data protection and cloud are managed. With enterprise scale, Zerto’s software platform delivers continuous
availability for an always-on customer experience while simplifying workload mobility to protect, recover and move
IS-14437