SS DesignWorkshop
SS DesignWorkshop
Design Workshop
Scope
Technology
People & Process
2
Agenda Operations for Hybrid Cloud
<Process Type Name>
<Process Type Name>
<Process Type Name>
3
Introductions and Workshop Objectives
Introductions
• Your Role
• Engagement responsibilities
• Expectations of the workshop
4
Solution Scope
The table below lists the scope of the engagement in the context of the VMware journey model. These are
the IT Capabilities that have been determined as the focus for this engagement
IT Capabilities in Scope
Automatically recover from hardware failures
Abstract and pool compute and storage resources
5
Solution Scope
The table below lists the scope of the engagement in the context of the VMware journey model. These are
the IT Problems that have been determined as the focus for this engagement
IT Problems in Scope
High CAPEX for dedicated infrastructure
Single point of failure
Long wait time for hardware purchases
Unexpected infrastructure outages
Performance bottlenecks
Not enough data center resources or space
6
VMware Solution
PSO Consultant: Insert
appropriate diagrams based on the
current solution.
7
VMware Solution Conceptual Design
Management Automation Consumption Interface Consumption Interface
End User Access Security
Applications
Client Data
Financial Scaling GUI API CLI SaaS 3rd Platform
Server Availability
Compliance Analytics
Resource Catalogs Active Workloads Active Workloads
Data On Premesis Cloud
Governance Data Services Isolation
Applications Virtual Virtual
Containers Containers
Machines Machines
Infrastructure
Service Level Reclamation
Abstraction, Pooling and Tenancy
Development Threat
Event
lifecycle Compute Network Storage Compute Network Storage Compute Network Storage Containment
Application
Capacity
provisioning
Physical Resources
Infrastructure Compute Storage Network Data
Performance
Provisioning Encryption
8
IT Value Model
9
1
0
Digital Workspace Journey Model
10
VMware Solution
Logical Diagram
11
Technology Scope
This specific engagement by VMware Professional Services included the following components of the VMware
Solution. This Solution Design content will only refer to these components.
12
VMware vSphere 7.0.x
14
Logical Architecture
vSphere 7.0
15
vSphere
Logical Architecture Overview
16
VMware vSphere
Logical Architecture Overview
17
Design Decisions
vSphere 7.0.x
18
vSphere Design Decisions
The VMware vSphere design here follows the recommendations in the VMware Validated Design for
SDDC version 5.1.2.
Only the VVD design decisions that are used and specific to vSphere are listed here. Note that some may
be modified to fit this design, for example the removal of vSAN from the default design.
For a complete list of design decisions for vSphere refer to the vSphere Design Service, or review the
VMware Validated Design for SDDC.
Note that some of these design decisions will be changed if you are using the vSAN Design Service, as it
will add or modify decisions to incorporate vSAN.
19
Consultant:
Design Decisions Please refer to speaker notes for instruction
Physical Design Summary
SDDC-PHY-009 Verify that all nodes have uniform configuration across a cluster. PS-Custom
20
Design Decisions
Physical Design Summary (2)
21
Design Decisions
Physical Network Design Summary
Design Decision ID Design Decision Description Applicable to
Architecture Model
Implement the following physical network architecture: PS-Custom
- A minimum of one 10-GbE port (one 25 GbE port recommended) on each
ToR switch for ESXi host uplinks
- No EtherChannel (LAG/vPC) configuration for ESXi host uplinks
SDDC-PHY-NET-001 - Layer 3 device with BGP and IGMP support
SDDC-PHY-NET-002 Use a physical network that is configured for BGP routing adjacency. PS-Custom
22
Design Decisions
Physical Network Design Summary (2)
Design Decision ID Design Decision Description Applicable to
Architecture Model
SDDC-PHY-NET-007 Use an NTP time source for all management nodes. PS-Custom
Configure the MTU size to 9000 bytes (Jumbo Frames) on the port groups PS-Custom
SDDC-PHY-NET-008 that support vSAN, vMotion, VXLAN, vSphere Replication, and NFS.
23
Design Decision
Physical Storage Design Summary
Design Decision ID Design Decision Description Applicable to Architecture
Model
SDDC-PHY-STO-006 When using a single availability zone, provide NFS storage. PS-Custom
SDDC-PHY-STO-007 Store templates and ISO files on the primary datastore. PS-Custom
Provide storage for virtual machine backups for the availability PS-Custom
SDDC-PHY-STO-008 zone.
SDDC-PHY-STO-009 Use 10K SAS drives for NFS volumes. PS-Custom
24
Design Decision
Virtual Infrastructure – ESXi Host Design Summary
Design Decision Design Decision Description Applicable to Architecture
ID Model
Install and configure all ESXi hosts to boot using a SD device of 16 PS-Custom
SDDC-VI-ESXi-001 GB or greater.
Add each host to the Active Directory domain for the region in PS-Custom
SDDC-VI-ESXi-002 which it will reside.
Change the default ESX Admins group to the SDDC-Admins PS-Custom
Active Directory group. Add ESXi administrators to the SDDC-
SDDC-VI-ESXi-003 Admins group following standard access procedures.
Configure all ESXi hosts to synchronize time with the central NTP PS-Custom
SDDC-VI-ESXi-004 servers.
SDDC-VI-ESXi-005 Enable Lockdown mode on all ESXi hosts. PS-Custom
25
Design Decision
Virtual Infrastructure – vCenter Server Design Summary (1)
Design Decision Design Decision Description Applicable to
ID Architecture Model
Deploy two vCenter Server systems. One vCenter Server supporting the SDDC management PS-Custom
components. One vCenter Server supporting the compute components and tenant workloads.
Note: Platform Services Controller services are embedded in to the vCenter appliance in
SDDC-VI-VC-001 vSphere 7.0. The external deployment model is no longer supported.
PS-Custom
SDDC-VI-VC-002 Deploy all vCenter Server instances as Linux-based vCenter Server Appliances.
PS-Custom
SDDC-VI-VC-003 Join all vCenter Server instances to a single vCenter Single Sign-On domain.
PS-Custom
SDDC-VI-VC-004 Create a ring topology for the Platform Service Controllers.
PS-Custom
SDDC-VI-VC-005 Protect all vCenter Server appliances by using vSphere HA.
PS-Custom
SDDC-VI-VC-006 Deploy Management vCenter Server Appliances of a small deployment size or larger.
PS-Custom
SDDC-VI-VC-007 Deploy Compute vCenter Server Appliances of a large deployment size or larger.
PS-Custom
SDDC-VI-VC-008 Use vSphere HA to protect all clusters virtual machines against failures.
26
Design Decision
Virtual Infrastructure – vCenter Server Design Summary (2)
Design Decision ID Design Decision Description Architecture Model
PS-Custom
SDDC-VI-VC-009 Set vSphere HA Host Isolation Response to Power Off.
Create a single management cluster per region. This cluster contains all management ESXi PS-Custom
SDDC-VI-VC-010 hosts.
PS-Custom
SDDC-VI-VC-011 Create a compute cluster per region. This cluster contains tenant workloads.
In Region A, create a management cluster with a minimum of 4 ESXi hosts for a single PS-Custom
SDDC-VI-VC-012 availability zone (there is a minimum of 4 ESXi hosts in each availability zone).
When using a single availability zone, configure Admission Control for 1 ESXi host failure PS-Custom
SDDC-VI-VC-013 and percentage-based failover capacity.
PS-Custom
SDDC-VI-VC-014 When using a single availability zone, create a host profile for the Management Cluster.
For a single availability zone, configure vSphere HA to use percentage-based failover PS-Custom
SDDC-VI-VC-015 capacity to ensure n+1 availability.
PS-Custom
SDDC-VI-VC-016 Enable Virtual Machine Monitoring for each cluster.
Create Virtual Machine Groups for use in startup rules in the management and compute PS-Custom
SDDC-VI-VC-017 clusters.
Create Virtual Machine rules to specify the startup order of the SDDC management PS-Custom
SDDC-VI-VC-018 components.
27
Design Decision
Virtual Infrastructure – vCenter Server Design Summary (3)
Design Decision Design Decision Description Applicable to Architecture Model
ID
Enable vSphere DRS on all clusters and set it to Fully Automated, with the default PS-Custom
SDDC-VI-VC-019 setting (medium).
PS-Custom
SDDC-VI-VC-020 Enable Enhanced vMotion Compatibility (EVC) on all clusters.
Set the cluster EVC mode to the highest available baseline that is supported for the PS-Custom
SDDC-VI-VC-021 lowest CPU architecture on the hosts in the cluster.
Replace the vCenter Server machine certificate and Platform Services Controller PS-Custom
machine certificate with a certificate signed by a 3rd party Public Key
SDDC-VI-VC-022 Infrastructure.
PS-Custom
SDDC-VI-VC-023 Use a SHA-2 or higher algorithm when signing certificates.
28
Design Decision
Virtual Infrastructure – Network Design Summary
29
Design Decision
Virtual Infrastructure – Network Design Summary (2)
30
Design Decision
Virtual Infrastructure – Storage Design Summary
31
Design Decision
Virtual Infrastructure – NFS Storage Design Summary
32
Design Decision
Virtual Infrastructure – vSphere Lifecycle Manager Design Summary
Design Decision ID Design Decision Description Applicable to
Architecture Model
Use the vSphere Lifecycle Manager service on each vCenter Server PS-Custom
Appliance to provide a total of two vSphere Lifecycle Manager instances
SDDC-OPS-VLM-001 that you configure and use for patch management.
Use the network settings of the vCenter Server Appliance for vSphere PS-Custom
SDDC-OPS-VLM-002 Lifecycle Manager.
PS-Custom
SDDC-OPS-VLM-003 Use the default patch repositories by VMware.
PS-Custom
SDDC-OPS-VLM-004 Set the VM power state to Do Not Power Off.
Enable parallel remediation of hosts assuming that there are enough PS-Custom
SDDC-OPS-VLM-005 resources available to support update of multiple hosts at the same time.
PS-Custom
SDDC-OPS-VLM-006 Enable migration of powered off virtual machines and templates.
Use the default critical and non-critical patch baselines for the management PS-Custom
SDDC-OPS-VLM-007 cluster and for the compute cluster.
33
Design Decision
Virtual Infrastructure – vSphere Lifecycle Manager Design Summary (2)
Design Decision ID Design Decision Description Applicable to Architecture
Model
PS-Custom
SDDC-OPS-VLM-008 Use the default schedule of a once-per-day check and patch download.
Remediate hosts, virtual machines, and virtual appliances once a month or PS-Custom
SDDC-OPS-VLM-009 per business guidelines.
34
Design Decision
Virtual Infrastructure – Data Protection and Backup Design Summary
Design Decision ID Design Decision Description Architecture Model
Use a backup solution that is compatible with vSphere Storage APIs - Data PS-Custom
Protection (VADP) and can perform image level backups of the management
SDDC-OPS-BKP-001 components.
Use a VADP-compatible backup solution that can perform application-level PS-Custom
SDDC-OPS-BKP-002 backups of the management components.
Allocate a dedicated datastore for the VADP-compatible backup solution and the PS-Custom
SDDC-OPS-BKP-003 backup data according to NFS Physical Storage Design.
PS-Custom
SDDC-OPS-BKP-004 Provide storage with a capacity of at least 12 TB on-disk.
PS-Custom
SDDC-OPS-BKP-005 Use HotAdd to back up virtual machines.
PS-Custom
SDDC-OPS-BKP-006 Use the VADP solution agent for backups of the Microsoft SQL Server.
PS-Custom
SDDC-OPS-BKP-007 Schedule daily backups.
PS-Custom
SDDC-OPS-BKP-008 Schedule backups outside the production peak times.
PS-Custom
SDDC-OPS-BKP-009 Retain backups for at least 3 days.
35
Design Decision
Virtual Infrastructure – Data Protection and Backup Design Summary (2)
Design Decision ID Design Decision Description Applicable to Architecture
Model
Configure a service account in vCenter Server for application-to- PS-Custom
application communication from VADP-compatible backup solution with
SDDC-OPS-BKP-010 vSphere.
Use global permissions when you create the service account in vCenter PS-Custom
SDDC-OPS-BKP-011 Server.
36
Design Decision Detail
vSphere 7.0.x
37
Design Decision Detail
Physical Design
38
Design Decision # SDDC-PHY-001
In Region A, deploy one availability zone to support all SDDC management components and their SLAs.
In each availability zone, place the management cluster and the compute cluster in the same rack.
Combined with redundant network connections into a If the equipment of an entire rack fails, the cause,
rack and within a rack, redundant power feeds prevent such as flooding or an earthquake, also affects
failure of equipment in an entire rack. neighboring racks. A second region is needed to
mitigate downtime associated with such an event.
Single Power Feeds Reduced cost for implementation of racks. Increased chance of failure if the power feed fails.
A second Region would be needed to mitigate
downtime in this case, increasing cost.
For each availability zone, mount the compute resources (minimum of 4 ESXi hosts) for the management cluster together in a rack.
For each availability zone, mount the compute resources for the compute cluster (minimum of 4 ESXi hosts) together in a rack.
Set up each ESXi host in the management cluster with a minimum of 256 GB RAM.
49
Design Decision # SDDC-PHY-NET-001
Provides compatibility with vSphere host profiles because Requires dynamic routing protocol
they do not store link-aggregation settings. configuration in the physical networking
Supports BGP as the dynamic routing protocol in the stack.
SDDC.
Assign static IP addresses to all management components in the SDDC infrastructure except for NSX VTEPs.
NSX VTEPs are assigned by using a DHCP server. Set the lease duration for the VTEP DHCP scope to at least 7 days..
Create DNS records for all management nodes to enable forward, reverse, short and FQDN resolution.
Configure the MTU size to 9000 bytes (Jumbo Frames) on the port groups that support vSAN, vMotion, VXLAN, vSphere
Replication, and NFS.
58
Design Decision # SDDC-PHY-STO-006
Provide storage for virtual machine backups for the availability zone.
vRealize Automation uses NFS datastores for its content catalog which
requires high performance datastores.
vRealize Log Insight uses NFS datastores for its archive storage which,
depending on compliance regulations, can use a large amount of disk space.
65
Design Decision # SDDC-VI-ESXi-001
Install and configure all ESXi hosts to boot using a SD device of 16 GB or greater.
Add each host to the Active Directory domain for the region in which it will reside.
Ensuring that users log in with a unique user account allows greater
visibility for auditing.
Local Authentication No additional configuration needed in Active Directory. Hard to maintain across many hosts, as all
users need to be configured and
maintained per host.
Change the default ESXi Admins group to the SDDC-Admins Active Directory group. Add ESXi administrators to the SDDC-
Admins group following standard access procedures.
Configure all ESXi hosts to synchronize time with the central NTP servers.
71
Design Decision # SDDC-VI-VC-001
Deploy two vCenter Server systems. One vCenter Server supporting the SDDC management components. One vCenter Server
supporting the compute components and tenant workloads. Note: Platform Services Controller services are embedded in to the
vCenter appliance in vSphere 7.0. The external deployment model is no longer supported.
Join all vCenter Server instances to a single vCenter Single Sign-On domain.
Create a single management cluster per region. This cluster contains all management ESXi hosts.
Create a compute cluster per region. This cluster contains tenant workloads.
In Region A, create a management cluster with a minimum of 4 ESXi hosts for a single availability zone (there is a minimum of 4
ESXi hosts in each availability zone).
When using a single availability zone, configure Admission Control for 1 ESXi host failure and percentage based failover capacity.
When using a single availability zone, create a host profile for the Management Cluster.
For a single availability zone, configure vSphere HA to use percentage-based failover capacity to ensure n+1 availability.
Create virtual machine groups for use in startup rules in the management and compute clusters.
Create Virtual Machine rules to specify the startup order of the SDDC management components.
Enable DRS on all clusters and set it to Fully Automated, with the default setting (medium).
Set EVC mode to the highest available setting supported for the hosts in the cluster.
Replace the vCenter Server machine certificate and Platform Services Controller machine certificate with a certificate signed by a
3rd party Public Key Infrastructure.
95
Design Decision # SDDC-VI-NET-001
Enable vSphere Distributed Switch Health Check on all virtual distributed switches.
Use the Route based on physical NIC load teaming algorithm for all port groups except for ECMP uplinks and ones that carry
VXLAN traffic.
- ECMP uplink port groups use Route based on originating virtual port.
- VTEP kernel ports and VXLAN traffic use Route based on SRC-ID.
High share value High importance for the traffic, during times of contention. This traffic may consume significant amount of
bandwidth when there is contention and impact other
traffic types.
Normal share value Because NFS is used for secondary storage, such as VDP During times of contention VDP backups will be
backups and vRealize Log Insight archives it is not as slower than usual.
important as vSAN traffic, by prioritizing it lower vSAN is not
impacted.
Custom share value Completely custom share definition based on environmental May not be a relative to other traffic value set
need. which could cause unpredictable share values.
Normal share value Average importance for traffic, during times of contention. During times of network contention this traffic may be
impacted.
Custom share value Completely custom share definition based on environmental May not be a relative to other traffic value set which
need. could cause unpredictable share values.
113
Design Decision # SDDC-VI-Storage-001
When using a single availability zone in the management cluster, use NFS as the shared storage platform for the management
cluster.
In all clusters, ensure that at least 20% of free space is always available on all datastores.
Select an array that supports vStorage APIs for Array Integration (VAAI) over NAS (NFS).
Enable Storage I/O Control with the default values on all non-vSAN datastores (not supported with vSAN as it is built into the
product).
118
Design Decision # SDDC-VI-Storage-NFS-001
Create volumes for the clusters as appropriate for the workloads running.
Place the VADP based backup export on its own separate volume as per SDDC-PHY-STO-010.
For each export, limit access to only the application VMs or hosts requiring the ability to mount the storage.
123
Design Decision # SDDC-OPS-VLM-001
Use the vSphere Lifecycle Manager service on each vCenter Server Appliance to provide a total of two vSphere Lifecycle Manager
instances that you configure and use for patch management.
Use the network settings of the vCenter Server Appliance for vSphere Lifecycle Manager.
Enable parallel remediation of hosts assuming that there are enough resources available to support update of multiple hosts at the
same time.
Use the default critical and non-critical patch baselines for the management cluster and for the compute cluster.
Custom Schedule Allows for checks to be done as appropriate, such as during change None.
control windows or the like.
Remediate hosts, virtual machines, and virtual appliances once a month or per business guidelines.
133
Design Decision # SDDC-OPS-BKP-001
Use a backup solution that is compatible with vSphere Storage APIs - Data Protection (VADP) and can perform image level backups
of the management components.
Use a VADP-compatible backup solution that can perform application-level backups of the management components.
Allocate a dedicated datastore for the VADP-compatible backup solution and the backup data according to NFS Physical Storage
Design.
Use the VADP solution agent for backups of the Microsoft SQL Server.
Configure a service account in vCenter Server for application-to-application communication from VADP-compatible backup
solution with vSphere.
Use global permissions when you create the service account in vCenter Server.