Lecture 4
Lecture 4
•Lecture 5
• Traditional RAID
• RAID 2.0+
• Other RAID Technologies
• Challenges in traditional computer systems must be addressed. Individual disks
provide insufficient capacities, do not offer redundancy protection, feature
limited performance, and require complex management. Failures of individual
disks may cause data loss.
Instructions processed
per second
CPU > 1 million Disks have become the bottleneck for system performance.
How large is a
logical disk?
Logical disk
• Stripe width
Indicates the number of disks used in an array for striping. For example,
if a disk array consists of three member disks, the stripe width is 3.
• Stripe depth
Indicates the capacity of a strip. Disk 1 Disk 2 Disk 3
D6 D7 D8 Stripe 2 Depth of a stripe
D3 D4 D5 Stripe 1
D0 D1 D2 Stripe 0
Data strips in a Data strips in a Data strips in a
disk disk disk
Data Protection Techniques
• Mirroring: Data copies are stored on another redundant disk.
• Exclusive or (XOR)
• XOR is widely used in digital electronics and computer science.
• XOR is a logical operation that outputs true only when inputs differ (one is true, the other is false).
• 0 ⊕ 0 = 0, 0 ⊕ 1 = 1, 1 ⊕ 0 = 1, 1 ⊕ 1 = 0
1 1 0
0 1 1
0 0 0
D5 D 0, D 1, D 2, D 3, D 4, D 5
D4
D3
Physical disk 1 Physical disk 2
D2 D6
D4 D5 Stripe 2
D1
D2 D3 Stripe 1
D0
D0 D1 Stripe 0
D1
D1 D1
Logical disk D0 D0
C
A 0, A 1, A 2, B 0, B 1, B 2, C 0, C 1, C 2
A C0 C1 C2 P3
B0 B1 B2 P2
A0 A1 A2 P1
Logical disk
Note: A write penalty occurs when just a small amount of new data needs to be written to one or two disks.
• RAID 3 is similar to RAID 0 but uses dedicated parity stripes. In a RAID 3 array, a dedicated disk (parity disk) is used to
store the parity data of strips in other disks in the same stripe. If incorrect data is detected or a disk fails, data in the
faulty disk can be recovered using the parity data. RAID 3 applies to data-intensive or single-user environments
where data blocks need to be continuously accessed for a long time. RAID 3 writes data to all member data disks.
However, when new data is written to any disk, RAID 3 recalculates and rewrites parity data. Therefore, when a large
amount of data from an application is written, the parity disk in a RAID 3 array needs to process heavy workloads.
Parity operations have certain impact on the read and write performance of a RAID 3 array. In addition, the parity
disk is subject to the highest failure rate in a RAID 3 array due to heavy workloads. This is why the parity disk is called
the bottleneck of RAID 3. A write penalty occurs when just a small amount of data is written to multiple disks, which
does not improve disk performance as compared with data writes to a single disk.
• RAID 3 uses a single disk for fault tolerance and performs parallel data transmission. In other words, RAID 3 uses
striping to divide data into blocks and writes XOR parity data to the last disk (parity disk).
• The write performance of RAID 3 depends on the amount of changed data, the number of disks, and the time
required to calculate and store parity data. If a RAID 3 array consists of N member disks of the same rotational speed
and write penalty is not considered, its sequential I/O write performance is theoretically slightly inferior to N – 1
times that of a single disk when full-stripe write is performed. (Additional time is required to calculate redundancy
check.)
How Does RAID 5 Work
Write data.
Read data.
D5
D 0, D 1, D 2, D 3, D 4, D 5
D4
D3
D2
Physical disk 1 Physical disk 2 Physical disk 3
D1
P2 D4 D5
D0
D2 P1 D3
D0 D1 P0
Logical disk
User data D 0, D 1, D 2, D 3, D 4, D 5
D4 D4 D5 D5
D2 D2 D3 D3
D0 D0 D1 D1