Nl400 Boot Drive Replacement Guide
Nl400 Boot Drive Replacement Guide
00 KB)
6/30/2025 1:47 PM
Last saved by EMC
Isilon
994-0019-01 Rev I
CAUTION: If this procedure is not followed accurately, data loss and severe disruption of cluster
operations can occur. Perform every step in this procedure; if the system does not respond as
expected, contact Isilon Technical Support.
CAUTION: Perform this procedure on only one node at a time. Performing maintenance on
multiple nodes in parallel may lower the protection level of the cluster, put data at risk, and lead to
the interruption of client workflows.
1
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
Note: See the Considerations for installing the latest drive support package section in order to select
the appropriate variant of the package. If you are unable to download the package, contact EMC
Isilon Technical Support for assistance.
3. [ ] Open a secure shell (SSH) connection to any node in the cluster and log in.
4. [ ] Create or check for the availability of the directory structure
/ifs/data/Isilon_Support/dsp.
5. [ ] Copy the downloaded file to the dsp directory through SCP, FTP, SMB, NFS, or any
other supported data-access protocols.
6. [ ] Unpack the drive support package by running the following command:
tar -zxvf Drive_Support_<version>.tgz
7. [ ] Install the package by running the following command:
isi_dsp_install Drive_Support_<version>.tar
Note: You must run the isi_dsp_install command to install the dirve support package. Do not
use the isi pkg command.
2
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
The boot drives are listed in the left column. In the previous example, both boot drives are healthy. If
one of the boot drives has failed, the drive will not appear in the output.
Make note of whether the failed boot drive is the ada1 or ada0 device, and then use the following
table to determine the location of the boot drive inside the node.
3
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
Make note of the board drive slot that contains the failed boot drive. The ada1 drive is on the right
side of the boot carrier card. The ada0 drive is on the left side of the boot carrier card.
CAUTION: If both drives appear to have failed, do not continue. Contact Isilon Technical
Support immediately.
The boot drives are listed under ATA channel 2 (master) and ATA channel 3 (slave). In the
previous example, both boot drives are healthy. If one of the boot drives has failed, the display reads
no device present for that drive.
Make note of whether the failed boot drive is the ad4 or ad7 device, and then use the following table
to determine the location of the boot drive inside the node.
Make note of the board drive slot that contains the failed boot drive.
CAUTION: If both drives appear to have failed, do not continue. Contact Isilon Product
Support immediately.
4
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
3. [ ] If both drives appear to be healthy, one of the drives may have partially failed. To identify
a partially failed drive, check the status of the individual partition mirrors by typing the following
command:
gmirror status
From left to right, the output displays the name of each mirror, the status of the mirror relationship,
and the component IDs for each boot drive.
The following example shows the boot drive partition layout in a healthy node. The mirrors for each
partition show:
a value of COMPLETE in the Status column.
the component IDs for both boot drives in the Components column. The component IDs are a
combination of the OneFS Drive ID, and the partition number (the number following the letter p).
Both boot drives are listed for each mirror with the exception of the var-crash mirror, which only
lists the slave drive.
Note: If you are running OneFS 8.0 or later, your OneFS Drive IDs will display as ada0 or ada1.The
partition numbers in the display may differ from the following example.
The following example shows the boot drive partition layout as it appears in the event of a failed boot
drive. A failed boot drive forces the mirrors for a partition to show:
a value of DEGRADED in the Status column.
only the component ID of the healthy boot drive in the Components column. The failed boot drive
does not appear.
Note: DEGRADED does not refer to a specific drive, but to the mirror relationship between the drives.
If a drive appears in the Components column next to the DEGRADED status, it is healthy and should
not be removed.
5
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
ad4p5
In the previous example, ad7p4 is missing from the degraded partition mirror/root0, and ad7p6 is
missing from the degraded partition mirror/var0. The missing drive, ad7, is the failed drive.
Determine which drive has failed. Use the previous table to determine which board drive slot contains
the failed boot drive and make a note of the number (J3 or J4).
Note: If both drives have failed, do not continue. Contact Isilon Product Support.
CAUTION: A forced power down should be attempted only if a node is unresponsive. Forcing
the power down of a healthy node can result in data loss.
5. [ ] Verify that the node is powered down by typing the following command:
isi status -q
Confirm that the node has a status of D--R (Down, Read Only). See node 3 in the following example.
ID |IP Address |DASR| In Out Total| Used / Size |Used / Size
---+---------------+----+-----+-----+-----+------------------+-
1|10.53.217.201 | OK | 48M| 0| 48M| 19G/ 6.2T(< 1%)|(No SSDs)
2|10.53.217.202 | OK | 46M| 0| 46M| 23G/ 6.2T(< 1%)|(No SSDs)
3|10.53.217.203 |D--R| n/a| n/a| n/a| n/a/ n/a( n/a)|n/a/n/a( n/a)
6
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
Note: If there are transceivers connected to the end of your IB or ethernet cables, make sure to
remove them with the cables. If you are using fiber ethernet cables, you will need to disconnect the
cable from the transceiver, then remove the transceiver from the node.
DANGER: Slide the node out from the rack slowly. Do not extend the rails completely until
you confirm that the node is latched and safely secured to the rails.
WARNING: Properly ground yourself to prevent electrostatic discharge from damaging the node.
For example, attach an ESD strap to your wrist and the node chassis.
Procedure
1. [ ] Loosen the captive screw that secures the node top panel.
2. [ ] Slide the top panel toward the rear of the node, and then lift the top panel to access the
node interior.
7
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
2. [ ] Remove the cross bracket by pressing on the side of the node chassis where the cross
bracket is connected. Unhook the cross bracket from the chassis, then lift straight up to unhook the
other side of the bracket.
8
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
1. Boot drive
Procedure
1. [ ] Locate the two board drive slots that contain the boot drives. The slots are labeled J3 and
J4. Gently pull the failed boot drive from the board drive slot.
9
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
1. J3 connector 2. J4 connector
2. [ ] Insert the replacement boot drive into the empty boot drive slot and gently press down to
secure the drive.
WARNING: The cross bracket sits directly above the boot drives. Use caution when installing the
cross bracket so that the boot drives are not dislodged or damaged.
10
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
WARNING: The chassis intrusion switch can be damaged if the top panel is slid too far back
on the node.
2. [ ] Tighten the captive top panel screw to secure the top panel to the node.
WARNING: Slide the node slowly so you do not slam the node into the rack and damage the
node.
2. [ ] Reconnect the ethernet, InfiniBand, and power cables to the back of the node.
3. [ ] Secure the node to the rack cabinet.
4. [ ] Replace the node front panel.
11
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
rm /sentinel.txt
Note: If you are running OneFS 8.0 or later, your OneFS Drive IDs will display as ada0 or ada1.The
partition numbers in the display may differ from the following example.
Confirm that the values in the Status column all read COMPLETE.
2. [ ] Verify boot drive information. Depending on your version of OneFS, run one of the
following commands:
OneFS 8.0 or later
camcontrol devlist | grep ad
The following information appears:
<SanDisk SSD P4 8GB SSD 8.10> at scbus1 target 0 lun 0 (pass13,ada0)
<SanDisk SSD P4 8GB SSD 8.10> at scbus2 target 1 lun 0 (pass14,ada1)
12
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
ATA channel 4:
Master: no device present
Slave: no device present
ATA channel 5:
Master: no device present
Slave: no device present
Note: If your cluster is running in SmartLock compliance mode with OneFS 7.0.2.10 or later, 7.0.1.4 or
later, or 7.1.1.0 or later you will need to enter the provided compliance mode commands to run the FRU
scripts. If your cluster is running in compliance mode but is not running one of these versions, you will
need to upgrade your OneFS version to support the compliance mode commands. Contact Isilon
Technical Support.
13
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
Note: You must include the period at the end of the command.
14
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
Note: Do not restart or power off nodes while drive firmware is being updated on the cluster.
Procedure
1. [ ] Open a secure shell (SSH) connection to any node in the cluster and log in.
2. [ ] Depending on your version of OneFS, run one of the following commands to update the
drive firmware for your cluster:
OneFS 8.0 or later
To update the drive firmware for your entire cluster, run the following command:
isi devices drive firmware update start all --node-lnn all
15
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
To update the drive firmware for a specific node only, run the following command:
isi devices drive firmware update start all --node-lnn <node-number>
OneFS 7.1.1 - OneFS 8.0
For OneFS versions between 7.1.1 - 8.0 you will need to run the following command on each node
that requires drive firmware:
isi devices -a fwupdate
CAUTION: You must wait for one node to finish updating before you initiate an update on the
next node. To confirm that a node has finished updating, run the following command:
isi devices -d <node-number>
Updating the drive firmware of a single drive takes approximately 15 seconds, depending on the drive
model. OneFS updates drives sequentially.
16
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
17
906285432.doc (287.00 KB)
6/30/2025 1:47 PM
Last saved by EMC
Isilon Info Hubs For the list of Isilon info hubs, see the Isilon Info Hubs
page on the EMC Isilon Community Network. Isilon info
hubs organize Isilon documentation, videos, blogs, and
user-contributed content into topic areas, making it easy
to find content about subjects that interest you.
18