t5440 Service en 02
t5440 Service en 02
TM
Service Manual
Please
Recycle
Copyright © 2009 Sun Microsystems, Inc., 4150 Network Circle, Santa Clara, California 95054, Etats-Unis. Tous droits réservés.
Entrée et revue tecnical fournies par FUJITSU LIMITED sur des parties de ce matériel.
Sun Microsystems, Inc. et Fujitsu Limited détiennent et contrôlent toutes deux des droits de propriété intellectuelle relatifs aux produits et
technologies décrits dans ce document. De même, ces produits, technologies et ce document sont protégés par des lois sur le copyright, des
brevets, d’autres lois sur la propriété intellectuelle et des traités internationaux. Les droits de propriété intellectuelle de Sun Microsystems, Inc.
et Fujitsu Limited concernant ces produits, ces technologies et ce document comprennent, sans que cette liste soit exhaustive, un ou plusieurs
des brevets déposés aux États-Unis et indiqués à l’adresse http://www.sun.com/patents de même qu’un ou plusieurs brevets ou applications
brevetées supplémentaires aux États-Unis et dans d’autres pays.
Ce document, le produit et les technologies afférents sont exclusivement distribués avec des licences qui en restreignent l’utilisation, la copie, la
distribution et la décompilation. Aucune partie de ce produit, de ces technologies ou de ce document ne peut être reproduite sous quelque
forme que ce soit, par quelque moyen que ce soit, sans l’autorisation écrite préalable de Fujitsu Limited et de Sun Microsystems, Inc., et de leurs
éventuels bailleurs de licence. Ce document, bien qu’il vous ait été fourni, ne vous confère aucun droit et aucune licence, expresses ou tacites,
concernant le produit ou la technologie auxquels il se rapporte. Par ailleurs, il ne contient ni ne représente aucun engagement, de quelque type
que ce soit, de la part de Fujitsu Limited ou de Sun Microsystems, Inc., ou des sociétés affiliées.
Ce document, et le produit et les technologies qu’il décrit, peuvent inclure des droits de propriété intellectuelle de parties tierces protégés par
copyright et/ou cédés sous licence par des fournisseurs à Fujitsu Limited et/ou Sun Microsystems, Inc., y compris des logiciels et des
technologies relatives aux polices de caractères.
Par limites du GPL ou du LGPL, une copie du code source régi par le GPL ou LGPL, comme applicable, est sur demande vers la fin utilsateur
disponible; veuillez contacter Fujitsu Limted ou Sun Microsystems, Inc.
Cette distribution peut comprendre des composants développés par des tierces parties.
Des parties de ce produit pourront être dérivées des systèmes Berkeley BSD licenciés par l’Université de Californie. UNIX est une marque
déposée aux Etats-Unis et dans d’autres pays et licenciée exclusivement par X/Open Company, Ltd.
Sun™, Sun Microsystems™, le logo Sun©, Java™, Netra™, Solaris™, Sun StorageTek™, docs.sun.comSM, OpenBoot™, SunVTS™, Sun Fire™,
SunSolveSM, CoolThreads™, et J2EE™ sont des marques de fabrique ou des marques déposées de Sun Microsystems, Inc. , ou ses filiales aux
Etats-Unis et dans d’autres pays.
Fujitsu et le logo Fujitsu sont des marques déposées de Fujitsu Limited.
Toutes les marques SPARC sont utilisées sous licence et sont des marques de fabrique ou des marques déposées de SPARC International, Inc.
aux Etats-Unis et dans d’autres pays. Les produits portant les marques SPARC sont basés sur une architecture développée par Sun
Microsystems, Inc.
SPARC64 est une marques déposée de SPARC International, Inc., utilisée sous le permis par Fujitsu Microelectronics, Inc. et Fujitsu Limited.
SSH est une marque déposée registre de SSH Communications Security aux Etats-Uniset dans certaines autres juridictions.
L’interface d’utilisation graphique OPEN LOOK et Sun™ a été développée par Sun Microsystems, Inc. pour ses utilisateurs et licenciés. Sun
reconnaît les efforts de pionniers de Xerox pour la recherche et le développement du concept des interfaces d’utilisation visuelle ou graphique
pour l’industrie de l’informatique. Sun détient une license non exclusive de Xerox sur l’interface d’utilisation graphique Xerox, cette licence
couvrant également les licenciés de Sun qui mettent en place l’interface d’utilisation graphique OPEN LOOK et qui, en outre, se conforment aux
licences écrites de Sun.
Droits du gouvernement américain - logiciel commercial. Les utilisateurs du gouvernement américain sont soumis aux contrats de licence
standard de Sun Microsystems, Inc. et de Fujitsu Limited ainsi qu’aux clauses applicables stipulées dans le FAR et ses suppléments.
Avis de non-responsabilité: les seules garanties octroyées par Fujitsu Limited, Sun Microsystems, Inc. ou toute société affiliée de l’une ou l’autre
entité en rapport avec ce document ou tout produit ou toute technologie décrit(e) dans les présentes correspondent aux garanties expressément
stipulées dans le contrat de licence régissant le produit ou la technologie fourni(e).
SAUF MENTION CONTRAIRE EXPRESSÉMENT STIPULÉE DANS CE CONTRAT, FUJITSU LIMITED, SUN MICROSYSTEMS, INC. ET LES
SOCIÉTÉS AFFILIÉES REJETTENT TOUTE REPRÉSENTATION OU TOUTE GARANTIE, QUELLE QU’EN SOIT LA NATURE (EXPRESSE
OU IMPLICITE) CONCERNANT CE PRODUIT, CETTE TECHNOLOGIE OU CE DOCUMENT, LESQUELS SONT FOURNIS EN L’ÉTAT. EN
OUTRE, TOUTES LES CONDITIONS, REPRÉSENTATIONS ET GARANTIES EXPRESSES OU TACITES, Y COMPRIS NOTAMMENT TOUTE
GARANTIE IMPLICITE RELATIVE À LA QUALITÉ MARCHANDE, À L’APTITUDE À UNE UTILISATION PARTICULIÈRE OU À
L’ABSENCE DE CONTREFAÇON, SONT EXCLUES, DANS LA MESURE AUTORISÉE PAR LA LOI APPLICABLE.
Sauf mention contraire expressément stipulée dans ce contrat, dans la mesure autorisée par la loi applicable, en aucun cas Fujitsu Limited, Sun
Microsystems, Inc. ou l’une de leurs filiales ne sauraient être tenues responsables envers une quelconque partie tierce, sous quelque théorie
juridique que ce soit, de tout manque à gagner ou de perte de profit, de problèmes d’utilisation ou de perte de données, ou d’interruptions
d’activités, ou de tout dommage indirect, spécial, secondaire ou consécutif, même si ces entités ont été préalablement informées d’une telle
éventualité.
LA DOCUMENTATION EST FOURNIE “EN L’ETAT” ET TOUTES AUTRES CONDITIONS, DECLARATIONS ET GARANTIES EXPRESSES
OU TACITES SONT FORMELLEMENT EXCLUES, DANS LA MESURE AUTORISEE PAR LA LOI APPLICABLE, Y COMPRIS NOTAMMENT
TOUTE GARANTIE IMPLICITE RELATIVE A LA QUALITE MARCHANDE, A L’APTITUDE A UNE UTILISATION PARTICULIERE OU A
L’ABSENCE DE CONTREFACON.
Contents
Preface xiii
Managing Faults 9
Understanding Fault Handling Options 9
Server Diagnostics Overview 9
Diagnostic Flowchart 11
Options for Accessing the Service Processor 14
ILOM Overview 15
ALOM CMT Compatibility Shell Overview 17
Solaris Predictive Self-Healing Overview 17
SunVTS Overview 18
POST Fault Management Overview 19
POST Fault Management Flowchart 20
Memory Fault Handling Overview 21
Connecting to the Service Processor 22
v
▼ Switch From the System Console to the Service Processor (ILOM or
ALOM CMT Compatibility Shell) 23
▼ Switch From ILOM to the System Console 23
▼ Switch From the ALOM CMT Compatibility Shell to the System
Console 24
Displaying FRU Information with ILOM 24
▼ Display System Components With the ILOM show components
Command 24
▼ Display Individual Component Information With the ILOM show
Command 25
Controlling How POST Runs 26
▼ Change POST Parameters 27
▼ Run POST in Maximum Mode 28
Detecting Faults 30
Detecting Faults Using LEDs 30
Detecting Faults Using ILOM show faulty Command 32
▼ Detect Faults Using the ILOM show faulty Command 33
Detecting Faults Using Solaris OS Files and Commands 35
▼ Check the Message Buffer 36
▼ View System Message Log Files 36
Detecting Faults Using the ILOM Event Log 37
▼ View ILOM Event Log 37
Detecting Faults Using SunVTS Software 38
▼ Verify Installation of SunVTS Software 38
▼ Start the SunVTS Browser Environment 39
SunVTS Software Packages 41
Useful SunVTS Tests 42
Detecting Faults Using POST 42
Identifying Faults Detected by PSH 44
Contents vii
Performing Electrostatic Discharge – Antistatic Prevention Measures 69
▼ Perform Electrostatic Discharge – Antistatic Prevention Measures 69
Removing the Top Cover 69
▼ Remove the Top Cover 69
Contents ix
▼ Install the Service Processor 122
Servicing the IDPROM 123
▼ Remove the IDPROM 123
▼ Install the IDPROM 124
Servicing the Battery 125
▼ Remove the Battery 125
▼ Install the Battery 125
Servicing the Power Distribution Board 126
▼ Remove the Power Distribution Board 126
▼ Install the Power Distribution Board 128
Servicing the Fan Tray Carriage 129
▼ Remove the Fan Tray Carriage 129
▼ Install the Fan Tray Carriage 131
Servicing the Hard Drive Backplane 132
▼ Remove the Hard Drive Backplane 132
▼ Install the Hard Drive Backplane 133
Servicing the Motherboard 135
▼ Remove the Motherboard 135
▼ Install the Motherboard 138
Motherboard Fastener Locations 139
Servicing the Flex Cable Assembly 140
▼ Remove the Flex Cable Assembly 141
▼ Install the Flex Cable Assembly 142
Servicing the Front Control Panel 144
▼ Remove the Front Control Panel 144
▼ Install the Front Control Panel 145
Servicing the Front I/O Board 146
▼ Remove the Front I/O Board 147
Contents xi
Field-Replaceable Units 176
Index 179
This manual provides detailed procedures that describe the removal and replacement
of replaceable parts in the SPARC Enterprise™ T5440 Server. This manual also
includes information about the use and maintenance of the server. This document is
written for technicians, system administrators, authorized service providers (ASPs),
and users who have advanced experience troubleshooting and replacing hardware.
Keep this manual handy for further reference. Fujitsu makes every effort to prevent
users and bystanders from being injured or from suffering damage to their property.
Use the product according to this manual.
xiii
Structure and Contents of This Manual
This manual is organized as described below:
■ “Identifying Server Components” on page 1
Provides an overview of the server, including major boards and components, as
well as front and rear panel features.
■ “Managing Faults” on page 9
Describes the diagnostics that are available for monitoring and troubleshooting
the server.
■ “Preparing to Service the System” on page 59
Describes the steps necessary to prepare the server for service.
■ “Servicing Customer-Replaceable Units” on page 71
Describes how to service customer-replaceable units (CRUs)
■ “Servicing Field-Replaceable Units” on page 115
Describes how to service field-replaceable units (FRUs)
■ “Returning the Server to Operation” on page 149
Describes how to bring the server back to operation after performing service
procedures.
■ “Performing Node Reconfiguration” on page 155
Describes how to perform node reconfiguration.
■ “Connector Pinouts” on page 167
Contains pinout tables for all external connectors.
■ “Server Components” on page 173
Contains illustrations showing server components.
Global Site
(http://www.fujitsu.com/sparcenterprise/manual/)
Japanese Site
(http://primeserver.fujitsu.com/sparcenterprise/manual/)
SPARC Enterprise T5440 Server Minimum steps to power on and boot the C120-E504
Getting Started Guide server for the first time
SPARC Enterprise T5440 Server Information about the latest product C120-E508
Product Notes updates and issues
Important Safety Information for Safety information that is common to all C120-E391
Hardware Systems SPARC Enterprise series servers
SPARC Enterprise T5440 Server Safety and compliance information that is C120-E509
Safety and Compliance Guide specific to the server
SPARC Enterprise/ Requirements and concepts of installation C120-H007
PRIMEQUEST Common and facility planning for the setup of
Installation Planning Manual SPARC Enterprise and PRIMEQUEST
SPARC Enterprise T5440 Server Server specifications for site planning C120-H029
Site Planning Guide
SPARC Enterprise T5440 Server Detailed rackmounting, cabling, power on, C120-E510
Installation and Setup Guide and configuring information
SPARC Enterprise T5440 Server How to run diagnostics to troubleshoot the C120-E512
Service Manual server, and how to remove and replace
parts in the server
SPARC Enterprise T5440 Server How to perform administrative tasks that C120-E511
Administration Guide are specific to the server
Integrated Lights Out Manager Information that is common to all C120-E474
2.0 User’s Guide platforms managed by Integrated Lights
Out Manager (ILOM) 2.0
Integrated Lights Out Manager How to use the ILOM 2.0 software on the C120-E513
2.0 Supplement for SPARC server
Enterprise T5440 Server
Preface xv
Title Description Manual Code
Integrated Lights Out Manager Information that describes ILOM 3.0 C120-E573
3.0 Concepts Guide features and functionality
Integrated Lights Out Manager Information and procedures for network C120-E576
3.0 Getting Started Guide connection, logging in to ILOM 3.0 for the
first time, and configuring a user account
or a directory service
Integrated Lights Out Manager Information and procedures for accessing C120-E574
3.0 Web Interface Procedures ILOM 3.0 functions using the ILOM web
Guide interface
Integrated Lights Out Manager Information and procedures for accessing C120-E575
3.0 CLI Procedures Guide ILOM 3.0 functions using the ILOM CLI
Integrated Lights Out Manager Information and procedures for accessing C120-E579
3.0 SNMP and IPMI Procedure ILOM 3.0 functions using SNMP or IPMI
Guide management hosts
Integrated Lights Out Manager Enhancements that have been made to C120-E600
3.x Feature Updates and Release ILOM firmware since the ILOM 3.0 release
Notes
Integrated Lights Out Manager How to use the ILOM 3.0 software on the C120-E587
3.0 Supplement for SPARC server
Enterprise T5440 Server
External I/O Expansion Unit Procedures for installing the External I/O C120-E543
Installation and Service Manual Expansion Unit on the SPARC Enterprise
T5120/T5140/T5220/T5240/T5440 servers
External I/O Expansion Unit Important and late-breaking information C120-E544
Product Notes about the External I/O Expansion Unit
Note – Product Notes are available on the website only. Please check for the recent
update on your product.
Text Conventions
Typeface* Meaning Examples
Preface xvii
Prompt Notations
The following prompt notations are used in this manual.
C shell machine-name%
C shell superuser machine-name#
Bourne shell and Korn shell $
Bourne shell and Korn shell superuser #
ILOM service processor ->
ALOM compatibility shell sc>
OpenBoot PROM firmware ok
Warning – This indicates a hazardous situation that could result in death or serious
personal injury (potential hazard) if the user does not perform the procedure
correctly.
Caution – This indicates that surfaces are hot and might cause personal injury if
touched. Avoid contact.
Tip – This indicates information that could help the user to use the product more
effectively.
Caution – The following tasks regarding this product and the optional products
provided from Fujitsu should only be performed by a certified service engineer.
Users must not perform these tasks. Incorrect operation of these tasks may cause
malfunction.
Notes on Safety
Preface xix
Task Warning
Maintenance Damage
Two people must dismount and carry the chassis.
The server weighs approximately 88 lb (40 kg). Two people are required
to lift and mount the server into a rack enclosure when using the
procedures in this chapter.
Caution – This indicates that hazardous voltages are present. To reduce the risk of
electric shock and danger to personal health, follow the instructions.
Task Warning
Because 3.3v standby power is always present in the system, you must
unplug the power cords before accessing any cold-serviceable
components.
Caution – This indicates that surfaces are hot and might cause personal injury if
touched. Avoid contact.
Task Warning
Maintenance
Caution – The following tasks regarding this product and the optional products
provided from Fujitsu should only be performed by a certified service engineer.
Users must not perform these tasks. Incorrect operation of these tasks may cause
malfunction.
Remodeling/Rebuilding
Preface xxi
Alert Label
The following is a label attached to this product:
■ Never peel off the label.
■ The following label provides information to the users of this product.
(https://download.computers.us.fujitsu.com/)
(http://www.fujitsu.com/global/contact/computing/sparce_index.ht
ml)
Preface xxiii
xxiv SPARC Enterprise T5440 Server Service Manual • July 2009
Identifying Server Components
This section provides an overview of the server, including major boards and
components, as well as front and rear panel features.
Description Links
Related Information
■ “Server Components” on page 173
The SPARC Enterprise T5440 server has the following boards installed in the chassis:
■ Motherboard – The motherboard includes slots for up to four CMP modules and
four memory modules, memory control subsystem, up to eight PCIe expansion
slots, and a service processor slot. The motherboard also contains a top cover
safety interlock (“kill”) switch.
■ CMP module – Each CMP module contains an UltraSPARC T2 Plus chip, slots for
four FB-DIMMs, and associated DC-DC converters.
1
■ Memory module – A memory module containing slots for an additional 12
FB-DIMMs is associated with each CMP module.
■ Service processor – The service processor (ILOM) board controls the server
power and monitors server power and environmental events. The service
processor draws power from the server’s 3.3V standby supply rail, which is
available whenever the system is receiving main input power, even when the
system is turned off.
A removable IDPROM contains MAC addresses, host ID, and ILOM and
OpenBoot PROM configuration data. When replacing the service processor, the
IDPROM can be transferred to a new board to retain system configuration data.
■ Power supply backplane – This board distributes main 12V power from the
power supplies to the rest of the system. The power supply backplane is
connected to the motherboard and the disk drive backplane via a flex cable. High
voltage power is provided to the motherboard via a bus bar assembly.
■ Hard drive backplane – This board includes the connectors for up to four hard
drives. It is connected to the motherboard via a flex cable assembly.
Each drive has its own Power/Activity, Fault, and Ready-to-Remove LEDs.
■ Front control panel – This board connects directly to the motherboard, and serves
as the interconnect for the front I/O board. It contains the front panel LEDs and
the Power button.
■ Front I/O board – This board connects to the front control panel interconnect. It
contains two USB ports.
■ Flex cable assembly – The flex cable assembly serves as the interconnect between
the power supply backplane, motherboard, hard drive backplane, and DVD-ROM
drive.
■ Power supply backplane I2C cable – This cable transmits power supply status to
the motherboard.
Related Information
■ SPARC Enterprise T5440 Server Site Planning Guide.
■ “Managing Faults” on page 9
■ “Servicing Customer-Replaceable Units” on page 71
■ “Servicing Field-Replaceable Units” on page 115
FIGURE: Front Panel Features on page 3 shows front panel features on the SPARC
Enterprise T5440 server. For a detailed description of front panel controls and LEDs,
see “Front Panel LEDs” on page 4.
Figure Legend
Related Information
■ “Front Panel LEDs” on page 4
Locator LED The Locator LED enables you to find a particular system. The LED is activated
and button using one of the following methods:
(white) • The ALOM CMT command setlocator on.
• The ILOM command set /SYS/LOCATE value=Fast_Blink
• Manually press the Locator button to toggle the Locator LED on or off.
This LED provides the following indications:
• Off – Normal operating state.
• Fast blink – System received a signal as a result of one of the methods
previously mentioned, indicating that it is active.
Service If on, indicates that service is required. POST and ILOM are two diagnostics
Required LED tools that can detect a fault or failure resulting in this indication.
(amber) The ILOM show faulty command provides details about any faults that cause
this indicator to light.
Under some fault conditions, individual component fault LEDs are lit in
addition to the system Service Required LED.
Power OK Provides the following indications:
LED • Off – Indicates that the system is not running in its normal state. System
(green) power might be off. The service processor might be running.
• Steady on – Indicates that the system is powered on and is running in its
normal operating state. No service actions are required.
• Fast blink – Indicates the system is running at a minimum level in standby
and is ready to be quickly returned to full function. The service processor is
running.
• Slow blink – Indicates that a normal transitory activity is taking place. Slow
blinking could indicate that the system diagnostics are running, or that the
system is booting.
Power button The recessed Power button toggles the system on or off.
• If the system is powered off, press once to power on.
• If the system is powered on, press once to initiate a graceful system shutdown.
• If the system is powered on, press and hold for 4 seconds to initiate an
emergency shutdown.
For more information about powering on and powering off the system, see the
SPARC Enterprise T5440 Server Administration Guide.
Fan Fault LED TOP Provides the following operational fan indications:
(amber) FAN • Off – Indicates a steady state, no service action is required.
• Steady on – Indicates that a fan failure event has been acknowledged and a
service action is required on at least one of the fan modules.
Power Supply REAR Provides the following operational PSU indications:
Fault LED PS • Off – Indicates a steady state, no service action is required.
(amber) • Steady on – Indicates that a power supply failure event has been
acknowledged and a service action is required on at least one PSU.
Overtemp LED Provides the following operational temperature indications:
(amber) • Off – Indicates a steady state, no service action is required.
• Steady on – Indicates that a temperature failure event has been acknowledged
and a service action is required.
Related Information
■ “Front Panel Diagram” on page 3
■ “Rear Panel LEDs” on page 7
■ “Detecting Faults Using LEDs” on page 30
FIGURE: Rear Panel Features on page 6 shows rear panel features on the SPARC
Enterprise T5440 server. For more detailed information about ports and their uses,
see the SPARC Enterprise T5440 Server Installation and Setup Guide. For a detailed
description of PCIe slots, see “PCIe Device Identifiers” on page 96.
Figure Legend
1 Power supplies
2 Serial port
3 Serial management port
4 System status LEDs
5 USB ports
6 Network management port
7 Gigabit Ethernet ports
Related Information
■ “Front Panel Diagram” on page 3
■ “Rear Panel LEDs” on page 7
■ “Ethernet Port LEDs” on page 8
■ “Detecting Faults Using LEDs” on page 30
Locator LED The Locator LED enables you to find a particular system. The LED is
and button activated using one of the following methods:
(white) • The ALOM CMT command setlocator on.
• The ILOM command set /SYS/LOCATE value=Fast_Blink
• Manually press the Locator button to toggle the Locator LED on or off.
This LED provides the following indications:
• Off – Normal operating state.
• Fast blink – System received a signal as a result of one of the methods
previously mentioned, indicating that it is active.
Service If on, indicates that service is required. POST and ILOM are two diagnostics
Required LED tools that can detect a fault or failure resulting in this indication.
(amber) The ILOM show faulty command provides details about any faults that
cause this indicator to light.
Under some fault conditions, individual component fault LEDs are lit in
addition to the system Service Required LED.
Power OK LED Provides the following indications:
(green) • Off – Indicates that the system is not running in its normal state. System
power might be off. The service processor might be running.
• Steady on – Indicates that the system is powered on and is running in its
normal operating state. No service actions are required.
• Fast blink – Indicates the system is running at a minimum level in standby
and is ready to be quickly returned to full function. The service processor is
running.
• Slow blink – Indicates that a normal transitory activity is taking place. Slow
blinking could indicate the system diagnostics are running, or that the
system is booting.
Related Information
■ “Rear Panel Diagram” on page 5
■ “Ethernet Port LEDs” on page 8
■ “Detecting Faults Using LEDs” on page 30
Related Information
■ “Rear Panel Diagram” on page 5
■ “Rear Panel LEDs” on page 7
■ “Detecting Faults Using LEDs” on page 30
These topics describe the diagnostics tools that are available for monitoring and
troubleshooting the server.
Topic Links
9
■ LEDs – Provide a quick visual notification of the status of the server and of some
of the FRUs.
■ ILOM firmware – This system firmware runs on the service processor. In addition
to providing the interface between the hardware and OS, ILOM also tracks and
reports the health of key server components. ILOM works closely with POST and
Solaris Operating System (Solaris OS) Predictive Self-Healing technology to keep
the system up and running even when there is a faulty component.
■ Power-on self-test (POST) – POST performs diagnostics on system components
upon system reset to ensure the integrity of those components. POST is
configurable and works with ILOM to take faulty components offline if needed.
■ Solaris OS Predictive Self-Healing (PSH) – This technology continuously
monitors the health of the processor and memory, and works with ILOM to take a
faulty component offline if needed. The Predictive Self-Healing technology
enables systems to accurately predict component failures and mitigate many
serious problems before they occur.
■ Log files and console messages – Solaris OS log files and ILOM system event log
can be accessed and displayed on the device of your choice.
■ SunVTS software – The SunVTS software exercises the system, provides
hardware validation, and discloses possible faulty components with
recommendations for repair.
The LEDs, ILOM, Solaris OS PSH, and many of the log files and console messages
are integrated. For example, a fault detected by the Solaris software displays the
fault, logs it, and passes information to ILOM where it is logged. Depending on the
fault, one or more LEDs might be illuminated.
Related Information
■ “Diagnostic Flowchart” on page 11
■ SPARC Enterprise T5440 Server Installation and Setup Guide
■ SPARC Enterprise T5440 Server Administration Guide
Managing Faults 11
TABLE: Diagnostic Flowchart Actions
Action
No. Diagnostic Action Resulting Action For more information
1. Check Power OK The Power OK LED is located on the front and rear “Detecting Faults” on
and AC Present of the chassis. page 30
LEDs on the server. The AC Present LED is located on the rear of the
server on each power supply.
If these LEDs are not on, check the power source
and power connections to the server.
2. Run the ILOM The show faulty command displays the following “Detect Faults Using the
show faulty kinds of faults: ILOM show faulty
command to check • Environmental faults Command” on page 33
for faults. • External I/O Expansion Unit faults
• Solaris Predictive Self-Healing (PSH) detected
faults
• POST-detected faults
Faulty FRUs are identified in fault messages using
the FRU name.
Note - If the ILOM show faulty output includes
an error string such as Ext sensor or Ext FRU, it
indicates a fault in the External I/O Expansion Unit.
3. Check the Solaris The Solaris log files and the ILOM system event log “Detecting Faults Using
log files and ILOM record system events, and provide information Solaris OS Files and
system event log about faults. Commands” on page 35
for fault • Browse the ILOM system event log for major or
information. critical events. Some problems are logged in the
event log but not added to the show faulty list
• If system messages indicate a faulty device,
replace the FRU.
• To obtain more diagnostic information, go to
Action No. 4
4. Run SunVTS SunVTS is an application you can run to exercise “Detecting Faults Using
software. and diagnose FRUs. To run SunVTS, the server must SunVTS Software” on
be running the Solaris OS. page 38
• If SunVTS reports a faulty device, replace the
FRU.
• If SunVTS does not report a faulty device, go to
Action No. 5.
Action
No. Diagnostic Action Resulting Action For more information
5. Run POST. POST performs basic tests of the server components “Detecting Faults Using
and reports faulty FRUs. POST” on page 42
Managing Faults 13
TABLE: Diagnostic Flowchart Actions (Continued)
Action
No. Diagnostic Action Resulting Action For more information
8. Determine if the If the fault displayed included a uuid and “Identifying Faults
fault was detected sunw-msg-id property, the fault was detected by the Detected by PSH” on
by PSH. Solaris Predictive Self-Healing software. page 44
If the fault is a PSH-detected fault, refer to the PSH
Knowledge Article web site for additional “Clear Faults Detected by
information. The Knowledge Article for the fault is PSH” on page 49
located at the following link:
(http://www.sun.com/msg/)message-ID
where message-ID is the value of the sunw-msg-id
property displayed by the show faulty command.
After the FRU is replaced, perform the procedure to
clear PSH-detected faults.
9. Determine if the POST performs basic tests of the server components “POST Fault Management
fault was detected and reports faulty FRUs. When POST detects a Overview” on page 19
by POST. faulty FRU, it logs the fault and if possible, takes the
FRU offline. POST detected FRUs display the “Clear Faults Detected
following text in the fault message: During POST” on page 48
Forced fail reason
In a POST fault message, reason is the name of the
power-on routine that detected the failure.
10. Contact technical The majority of hardware faults are detected by the “Obtain the Chassis Serial
support. server’s diagnostics. In rare cases a problem might Number” on page 62
require additional troubleshooting. If you are unable
to determine the cause of the problem, contact your
service representative for support.
Related Information
■ “Server Diagnostics Overview” on page 9
■ SPARC Enterprise T5440 Server Administration Guide
Note – Multiple service processor accounts can be active concurrently. A user can be
logged in under one account using the ILOM shell, and another account using the
ALOM CMT shell.
Related Information
■ “Diagnostic Flowchart” on page 11
■ SPARC Enterprise T5440 Server Installation and Setup Guide
■ SPARC Enterprise T5440 Server Administration Guide
■ Integrated Lights Out Manager 3.0 Supplement for SPARC Enterprise T5440 Server
ILOM Overview
The Integrated Lights Out Manager (ILOM) firmware runs on the service processor
in the server, enabling you to remotely manage and administer your server.
ILOM enables you to remotely run diagnostics such as power-on self-test (POST),
that would otherwise require physical proximity to the server’s serial port. You can
also configure ILOM to send email alerts of hardware failures, hardware warnings,
and other events related to the server or to ILOM.
The service processor runs independently of the server, using the server’s standby
power. Therefore, ILOM firmware and software continue to function when the server
OS goes offline or when the server is powered off.
Note – Refer to the Integrated Lights Out Manager 3.0 Concepts Guide for
comprehensive ILOM information.
Faults detected by ILOM, POST, the Solaris Predictive Self-Healing (PSH) technology,
and the External I/O Expansion Unit (if attached) are forwarded to ILOM for fault
handling (FIGURE: ILOM Fault Management on page 16).
In the event of a system fault, ILOM ensures that the Service Required LED is lit,
FRUID PROMs are updated, the fault is logged, and alerts are displayed. Faulty
FRUs are identified in fault messages using the FRU name.
Managing Faults 15
FIGURE: ILOM Fault Management
The service processor can detect when a fault is no longer present and clears the fault
in several ways:
■ Fault recovery – The system automatically detects that the fault condition is no
longer present. The service processor extinguishes the Service Required LED and
updates the FRU’s PROM, indicating that the fault is no longer present.
■ Fault repair – The fault has been repaired by human intervention. In most cases,
the service processor detects the repair and extinguishes the Service Required
LED. If the service processor does not perform these actions, you must perform
these tasks manually by setting the ILOM component_state or fault_state of the
faulted component.
The service processor can detect the removal of a FRU, in many cases even if the FRU
is removed while the service processor is powered off (for example, if the system
power cables are unplugged during service procedures). This function enables ILOM
to know that a fault, diagnosed to a specific FRU, has been repaired.
The Solaris Predictive Self-Healing technology does not monitor the hard drive for
faults. As a result, the service processor does not recognize hard drive faults, and will
not light the fault LEDs on either the chassis or the hard drive itself. Use the Solaris
message files to view hard drive faults.
Related Information
■ “Diagnostic Flowchart” on page 11
The service processor sends alerts to all ALOM CMT users that are logged in, sends
the alert through email to a configured email address, and writes the event to the
ILOM event log. The ILOM event log is also available using the ALOM CMT
compatibility shell.
See the Integrated Lights Out Manager 3.0 Supplement for SPARC Enterprise T5440 Server
for comparisons between the ILOM CLI and the ALOM CMT compatibility CLI, and
for instructions for adding an ALOM-CMT account.
Related Information
■ “Diagnostic Flowchart” on page 11
■ “Detecting Faults Using LEDs” on page 30
■ “ILOM-to-ALOM CMT Command Reference” on page 53
■ SPARC Enterprise T5440 Server Installation and Setup Guide
■ SPARC Enterprise T5440 Server Administration Guide
■ Integrated Lights Out Manager 3.0 Supplement for SPARC Enterprise T5440 Server
Managing Faults 17
The Solaris OS uses the Fault Manager daemon, fmd (1M), which starts at boot time
and runs in the background to monitor the system. If a component generates an
error, the daemon handles the error by correlating the error with data from previous
errors and other related information to diagnose the problem. Once diagnosed, the
Fault Manager daemon assigns the problem a Universal Unique Identifier (UUID)
that distinguishes the problem across any set of systems. When possible, the Fault
Manager daemon initiates steps to self-heal the failed component and take the
component offline. The daemon also logs the fault to the syslogd daemon and
provides a fault notification with a message ID (MSGID). You can use the message ID
to get additional information about the problem from the knowledge article database.
The PSH console message provides the following information about each detected
fault:
■ Type
■ Severity
■ Description
■ Automated response
■ Impact
■ Suggested action for system administrator
Related Information
■ “Diagnostic Flowchart” on page 11
■ “Identifying Faults Detected by PSH” on page 44
■ SPARC Enterprise T5440 Server Administration Guide
SunVTS Overview
Sometimes a server exhibits a problem that cannot be isolated definitively to a
particular hardware or software component. In such cases, it might be useful to run a
diagnostic tool that stresses the system by continuously running a comprehensive
battery of tests. SunVTS software is provided for this purpose.
Related Information
■ “Diagnostic Flowchart” on page 11
POST tests critical hardware components to verify functionality before the system
boots and accesses software. If POST detects a faulty component, the component is
disabled automatically, preventing faulty hardware from potentially harming any
software. If the system is capable of running without the disabled component, the
system will boot when POST is complete. For example, if one of the processor cores
is deemed faulty by POST, the core will be disabled. The system will boot and run
using the remaining cores.
You can use POST as an initial diagnostic tool for the system hardware. In this case,
configure POST to run in maximum mode (diag_mode=service, setkeyswitch=
diag, diag_level=max) for thorough test coverage and verbose output.
Managing Faults 19
POST Fault Management Flowchart
FIGURE: Flowchart of Variables for POST Configuration
Related Information
■ “Diagnostic Flowchart” on page 11
The server uses advanced ECC technology that corrects up to 4-bits in error on nibble
boundaries, as long as the bits are all in the same DRAM. On 4 GB FB-DIMMs, if a
DRAM fails, the DIMM continues to function.
If you suspect that the server has a memory problem, follow the flowchart (see
FIGURE: Diagnostic Flowchart on page 11). Run the ILOM show faulty command.
The show faulty command lists memory faults and lists the specific FB-DIMMs
that are associated with the fault.
Managing Faults 21
Note – You can use the FB-DIMM DIAG buttons on the CMP module and memory
module to identify faulty FB-DIMMs. See “FB-DIMM Fault Button Locations” on
page 113.
Once you identify which FB-DIMMs you want to replace, see “Servicing FB-DIMMs”
on page 104 for FB-DIMM removal and replacement instructions. You must perform
the instructions in that section to clear the faults and enable the replaced FB-DIMMs.
Related Information
■ “Controlling How POST Runs” on page 26
■ “Displaying FRU Information with ILOM” on page 24
■ “Detecting Faults” on page 30
■ “Servicing FB-DIMMs” on page 104
Topic Links
Connect an ASCII terminal directly to the SPARC Enterprise T5440 Server Installation
serial management port. and Setup Guide
Use the ssh command to connect to service SPARC Enterprise T5440 Server Installation
processor through an Ethernet connection and Setup Guide
on the network management port.
Switch from the system console to the “Switch From the System Console to the
service processor Service Processor (ILOM or ALOM CMT
Compatibility Shell)” on page 23
Switch from the service processor to the “Switch From ILOM to the System Console”
system console on page 23
“Switch From the ALOM CMT
Compatibility Shell to the System Console”
on page 24
Related Information
■ “Diagnostic Flowchart” on page 11
# #.
->
sc> console
#
Managing Faults 23
Displaying FRU Information with ILOM
CODE EXAMPLE: Output of the show components Command With No Disabled Components
-> show components
Target | Property | Value
--------------------+------------------------+-------------------------------
/SYS/MB/PCIE0 | component_state | Enabled
/SYS/MB/PCIE3/ | component_state | Enabled
/SYS/MB/PCIE1/ | component_state | Enabled
/SYS/MB/PCIE4/ | component_state | Enabled
/SYS/MB/PCIE2/ | component_state | Enabled
/SYS/MB/PCIE5/ | component_state | Enabled
/SYS/MB/NET0 | component_state | Enabled
/SYS/MB/NET1 | component_state | Enabled
/SYS/MB/NET2 | component_state | Enabled
/SYS/MB/NET3 | component_state | Enabled
/SYS/MB/PCIE | component_state | Enabled
CODE EXAMPLE: Output of the show components Command Showing Disabled Components
-> show components
Target | Property | Value
--------------------+------------------------+-------------------------------
/SYS/MB/PCIE0/ | component_state | Enabled
/SYS/MB/PCIE3/ | component_state | Disabled
/SYS/MB/PCIE1/ | component_state | Enabled
/SYS/MB/PCIE4/ | component_state | Enabled
/SYS/MB/CPU0/CMP0/BR1/CH0/D0
Targets:
R0
R1
SEEPROM
SERVICE
PRSNT
T_AMB
Properties:
type = DIMM
component_state = Enabled
fru_name = 1024MB DDR2 SDRAM FB-DIMM 333 (PC2 5300)
fru_description = FBDIMM 1024 Mbyte
fru_manufacturer = Micron Technology
fru_version = FFFFFF
fru_part_number = 18HF12872FD667D6D4
fru_serial_number = d81813ce
fault_state = OK
clear_fault_action = (none)
Managing Faults 25
CODE EXAMPLE: show Command Output (Continued)
Commands:
cd
show
The keyswitch_state parameter, when set to diag, overrides all the other ILOM
POST variables.
TABLE: ILOM Parameters Used for POST Configuration on page 26 lists the ILOM
variables used to configure POST. FIGURE: Flowchart of Variables for POST
Configuration on page 20 shows how the variables work together.
keyswitch_mode normal The system can power on and run POST (based
on the other parameter settings). For details see
FIGURE: Flowchart of Variables for POST
Configuration on page 20. This parameter
overrides all other commands.
diag The system runs POST based on predetermined
settings.
stby The system cannot power on.
locked The system can power on and run POST, but no
flash updates can be made.
diag_mode off POST does not run.
normal Runs POST according to diag_level value.
service Runs POST with preset values for diag_level
and diag_verbosity.
diag_level max If diag_mode = normal, runs all the minimum
tests plus extensive processor and memory tests.
min If diag_mode = normal, runs minimum set of
tests.
Related Information
■ “Diagnostic Flowchart” on page 11
■ “Change POST Parameters” on page 28
■ “Run POST in Maximum Mode” on page 28
■ “Detecting Faults Using POST” on page 42
■ “Clear Faults Detected During POST” on page 48
■ SPARC Enterprise T5440 Server Installation and Setup Guide
■ SPARC Enterprise T5440 Server Administration Guide
Managing Faults 27
2. Use the ILOM commands to change the POST parameters.
Refer to TABLE: System Faults and Fault LED States on page 31 for a list of ILOM
POST parameters and their values.
The set /SYS keyswitch_state command sets the virtual keyswitch
parameter. For example:
To change individual POST parameters, you must first set the keyswitch_state
parameter to normal. For example:
2. Set the virtual keyswitch to diag so that POST will run in service mode.
Note – The server takes about one minute to power off. Use the show /HOST
command to determine when the host has been powered off. The console will display
status=Powered Off
...
2007-12-19 22:01:17.810 0:0:0>INFO: STATUS: Running RGMII 1G
BCM5466R PHY level Loopback Test
2007-12-19 22:01:22.534 0:0:0>End : Neptune 1G Loopback Test -
Port 2
2007-12-19 22:01:22.553 0:0:0>
2007-12-19 22:01:22.542 0:0:0>Begin: Neptune 1G Loopback Test -
Port 3
2007-12-19 22:01:22.556 0:0:0>INFO: STATUS: Running BMAC level
Loopback Test
2007-12-19 22:01:32.004 0:0:0>End : Neptune 1G Loopback Test -
Port 3
2007-12-19 22:01:27.271 0:0:0>
T5440, No Keyboard
Enter #. to return to ALOM.
2007-12-19 22:01:32.012 0:0:0>INFO:
2007-12-19 22:01:27.274 0:0:0>INFO: STATUS: Running RGMII 1G
BCM5466R PHY level Loopback Test
OpenBoot ..., 7968 MB memory available, Serial #75916434.
2007-12-19 22:01:32.019 0:0:0>POST Passed all devices.
[stacie obp #0]
2007-12-19 22:01:32.028 0:0:0>POST:Return to VBSC.
Ethernet address 0:14:4f:86:64:92, Host ID: xxxxx
2007-12-19 22:01:32.036 0:0:0>Master set ACK for vbsc runpost
command and spin...
{0} ok
Managing Faults 29
Detecting Faults
This section describes the different methods you can use to identify system faults in
the SPARC Enterprise T5440.
Task Topic
Use front panel and back panel LEDs to “Detecting Faults Using LEDs” on page 30
identify system faults.
Use the ILOM show faulty command to “Detecting Faults Using ILOM show faulty
detect faults. Command” on page 32
Use Solaris OS files and commands to “Detecting Faults Using Solaris OS Files and
detect faults. Commands” on page 35
Use the ILOM event log to detect faults. “Detecting Faults Using the ILOM Event Log”
on page 37
Use POST to identify faults. “Detecting Faults Using POST” on page 42
Use Solaris Predictive Self-Healing (PSH) “Identifying Faults Detected by PSH” on
to identify faults. page 44
These LEDs provide a quick visual check of the state of the system.
Power supply • Service Required LED (front and rear • “Front Panel LEDs” on page 4
panel) • “Rear Panel LEDs” on page 7
• Front panel Power Supply Fault LED • “Power Supply LED” on page 91
• Individual power supply Fault LED • “Servicing Power Supplies” on page 85
Fan tray • Service Required LED (front and rear • “Front Panel LEDs” on page 4
panel) • “Rear Panel LEDs” on page 7
• Front panel Fan Fault LED • “Fan Tray Fault LED” on page 84
• Individual fan tray Fault LED • “Servicing Fan Trays” on page 81
• Overtemp LED (if overtemp condition
exists)
Hard drive • Service Required LED (front and rear See these sections:
panel) • “Front Panel LEDs” on page 4
• Individual hard drive Fault LED • “Rear Panel LEDs” on page 7
• “Hard Drive LEDs” on page 80
• “Servicing Hard Drives” on page 72
Managing Faults 31
TABLE: System Faults and Fault LED States (Continued)
CMP module • Service Required LED (front and rear A lit CMP module or memory module fault LED
or memory panel) might indicate a problem with an FB-DIMM
module • CMP Module Fault LED or Memory installed on the CMP module, or a problem with
Module Fault LED the CMP module itself.
See these sections:
• “Front Panel LEDs” on page 4
• “Rear Panel LEDs” on page 7
• “Servicing CMP/Memory Modules” on page 98
• “Servicing FB-DIMMs” on page 104
FB-DIMM • Service Required LED (front and rear See these sections:
panel) • “Front Panel LEDs” on page 4
• CMP Module Fault LED or Memory • “Rear Panel LEDs” on page 7
Module Fault LED • “Servicing FB-DIMMs” on page 104
• FB-DIMM Fault LED (CMP and • “FB-DIMM Fault Button Locations” on page 113
memory modules) (when FB-DIMM
Locate button is pressed)
Other • Service Required LED (front and rear Not all components have an individual component
components panel) Fault LED. If the Service Required LED is lit, use
the show faulty command to obtain additional
information about the component affected. See
these sections:
• “Front Panel LEDs” on page 4
• “Rear Panel LEDs” on page 7
Related Information
■ “Diagnostic Flowchart” on page 11
■ “Detecting Faults Using LEDs” on page 30
■ “ILOM-to-ALOM CMT Command Reference” on page 53
■ SPARC Enterprise T5440 Server Installation and Setup Guide
■ SPARC Enterprise T5440 Server Administration Guide
■ Integrated Lights Out Manager 3.0 Supplement for SPARC Enterprise T5440 Server
Related Information
■ “Diagnostic Flowchart” on page 11
■ “Detecting Faults Using LEDs” on page 30
■ “ILOM-to-ALOM CMT Command Reference” on page 53
■ SPARC Enterprise T5440 Server Installation and Setup Guide
■ SPARC Enterprise T5440 Server Administration Guide
■ Integrated Lights Out Manager 3.0 Supplement for SPARC Enterprise T5440 Server
-----------------------------------------------------------------------------
Managing Faults 33
■ Example of the show faulty command displaying an environmental fault:
■ Example showing a fault that was detected by the PSH technology. These kinds
of faults are distinguished from other kinds of faults by the presence of a
sunw-msg-id and by a UUID.
■ Example showing a fault in the External I/O Expansion Unit. These faults can
be identified by the text string Ext FRU or Ext sensor at the beginning of the
fault description.
The text string Ext FRU indicates that the specified FRU is faulty and should
be replaced. The text string Ext sensor indicates that the specified FRU
contains the sensor that detected the problem. In this case, the specified FRU
may not be faulty. Contact service support to isolate the problem.
If POST, ILOM, or the Solaris PSH features do not indicate the source of a fault, check
the message buffer and log files for notifications for faults. Hard drive faults are
usually captured by the Solaris message files.
Use the dmesg command to view the most recent system message. To view the
system messages log file, view the contents of the /var/adm/messages file.
Managing Faults 35
Related Information
■ “Diagnostic Flowchart” on page 11
■ “Detecting Faults Using LEDs” on page 30
■ “ILOM-to-ALOM CMT Command Reference” on page 53
■ SPARC Enterprise T5440 Server Installation and Setup Guide
■ SPARC Enterprise T5440 Server Administration Guide
■ Integrated Lights Out Manager 3.0 Supplement for SPARC Enterprise T5440 Server
# dmesg
The dmesg command displays the most recent messages generated by the system.
The /var/adm directory contains several message files. The most recent messages
are in the /var/adm/messages file. After a period of time (usually every week), a
new messages file is automatically created. The original contents of the messages
file are rotated to a file named messages.1. Over a period of time, the messages are
further rotated to messages.2 and messages.3, and then deleted.
1. Log in as superuser.
# more /var/adm/messages
3. If you want to view all logged messages, type the following command:
# more /var/adm/messages*
Related Information
■ “Diagnostic Flowchart” on page 11
■ “View ILOM Event Log” on page 37
■ “ILOM-to-ALOM CMT Command Reference” on page 53
■ SPARC Enterprise T5440 Server Installation and Setup Guide
■ SPARC Enterprise T5440 Server Administration Guide
■ Integrated Lights Out Manager 3.0 Supplement for SPARC Enterprise T5440 Server
Note – The ILOM event log can also be viewed through the ILOM BUI or the ALOM
CMT CLI.
If a “major” or “critical” event is found that was not expected and not included
under ILOM show faulty than it may indicate a system fault. The following is an
example of unexpected major events in the log.
Managing Faults 37
1623 Fri Feb 15 18:57:27 2008 Chassis Log major
Feb 15 18:57:27 ERROR: Degraded configuration: system operating at
reduced capacity
The Solaris OS must be running in order to use the SunVTS software. You also must
ensure that the SunVTS validation test software is installed on your system.
This section describes the tasks necessary to use SunVTS software to exercise your
server.
Related Information
■ “Diagnostic Flowchart” on page 11
■ “Verify Installation of SunVTS Software” on page 38
■ “Start the SunVTS Browser Environment” on page 39
■ “SunVTS Software Packages” on page 41
■ “Useful SunVTS Tests” on page 42
■ SPARC Enterprise T5440 Server Administration Guide
■ SunVTS 7.0 User’s Guide
Note – The SunVTS 7.0 software, and future compatible versions, are supported on
the server.
1. Check for the presence of SunVTS packages using the pkginfo command.
2. If the SunVTS software is not installed, you can obtain the installation
packages from the following places:
■ Solaris Operating System DVDs
■ Download from the web. Refer to the Preface for information on how to access
the web site.
Note – SunVTS software can be run in several modes. You must perform this
procedure using the default mode.
# cd /usr/sunvts/bin
# ./startsunvts
Managing Faults 39
3. Start the SunVTS browser environment from a web browser on the client
system. Type https://server-name:6789.
The SunVTS browser environment is displayed (FIGURE: SunVTS Browser
Environment (Test Group Screen) on page 40).
Note – TABLE: Useful SunVTS Tests on page 42 lists test categories that are
especially useful to run on this server.
Tip – Use the System Excerciser – High Stress Mode to test system operations. Use
the Component Stress – High setting for the highest stress possible.
Package Description
SUNWvts Test development library APIs and SunVTS kernel. You must install
this package to run the SunVTS software.
SUNWvtsmn Man pages for the SunVTS utilities, including the command-line
utility.
SUNWvtsr SunVTS framework (root)
SUNWvtss SunVTS browser user interface (BUI) components required on the
server system.
SUNWvtsts SunVTS test binaries
Related Information
■ “Diagnostic Flowchart” on page 11
■ “Useful SunVTS Tests” on page 42
■ SPARC Enterprise T5440 Server Installation and Setup Guide
■ SPARC Enterprise T5440 Server Administration Guide
Managing Faults 41
Useful SunVTS Tests
TABLE: Useful SunVTS Tests on page 42 describes the SunVTS tests which are useful
for diagnosing issues with the SPARC Enterprise T5440 server.
Related Information
■ “Diagnostic Flowchart” on page 11
■ “SunVTS Software Packages” on page 41
■ SPARC Enterprise T5440 Server Installation and Setup Guide
■ SPARC Enterprise T5440 Server Administration Guide
In CODE EXAMPLE: POST Error Message on page 43, POST reports a memory error
at FB-DIMM location /SYS/MB/CPU0/CMP0/BR1/CH0/D0. The error was detected
by POST running on core 7, strand 2.
Note – You can use ASR commands to display and control disabled components. See
“Disabling Faulty Components” on page 50.
Related Information
■ “Diagnostic Flowchart” on page 11
Managing Faults 43
■ “POST Fault Management Overview” on page 19
■ “POST Fault Management Flowchart” on page 20
■ SPARC Enterprise T5440 Server Administration Guide
Faults detected by the Solaris PSH facility are also reported through service processor
alerts.
Note – You can configure ILOM to generate SNMP traps or e-mail alerts when a
fault is detected by Solaris PSH. You can also configure the ALOM CMT
compatibility shell to display Solaris PSH alerts. See the Integrated Lights Out Manager
3.0 Concepts Guide.
CODE EXAMPLE: ALOM CMT Alert of PSH Diagnosed Fault on page 44 depicts an
ALOM CMT alert of the same fault reported by Solaris PSH in CODE EXAMPLE:
Console Message Showing Fault Detected by PSH on page 44.
The ILOM show faulty command provides summary information about the fault.
See “Detect Faults Using the ILOM show faulty Command” on page 33 for more
information about the show faulty command.
Related Information
■ “Diagnostic Flowchart” on page 11
■ “Solaris Predictive Self-Healing Overview” on page 17
■ “ILOM-to-ALOM CMT Command Reference” on page 53
■ SPARC Enterprise T5440 Server Administration Guide
■ Integrated Lights Out Manager 3.0 Supplement for SPARC Enterprise T5440 Server
Note – Do not use fmdump to verify that a FRU replacement has cleared a fault,
because the output of fmdump is the same after the FRU has been replaced. Use the
fmadm faulty command to verify that the fault has cleared. See “Clear Faults
Detected by PSH” on page 49.
1. Check the event log using the fmdump command with -v for verbose output.
In CODE EXAMPLE: Output from the fmdump -v Command on page 46, a fault
is displayed, indicating the following details.
■ Date and time of the fault (Jul 31 12:47:42.2007)
■ Universal Unique Identifier (UUID). The UUID is unique for every fault
(fd940ac2-d21e-c94a-f258-f8a9bb69d05b)
■ Message identifier, which can be used to obtain additional fault information
(SUN4V-8000-JA)
■ Faulted FRU. The information provided in the example includes the part
number of the FRU (part=541215101) and the serial number of the FRU
(serial=101083). The Location field provides the name of the FRU. In
CODE EXAMPLE: Output from the fmdump -v Command on page 46 the FRU
name is MB, meaning the motherboard.
Managing Faults 45
Note – fmdump displays the PSH event log. Entries remain in the log after the fault
has been repaired.
2. Use the message ID to obtain more information about this type of fault.
b. Obtain the message ID from the console output or the ILOM show faulty
command.
Type
Fault
Severity
Major
Description
The number of errors associated with this CPU has exceeded
acceptable levels.
Automated Response
The fault manager will attempt to remove the affected CPU from
service.
Impact
System performance may be affected.
Details
The Message ID: SUN4V-8000-JA indicates diagnosis has
determined that a CPU is faulty. The Solaris fault manager arranged
an automated attempt to disable this CPU....
Clearing Faults
This section describes how to clear faults.
Task Topic
Clear faults detected during POST. “Clear Faults Detected During POST” on
page 48
Clear faults detected by PSH. “Clear Faults Detected by PSH” on page 49
Clear faults detected in the Internal I/O “Clear Faults Detected in the External I/O
Expansion Unit Expansion Unit” on page 50
Related Information
■ “Diagnostic Flowchart” on page 11
■ “POST Fault Management Overview” on page 19
■ “Solaris Predictive Self-Healing Overview” on page 17
■ SPARC Enterprise T5440 Server Administration Guide
■ Integrated Lights Out Manager 3.0 Supplement for SPARC Enterprise T5440 Server
■ External I/O Expansion Unit Installation and Service Manual for SPARC Enterprise
T5120/T5140/T5220/T5240/T5440 Servers
Managing Faults 47
▼ Clear Faults Detected During POST
In most cases, when POST detects a faulty component, POST logs the fault and
automatically takes the failed component out of operation by placing the component
in the ASR blacklist. See “Disabling Faulty Components” on page 50.
In most cases, the replacement of the faulty FRU is detected when the service
processor is reset or power cycled. In this case, the fault is automatically cleared from
the system. This procedure describes how to identify a POST-detected fault and, if
necessary, manually clear the fault.
1. After replacing a faulty FRU, at the ILOM prompt use the show faulty
command to identify POST-detected faults.
Faults detected by POST are distinguished from other kinds of faults by the text:
Forced fail. No UUID number is reported. Refer to CODE EXAMPLE: Fault
Detected by POST on page 48.
If no fault is reported, you do not need to do anything else. Do not perform the
subsequent steps.
2. Use the component_state property of the component to clear the fault and
remove the component from the ASR blacklist.
Use the FRU name that was reported in the fault in Step 1:
The fault is cleared and should not show up when you run the show faulty
command. Additionally, the Service Required LED is no longer on.
4. At the ILOM prompt, use the show faulty command to verify that no faults
are reported.
->
2. At the ILOM prompt, use the show faulty command to identify PSH-detected
faults.
■ If no fault is reported, you do not need to do anything else. Do not perform the
subsequent steps.
■ If a fault is reported, perform Step 3 and Step 4.
3. Use the clear_fault_action property of the FRU to clear the fault from the
service processor. For example:
Managing Faults 49
▼ Clear Faults Detected in the External I/O
Expansion Unit
For service processor detected faults in the External I/O Expansion Unit, the fault
must be manually cleared from ILOM show faulty after the problem has been
repaired.
Note – After the problem has been repaired, the fault will also be cleared from the
ILOM show faulty command by resetting the service processor.
The example below shows a problem detected in the External I/O Expansion Unit:
The database that contains the list of disabled components is referred to as the ASR
blacklist (asr-db).
In most cases, POST automatically disables a faulty component. After the cause of
the fault is repaired (FRU replacement, loose connector reseated, and so on), you
might need to remove the component from the ASR blacklist.
Note – For instructions on enabling or disabling ASR, see the SPARC Enterprise
T5440 Server Administration Guide.
The ASR commands (TABLE: ASR Commands on page 51) enable you to view and
manually add or remove components (asrkeys) from the ASR blacklist. You run
these commands from the ILOM -> prompt.
Command Description
Note – The asrkeys vary from system to system, depending on how many cores and
memory are present. Use the show components command to see the asrkeys on a
given system.
Related Information
■ “Diagnostic Flowchart” on page 11
■ “Detecting Faults” on page 30
■ SPARC Enterprise T5440 Server Administration Guide
Managing Faults 51
▼ Disable System Components
The component_state property disables a component by adding it to the ASR
blacklist.
Note – In the ILOM shell there is no notification when the system is actually
powered off. Powering off takes about a minute. Use the show /HOST command to
determine if the host has powered off.
Managing Faults 53
TABLE: Service-Related Commands (Continued)
show /SP/console/history consolehistory [-b lines|-e lines|-v] Displays the contents of the
[-g lines] [boot|run] system’s console buffer.
The following options enable you to
specify how the output is
displayed:
• -g lines specifies the number of
lines to display before pausing.
• -e lines displays n lines from the
end of the buffer.
• -b lines displays n lines from the
beginning of the buffer.
• -v displays the entire buffer.
• boot|run specifies the log to
display (run is the default log).
set bootmode value Enables control of the firmware
/HOST/bootmode/value[normal|re [normal|reset_nvram| during system initialization with
set_nvram|bootscript=string] bootscript=string] the following options:
• normal is the default boot mode.
• reset_nvram resets OpenBoot
PROM parameters to their
default values.
• bootscript=string enables the
passing of a string to the boot
command.
stop/SYS; start/SYS powercycle [-f] Performs a poweroff followed by
The -f option forces an immediate poweron.
poweroff. Otherwise the command
attempts a graceful shutdown.
stop/SYS poweroff [-y] [-f] Powers off the host server.
• -y enables you to skip the
confirmation question.
• -f forces an immediate
shutdown.
Managing Faults 55
TABLE: Service-Related Commands (Continued)
(No ILOM equivalent.) showfru [-g lines] [-s | -d] [FRU] Displays information about the
• -g lines specifies the number of FRUs in the server.
lines to display before pausing
the output to the screen.
• -s displays static information
about system FRUs (defaults to
all FRUs, unless one is specified).
• -d displays dynamic information
about system FRUs (defaults to
all FRUs, unless one is specified).
See “Display Individual
Component Information With the
ILOM show Command” on
page 25.
TABLE: ALOM CMT Parameters and POST Modes on page 56 shows typical
combinations of ALOM CMT variables and associated POST modes.
Related Information
■ “Diagnostic Flowchart” on page 11
■ “Detecting Faults Using LEDs” on page 30
■ “ILOM-to-ALOM CMT Command Reference” on page 53
■ SPARC Enterprise T5440 Server Administration Guide
■ Integrated Lights Out Manager 3.0 Supplement for SPARC Enterprise T5440 Server
Managing Faults 57
58 SPARC Enterprise T5440 Server Service Manual • July 2009
Preparing to Service the System
These topics describe how to prepare the SPARC Enterprise T5440 for servicing.
Topic Links
Related Information
■ “Managing Faults” on page 9
■ “Servicing Customer-Replaceable Units” on page 71
■ “Servicing Field-Replaceable Units” on page 115
■ “Returning the Server to Operation” on page 149
Safety Information
This section describes important safety information that you need to know prior to
removing or installing parts in the SPARC Enterprise T5440 server.
59
For your protection, observe the following safety precautions when setting up your
equipment:
■ Follow all cautions and instructions marked on the equipment and described in
the documentation shipped with your system.
■ Follow all cautions and instructions marked on the equipment and described in
the SPARC Enterprise T5440 Server Safety and Compliance Guide.
■ Ensure that the voltage and frequency of your power source match the voltage
and frequency inscribed on the equipment’s electrical rating label.
■ Follow the electrostatic discharge safety practices as described in this section.
Related Information
■ “Safety Symbols” on page 60
■ “Antistatic Wrist Strap” on page 61
■ “Antistatic Mat” on page 61
■ “Perform Electrostatic Discharge – Antistatic Prevention Measures” on page 69
Safety Symbols
Note the meanings of the following symbols that might appear in this document:
Caution – Hot surface. Avoid contact. Surfaces are hot and might cause personal
injury if touched.
Caution – Hazardous voltages are present. To reduce the risk of electric shock and
danger to personal health, follow the instructions.
Related Information
■ “Safety Information” on page 59
Caution – Circuit boards and hard drives contain electronic components that are
extremely sensitive to static electricity. Ordinary amounts of static electricity from
clothing or the work environment can destroy the components located on these
boards. Do not touch the components along their connector edges.
Caution – You must disconnect both power supplies before servicing any of the
components documented in this chapter.
Related Information
■ “Safety Information” on page 59
■ “Antistatic Wrist Strap” on page 61
■ “Antistatic Mat” on page 61
Note – An antistatic wrist strap is no longer included in the accessory kit for the
SPARC Enterprise T5440 servers. However, antistatic wrist straps are still included
with options.
Antistatic Mat
Place ESD-sensitive components such as motherboards, memory, and other PCBs on
an antistatic mat.
● The chassis serial number is located on a sticker that is on the front of the
server and another sticker on the side of the server.
/SYS
Targets:
SERVICE
LOCATE
ACT
PS_FAULT
TEMP_FAULT
Commands:
cd
reset
set
show
start
stop
Related Information
■ “Power Off From the Command Line” on page 64
■ “Power Off – Graceful Shutdown” on page 64
■ “Power Off – Emergency Shutdown” on page 65
2. Switch from the system console prompt to the service processor console prompt.
Type:
ok #.
->
->
Note – To perform an immediate shutdown, use the stop -force -script /SYS
command. Ensure that all data is saved before entering this command.
Caution – All applications and files will be closed abruptly without saving changes.
File system corruption might occur.
Caution – Because 3.3v standby power is always present in the system, you must
unplug the power cords before accessing any cold-serviceable components.
Related Information
■ “Front Panel Diagram” on page 3
■ “Rear Panel Diagram” on page 5
■ “Extend the Server to the Maintenance Position” on page 66
Once you have located the server, press the Locator LED and button to turn it off.
2. Verify that no cables will be damaged or will interfere when the server is
extended.
Although the cable management arm (CMA) that is supplied with the server is
hinged to accommodate extending the server, you should ensure that all cables
and cords are capable of extending.
3. From the front of the server, release the two slide release latches (FIGURE:
Extending the Server Into the Maintenance Position on page 66).
Squeeze the slide rail locks to release the slide rails.
Figure Legend
Related Information
■ “Front Panel Diagram” on page 3
■ “Rear Panel Diagram” on page 5
■ “Extend the Server to the Maintenance Position” on page 66
■ “Remove the Server From the Rack” on page 67
Figure Legend
4. From the front of the server, press inner rail release buttons and pull the server
forward until it is free of the rack rails.
Related Information
■ “Electrostatic Discharge Safety Measures” on page 61
■ “Perform Electrostatic Discharge – Antistatic Prevention Measures” on page 69
1. Loosen the two captive No. 2 Phillips screws at the rear edge of the top panel.
2. Slide the top cover to the rear about 0.5 inch (12.7 mm).
Caution – If the top cover is removed before the server is powered off, the server
will immediately disable the front panel Power button and shut down. After such an
event, you must replace the top cover and use the poweron command to power on
the server. See “Power On the Server” on page 153.
Topic Links
Related Information
■ “Servicing Field-Replaceable Units” on page 115
71
Hot-Pluggable and Hot-Swappable
Devices
Hot-pluggable devices are those devices that you can remove and install while the
server is running. However, you must perform administrative tasks before or after
installing the hardware (for example, mounting a hard drive). In the SPARC
Enterprise T5440 server, the following devices are hot-pluggable:
■ Hard drives
Hot-swappable devices are those devices that can be removed and installed while
the server is running without affecting the rest of the server’s capabilities. In the
SPARC Enterprise T5440 server, the following devices are hot-swappable:
■ Fan trays
■ Power supplies
Related Information
■ “Servicing Hard Drives” on page 72
■ “Servicing Fan Trays” on page 81
■ “Servicing Power Supplies” on page 85
■ “Server Components” on page 173
If your drive falls into one of these conditions, you must power off the server before
you replace the hard drive.
Related Information
■ “Identifying Server Components” on page 1
■ “Managing Faults” on page 9
■ “Powering Off the System” on page 63
■ “Hot-Pluggable and Hot-Swappable Devices” on page 72
■ “Hard Drive Device Identifiers” on page 79
■ “Hard Drive LEDs” on page 80
■ “Server Components” on page 173
Note – See “Hard Drive Device Identifiers” on page 79 for information about
identifying hard drives.
# cfgadm -al
This command should identify the Ap_id for the hard drive you wish to remove,
as in CODE EXAMPLE: Sample Ap_id Output on page 75.
4. On the drive you plan to remove, push the hard drive release button to open the
latch (FIGURE: Removing a Hard Drive on page 74).
Caution – The latch is not an ejector. Do not bend the latch too far. Doing so can
damage the latch.
Note – The server might have up to three blank panels covering unoccupied drive
slots.
3. Slide the drive into the drive slot until it is fully seated.
5. At the Solaris prompt, type the cfgadm -al command to list all drives in the
device tree, including any drives that are not configured. Type:
# cfgadm -al
This command should help you identify the Ap_id for the hard drive you
installed. For an output example refer to CODE EXAMPLE: Sample Ap_id Output
on page 77.
7. Wait until the blue Ready-to-Remove LED is no longer lit on the drive that you
installed.
8. At the Solaris prompt, type the cfgadm -al command to list all drives in the
device tree, including any drives that are not configured. Type:
# cfgadm -al
This command should identify the Ap_id for the hard drive that you installed.
The drive you installed should be is configured.
# iostat -E
Do the following :
2. Slide the drive into the drive slot until it is fully seated.
4. If you performed any additional service procedures, see “Power On the Server”
on page 153.
TABLE: Physical Drive Locations, FRU Names, and Default Drive Path Names
Note – Hard drive names in ILOM messages are displayed with the full FRU name,
such as /SYS/HDD0.
The front and rear panel Service Required LEDs are also lit if the system detects a
hard drive fault.
Related Information
■ “Hard Drive Device Identifiers” on page 79
Caution – Hazardous moving parts. Unless the power to the server is completely
shut down, the only service permitted in the fan compartment is the replacement of
the fan trays by trained personnel.
Related Information
■ “Identifying Server Components” on page 1
■ “Managing Faults” on page 9
■ “Powering Off the System” on page 63
■ “Hot-Pluggable and Hot-Swappable Devices” on page 72
■ “Fan Tray Device Identifiers” on page 84
■ “Fan Tray Fault LED” on page 84
■ “Server Components” on page 173
Do the following :
2. Press the fan tray latches toward the center of the fan tray and pull the fan tray
up and out of the system.
Next Steps
If you are replacing a faulty fan tray due to an overtemperature condition, monitor
the system to ensure proper cooling.
■ “Slide the Server Into the Rack” on page 151
■ If you performed any additional service procedures, see “Power On the Server” on
page 153.
Do the following :
● Press the fan tray latches toward the center of the fan tray and pull the fan tray
up and out of the system.
Next Steps
If you are replacing the fan trays after performing another service procedure,
complete these steps.
■ “Slide the Server Into the Rack” on page 151
■ “Power On the Server” on page 153
FT0 /SYS/MB/FT0
FT1 /SYS/MB/FT1
FT2 /SYS/MB/FT2
FT3 /SYS/MB/FT3
Related Information
■ “Managing Faults” on page 9
■ “Hot-Pluggable and Hot-Swappable Devices” on page 72
■ “Fan Tray Fault LED” on page 84
Fault Amber This LED is lit when the fan tray is faulty.
The front panel Fan Fault LED, and the front and rear panel Service Required LEDs
are also lit if the system detects a fan tray fault. In addition, the system Overtemp
LED might be lit if a fan fault causes an increase in system operating temperature.
See “Front Panel LEDs” on page 4 and “Rear Panel LEDs” on page 7 for more
information about system status LEDs.
Related Information
■ “Managing Faults” on page 9
■ “Hot-Pluggable and Hot-Swappable Devices” on page 72
■ “Fan Tray Fault LED” on page 84
Note – If a power supply fails and you do not have a replacement available, leave
the failed power supply installed to ensure proper airflow in the server.
Related Information
■ “Identifying Server Components” on page 1
■ “Managing Faults” on page 9
■ “Hot-Pluggable and Hot-Swappable Devices” on page 72
■ “Power Supply Device Identifiers” on page 91
Caution – Hazardous voltages are present. To reduce the risk of electric shock and
danger to personal health, follow the instructions.
Note – If you are servicing Power Supply 0, you must disconnect the cable
management arm support strut.
2. Gain access to the rear of the server where the faulty power supply is located.
If necessary, slide the system partially out of the rack to obtain better access to the
rear panel.
4. Grasp the power supply handle and press the release latch.
2. Slide the power supply into the bay until it is fully seated.
4. Verify that the system Power Supply Fault LED, and the front and rear Service
Required LEDs are not lit.
Note – See “Front Panel LEDs” on page 4 and “Rear Panel LEDs” on page 7 for more
information about identifying and interpreting system LEDs.
5. At the ILOM -> prompt, use the show faulty command to verify the status of
the power supplies.
Caution – Hazardous voltages are present. To reduce the risk of electric shock and
danger to personal health, follow the instructions.
If you are removing the power supplies as a prerequisite for another service
procedure, follow these steps.
Note – If you are servicing Power Supply 0, you must disconnect the cable
management arm support strut.
1. Grasp the power supply handle and press the release latch.
1. Align the replacement power supply with the empty power supply bay.
2. Slide the power supply into the bay until it is fully seated.
Next Steps
■ “Connect the Power Cords to the Server” on page 153
■ “Power On the Server” on page 153
PS0 /SYS/PS0
PS1 /SYS/PS1
PS2 /SYS/PS2
PS3 /SYS/PS3
Note – Power supply names in ILOM messages are displayed with the full FRU
name, such as /SYS/PS0.
Related Information
■ “Managing Faults” on page 9
■ “Hot-Pluggable and Hot-Swappable Devices” on page 72
■ “Power Supply LED” on page 91
See TABLE: Power Supply Status LEDs on page 91 for a description of power supply
LED modes and their function, listed from top to bottom.
The following LEDs are lit when a power supply fault is detected:
■ Front and rear Service Required LEDs
■ Rear PS Failure LED on the bezel of the server
■ Fault LED mode on the faulty power supply
The front and rear panel Service Required LEDs are also lit if the system detects a
power supply fault.
See “Front Panel LEDs” on page 4 and “Rear Panel LEDs” on page 7 for more
information about identifying and interpreting system LEDs.
See “Power Supply LED” on page 91 for specific information about power supply
status LEDs.
Related Information
■ “Managing Faults” on page 9
■ “Hot-Pluggable and Hot-Swappable Devices” on page 72
■ “Front Panel LEDs” on page 4
■ “Rear Panel LEDs” on page 7
Related Information
■ “PCIe Device Identifiers” on page 96
Do the following :
5. If you are not replacing the PCIe card, install a PCIe filler panel in its place.
Next Steps
■ “Install the Top Cover” on page 150
■ “Slide the Server Into the Rack” on page 151
■ “Power On the Server” on page 153
Next Steps
■ “Install the Top Cover” on page 150
■ “Slide the Server Into the Rack” on page 151
■ “Power On the Server” on page 153
Note – PCIe names in ILOM messages are displayed with the full FRU name, such
as /SYS/MB/PCIE0.
Note – In the Solaris OS, PCIe slot addresses are associated with CMP modules. The
PCIe slot address in the Solaris OS might change if you add or remove CMP
modules, or if a CMP module is brought offline. For more information, see the
SPARC Enterprise T5440 Server Product Notes.
Related Information
■ “Managing Faults” on page 9
■ “PCIe Slot Configuration Guidelines” on page 97
■ “System Bus Topology” on page 162
■ “Performing Node Reconfiguration” on page 155
Number of CMP/Memory
PCIe/XAUI Card Type Modules Installation Order Notes
External I/O Expansion Unit PCIe Link cards must be placed in a PCIe slot with a
CMP/memory module pair present as follows:
■ PCIe Slots 0 and 1 require CMP/Memory pair 0.
■ PCIe Slots 4 and 5 require CMP/Memory pair 1.
■ PCIe Slots 2 and 3 require CMP/Memory pair 2.
■ PCIe Slots 6 and 7 require CMP/Memory pair 3.
Related Information
■ “PCIe Device Identifiers” on page 96
■ “System Bus Topology” on page 162
■ “I/O Fabric in 2P Configuration” on page 164
■ “I/O Fabric in 4P Configuration” on page 165
Do the following :
Note – If you are replacing a faulty CMP or memory module, you must transfer the
FB-DIMMs on the faulty module to the replacement module. Replacement
CMP/memory modules do not include FB-DIMMs.
3. Rotate the ejector levers down to secure the module into place.
Next Steps
■ “Install the Top Cover” on page 150
■ “Slide the Server Into the Rack” on page 151
■ “Power On the Server” on page 153
Do the following :
3. If you are installing the module into a previously empty slot, remove the plastic
connector cover on the motherboard.
5. Rotate the ejector levers down to secure the module into place.
Next Steps
■ “Install the Top Cover” on page 150
■ “Slide the Server Into the Rack” on page 151
■ “Power On the Server” on page 153
CMP0 /SYS/MB/CPU0/CMP0
MEM0 /SYS/MB/MEM0/CMP0
CMP1 /SYS/MB/CPU1/CMP1
MEM1 /SYS/MB/MEM1/CMP1
CMP2 /SYS/MB/CPU2/CMP2
MEM2 /SYS/MB/MEM2/CMP2
CMP3 /SYS/MB/CPU3/CMP3
MEM3 /SYS/MB/MEM3/CMP3
Note – CMP and memory module names in ILOM messages are displayed with the
full FRU name, such as /SYS/MB/CPU0.
Related Information
■ “Managing Faults” on page 9
■ “Supported FB-DIMM Configurations” on page 110
■ “Performing Node Reconfiguration” on page 155
Related Information
■ “CMP and Memory Module Device Identifiers” on page 103
■ “Performing Node Reconfiguration” on page 155
Servicing FB-DIMMs
Up to 16 FB-DIMMs can be installed in each CMP/memory module pair.
Related Information
■ “Managing Faults” on page 9
■ “Remove FB-DIMMs” on page 105
■ “Install FB-DIMMs” on page 105
■ “Verify FB-DIMM Replacement” on page 106
■ “Add FB-DIMMs” on page 109
■ “Supported FB-DIMM Configurations” on page 110
■ “FB-DIMM Device Identifiers” on page 112
■ “FB-DIMM Fault Button Locations” on page 113
■ “Servicing CMP/Memory Modules” on page 98
■ “Performing Node Reconfiguration” on page 155
Do the following:
1. If you are removing a faulty FB-DIMM, determine which FB-DIMM you want
to remove.
2. Push down on the ejector tabs on each side of the FB-DIMM until the
FB-DIMM is released.
3. Grasp the top corners of the faulty FB-DIMM and remove it from the
CMP/memory module.
▼ Install FB-DIMMs
1. Unpackage the replacement FB-DIMMs and place them on an antistatic mat.
Tip – See “Supported FB-DIMM Configurations” on page 110 for information about
configuring the FB-DIMMs.
4. Push the FB-DIMM into the connector until the ejector tabs lock the FB-DIMM
in place.
If the FB-DIMM does not easily seat into the connector, verify that the orientation
of the FB-DIMM is correct. If the orientation is reversed, damage to the FB-DIMM
might occur.
5. Repeat Step 2 through Step 4 until all replacement FB-DIMMs are installed.
Next Steps
■ “Install a CMP/Memory Module” on page 100
■ “Install the Top Cover” on page 150
■ “Slide the Server Into the Rack” on page 151
■ “Power On the Server” on page 153
2. Run the show faulty command to determine how to clear the fault.
The method you use to clear a fault depends on how the fault is identified by the
showfaults command.
Examples:
■ If the fault is a host-detected fault (displays a UUID), continue to Step 3. For
example:
If the fault is still displayed by the show faulty command, then run the set
command to enable the FB-DIMM and clear the fault.
Example:
a. Set the virtual keyswitch to diag so that POST will run in Service mode.
Note – The server takes about one minute to power off. Use the show /HOST
command to determine when the host has been powered off. The console will display
status=Powered Off.
Watch the POST output for possible fault messages. The following output is a
sign that POST did not detect any faults:
.
.
.
0:0:0>INFO:
0:0:0> POST Passed all devices.
0:0:0>POST: Return to VBSC.
0:0:0>Master set ACK for vbsc runpost command and spin...
e. Switch to the system console and issue the Solaris OS fmadm faulty
command.
# fmadm faulty
■ If the show faulty command does not report a fault with a UUID, then you
do not need to proceed with the following step because the fault is cleared.
▼ Add FB-DIMMs
If you are upgrading the system with additional FB-DIMMs, use this procedure.
5. Repeat Step 2 through Step 4 until all the FB-DIMMs are installed.
Next Steps
■ “Install a CMP/Memory Module” on page 100
■ “Install the Top Cover” on page 150
■ “Slide the Server Into the Rack” on page 151
■ “Power On the Server” on page 153
Figure Legend
Note – See TABLE: FB-DIMM Configurations and Device Identifiers on page 112 for
a list of device identifiers and the corresponding slots on the CMP/memory modules.
Related Information
■ “Managing Faults” on page 9
■ “FB-DIMM Device Identifiers” on page 112
■ “FB-DIMM Fault Button Locations” on page 113
■ “Performing Node Reconfiguration” on page 155
The FB-DIMM address follows the same convention as the CMP or memory module
upon which it is mounted. For example, /SYS/MB/CPU0/CMP0/BR1/CH0/D0 is the
device identifier for the FB-DIMM mounted at J792 on CMP module 0.
Connector
Location FB-DIMM Device Identifiers Number FB-DIMM Group
Related Information
■ “Managing Faults” on page 9
■ “Supported FB-DIMM Configurations” on page 110
■ “FB-DIMM Fault Button Locations” on page 113
■ “Performing Node Reconfiguration” on page 155
Note – You must replace a faulty FB-DIMM with an identical part (same part
number). See “Supported FB-DIMM Configurations” on page 110 for more
information.
Related Information
■ “Managing Faults” on page 9
■ “Supported FB-DIMM Configurations” on page 110
■ “FB-DIMM Device Identifiers” on page 112
These topics describe how to service field-replaceable units (FRUs) in the SPARC
Enterprise T5440 server.
Topic Links
Remove and install field-replaceable “Servicing the Front Bezel” on page 115
components. “Servicing the DVD-ROM Drive” on page 118
“Servicing the Service Processor” on page 120
“Servicing the IDPROM” on page 123
“Servicing the Battery” on page 125
“Servicing the Power Distribution Board” on
page 126
“Servicing the Fan Tray Carriage” on page 129
“Servicing the Hard Drive Backplane” on page 132
“Servicing the Motherboard” on page 135
“Servicing the Flex Cable Assembly” on page 140
“Servicing the Front Control Panel” on page 144
“Servicing the Front I/O Board” on page 146
Exploded views of FRUs “Field-Replaceable Units” on page 176
Related Information
■ “Servicing the DVD-ROM Drive” on page 118
115
▼ Remove the Front Bezel
Before you begin, complete these tasks:
■ Read the section, “Safety Information” on page 59.
■ If you are performing additional service procedures, power off the server, using
one of the methods described in the section, “Powering Off the System” on
page 63.
■ “Extend the Server to the Maintenance Position” on page 66
■ “Perform Electrostatic Discharge – Antistatic Prevention Measures” on page 69
Do the following :
Note – Avoid bending the bezel by gradually pulling it from the middle and both
ends simultaneously.
Next Steps
■ “Slide the Server Into the Rack” on page 151
Related Information
■ “Servicing the Front Bezel” on page 115
Do the following:
3. Push the DVD-ROM drive forward until it protrudes from the front of the
chassis.
Related Information
■ “Servicing the IDPROM” on page 123
■ “Servicing the Battery” on page 125
Do the following:
1. Ensure that the power cords are disconnected from the server.
2. Loosen the two captive No. 2 Phillips screws securing the service processor to
the motherboard.
Next Steps
If you are replacing a faulty service processor, you must install the IDPROM onto the
new service processor. Do the following:
■ Remove the IDPROM from the old service processor. See “Remove the IDPROM”
on page 123.
3. Press down evenly to plug the service processor into the motherboard.
4. Secure the service processor with the two captive No.2 Phillips screws.
Related Information
■ “Servicing the Service Processor” on page 120
■ “Servicing the Battery” on page 125
1. Lift the IDPROM up, off its connector on the service processor.
Related Information
■ “Servicing the Service Processor” on page 120
■ “Servicing the IDPROM” on page 123
1. Release the latch securing the battery to its holder on the service processor
board.
Next Steps
■ “Install the Service Processor” on page 122
■ “Install the Top Cover” on page 150
■ “Slide the Server Into the Rack” on page 151
■ “Connect the Power Cords to the Server” on page 153
■ “Power On the Server” on page 153
Related Information
■ “Safety Information” on page 59
■ “Servicing Power Supplies” on page 85
Note – You must remove all four power supplies from the system.
Do the following:
3. Unplug the auxiliary power cable from the power distribution board.
5. Remove the two 7 mm hex nuts securing the bus bars to the power distribution
board.
4. Install the two 7 mm nuts securing the bus bars to the power distribution board.
Next Steps
■ “Install the Top Cover” on page 150
■ “Slide the Server Into the Rack” on page 151
■ “Install a Power Supply” on page 90
Related Information
■ “Servicing Fan Trays” on page 81
■ “Servicing the Hard Drive Backplane” on page 132
■ “Servicing the Motherboard” on page 135
■ “Servicing the Front Control Panel” on page 144
■ “Servicing the Front I/O Board” on page 146
Note – You must remove all CPU modules and memory modules from the system.
Do the following:
2. Loosen the seven captive No. 2 Phillips securing the bottom of the fan tray
carriage to the motherboard assembly.
Next Steps
■ “Install a Fan Tray” on page 84
Related Information
■ “Servicing Hard Drives” on page 72
Note – You must remove all four hard drives from the server. Note the location of
each hard drive you remove. You must re-install each hard drive in the correct bay.
Do the following:
Next Steps
■ “Install the Fan Tray Carriage” on page 131
■ “Install a Fan Tray” on page 84
■ “Install a CMP/Memory Module” on page 100
■ “Install the Top Cover” on page 150
■ “Install a Hard Drive” on page 78
Note – You must install the hard drives in the correct slots.
Related Information
■ “Controlling How POST Runs” on page 26
■ “Servicing CMP/Memory Modules” on page 98
■ “Servicing PCIe Cards” on page 92
■ “Servicing the Service Processor” on page 120
■ “Servicing the Fan Tray Carriage” on page 129
■ “Motherboard Fastener Locations” on page 139
Note – You must remove all PCIe cards. Note the location of all PCIe cards so you
can install them in the correct slots during reassembly.
6. Remove the six No. 2 Phillips screws that secure the bus bar assembly to the
motherboard.
Note – Use the clips to secure the midwall panel in the open position.
8. Loosen the No. 2 Phillips screws that secure the motherboard to the chassis
floor.
See FIGURE: Motherboard Fastener Locations on page 140 for the fastener
locations.
Next Steps
If you are replacing a faulty motherboard, you must program the chassis serial
number and product part number into the new motherboard. See your service
representative.
5. Install the six No. 2 Phillips screws that secure the bus bar assembly to the
motherboard.
Related Information
■ “Servicing the Motherboard” on page 135
Related Information
■ “Safety Information” on page 59
■ “Servicing Power Supplies” on page 85
Do the following:
Next Steps
■ “Install the Top Cover” on page 150
■ “Slide the Server Into the Rack” on page 151
■ “Power On the Server” on page 153
Related Information
■ “Infrastructure Boards and Cables” on page 1
■ “Front Panel Diagram” on page 3
■ “Front Panel LEDs” on page 4
1. Unplug the front control panel cable from J9901 on the motherboard.
2. Unplug the front control panel cable from the front I/O board.
3. Plug the front control panel connector into the front I/O board.
4. Plug the front control panel connector into J9901 on the motherboard.
Next Steps
■ “Install the Fan Tray Carriage” on page 131
■ “Install a Fan Tray” on page 84
■ “Install the Top Cover” on page 150
■ “Install the Server Into the Rack” on page 150
■ “Connect the Power Cords to the Server” on page 153
■ “Power On the Server” on page 153
1. Unplug the front control panel cable from J9901 on the motherboard.
2. Unplug the front control panel cable from the front I/O board.
3. Plug the front control panel connector into the front I/O board.
4. Plug the front control panel connector into J9901 on the motherboard.
Next Steps
■ “Install the Fan Tray Carriage” on page 131
■ “Install a Fan Tray” on page 84
■ “Install the Top Cover” on page 150
■ “Install the Server Into the Rack” on page 150
■ “Connect the Power Cords to the Server” on page 153
■ “Power On the Server” on page 153
These topics describe how to return the SPARC Enterprise T5440 and SPARC
Enterprise T5440 servers to operation after you have performed service procedures.
Caution – Never attempt to run the server with the cover removed. Hazardous
voltage is present.
Caution – Equipment damage could occur if you run the server with the cover
removed. The cover must be in place for proper air flow.
Topic Links
Install the top cover after servicing internal “Install the Top Cover” on page 150
components.
Re-attach the server to the cabinet slide rails “Install the Server Into the Rack” on
after performing a bench procedure. page 150
Slide the server back into the equipment “Slide the Server Into the Rack” on page 151
rack.
Re-attach power cords and data cables to the “Connect the Power Cords to the Server” on
back panel of the server. page 153
Power on the server after performing a “Power On the Server” on page 153
service procedure.
Related Information
■ “Preparing to Service the System” on page 59
■ “Servicing Customer-Replaceable Units” on page 71
■ “Servicing Field-Replaceable Units” on page 115
149
▼ Install the Top Cover
If you removed the top cover, perform the steps in this procedure.
Note – If removing the top cover caused an emergency shutdown, you must install
the top cover and use the poweron command to restart the system. See “Power On
the Server” on page 153.
3. Secure the top cover by tightening the two captive screws along the rear edge.
Caution – The weight of the server on extended slide rails can be enough to
overturn an equipment rack. Before you begin, deploy the antitilt feature on your
cabinet.
Caution – The server weighs approximately 88 lb (40 kg). Two people are required
to lift and mount the server into a rack enclosure when using the procedures in this
chapter.
1. Slide the inner slide assemblies out from the outer rails about 2 inches (5 cm)
from the front face of the rail’s bracket.
The inner slide assemblies should be locked past the internal stop. See FIGURE:
Inserting the System Into the Rack on page 151.
Ensure that the ball bearing retainer is locked all the way forward.
2. Lift the server up and insert the inner rails into the inner slide assemblies.
Ensure that the inner rails are horizontal when the inner rails enter the inner slide
assemblies.
3. Ensure that the inner rails are engaged with the ball-bearing retainers on both
inner slide assemblies.
Note – If necessary, support the server with the mechanical lift while aligning the
inner rails parallel to the rack-mounted inner slide assemblies.
Figure Legend
2. While pushing on the release buttons, slowly push the server into the rack.
Ensure that the cables do not get in the way.
Note – As soon as the power cords are connected, standby power is applied.
Depending on the configuration of the firmware, the system might boot. See the
SPARC Enterprise T5440 Server Administration Guide for configuration and power-on
information.
■ To initiate the power-on sequence manually, use a pen or pencil to press the
Power button on the front panel. See “Front Panel Diagram” on page 3 for
Power button location.
If a CMP/memory module pair develops a fault, the SPARC Enterprise T5440 can be
reconfigured to run in a degraded state until the CMP/memory module is replaced.
In addition, you can add CMP/memory module pairs to existing systems. However,
adding or removing CMP/memory modules might affect internal hardware device
addresses, as well as the device address of any external devices attached to the
system such as external disk arrays and devices attached via an External I/O
Expansion Unit.
Topic Links
Related Information
■ “Managing Faults” on page 9
155
■ “Servicing PCIe Cards” on page 92
■ “Servicing CMP/Memory Modules” on page 98
■ “Servicing FB-DIMMs” on page 104
If a CMP module fails, the onboard devices and slots directly connected to it become
unavailable. Recovery of the I/O services connected to the failed CMP requires I/O
node reconfiguration.
For example, in a 4P system, if CMP0 goes offline, the following devices become
unavailable:
■ PCIe0
■ PCIe1
■ Onboard hard drives
In this failure scenario, the system is unable to boot from internal drives.
Related Information
■ “System Bus Topology” on page 162
■ “I/O Fabric in 2P Configuration” on page 164
■ “I/O Fabric in 4P Configuration” on page 165
2. If a replacement CMP module is not available, remove the failed CMP module and
replace it with a CMP from a different slot that does not have any directly
connected I/O devices in use (see TABLE: Devices controlled by CMPs in 2P
systems on page 164 andTABLE: Devices controlled by CMPs in 4P systems on
page 165). If this leaves a memory module without its associated CMP module,
remove the memory module.
3. If neither option (1) nor (2) is possible, you must do the following:
■ “Temporarily Disable All Memory Modules” on page 160
■ “Reconfigure the I/O and PCIe Fabric” on page 158
■ “Re-Enable All Memory Modules” on page 161
■ “Reset the LDoms Guest Configuration” on page 162
Related Information
■ “Managing Faults” on page 9
■ “Servicing CMP/Memory Modules” on page 98
■ “Servicing FB-DIMMs” on page 104
■ “I/O Connections to CMP/Memory Modules” on page 156
■ “Reconfiguring I/O Device Nodes” on page 158
■ “System Bus Topology” on page 162
■ “I/O Fabric in 2P Configuration” on page 164
■ “I/O Fabric in 4P Configuration” on page 165
Related Information
■ “Managing Faults” on page 9
■ “I/O Connections to CMP/Memory Modules” on page 156
■ “System Bus Topology” on page 162
■ “I/O Fabric in 2P Configuration” on page 164
■ “I/O Fabric in 4P Configuration” on page 165
■ “Temporarily Disable All Memory Modules” on page 160
■ “Reconfigure the I/O and PCIe Fabric” on page 158
■ “Re-Enable All Memory Modules” on page 161
■ “Reset the LDoms Guest Configuration” on page 162
Do the following:
2. Copy the reconf.pl script to the root directory of the “miniroot” of the
netinstall image. This is the Solaris_10/Tools/Boot directory of your
exported Solaris 10 8/07, Solaris 10 5/08, or Solaris 10 10/08 OS image on your
JumpStart server.
ok boot -s
7. Mount the system boot disk under the /mnt directory. Type:
8. Change to the root directory of your boot disk and copy the reconf.pl script to
the root of the boot disk. Type:
# cd /mnt
# cp /reconf.pl .
# cp /cdrom/Solaris_10/Tools/Boot/reconf.pl .
# /mnt/reconf.pl
# halt
12. Power off the system. For example, to power off using the ALOM compatibility
shell, type:
sc> poweroff
Wait for the console message which indicates that the system has been powered
off.
If you are recovering from a failed CMP module, you must temporarily disable the
FB-DIMMS on all memory modules when Solaris is halted and the system is
powered off. The FB-DIMMs are re-enabled after the I/O and PCIe devices are
reconfigured.
You can either physically remove the memory modules from the system, or remotely
disable all FB-DIMMs located on all memory modules using the
disablecomponent command.
To remove the memory modules from the system, see the instructions in the SPARC
Enterprise T5440 Server Service Manual.
Related Information
■ “I/O Fabric in 2P Configuration” on page 164
■ “I/O Fabric in 4P Configuration” on page 165
Related Information
■ “System Bus Topology” on page 162
■ “I/O Fabric in 4P Configuration” on page 165
Related Information
■ “System Bus Topology” on page 162
■ “I/O Fabric in 2P Configuration” on page 164
This section provides reference information about the system back panel ports and
pin assignments.
Topic Links
Reference for system “Serial Management Port Connector Pinouts” on page 167
connector pinouts “Network Management Port Connector Pinouts” on page 168
“Serial Port Connector Pinouts” on page 169
“USB Connector Pinouts” on page 169
“Gigabit Ethernet Connector Pinouts” on page 170
Related Information
■ “Identifying Server Components” on page 1
167
FIGURE: Serial Management Connector Diagram
1 2 3 4
1 2 3 4
A1 +5 V (fused) B1 +5 V (fused)
A2 USB0/1- B2 USB2/3-
A3 USB0/1+ B3 USB2/3+
A4 Ground B4 Ground
Description Links
Related Information
■ “Identifying Server Components” on page 1
■ “Servicing Customer-Replaceable Units” on page 71
■ “Servicing Field-Replaceable Units” on page 115
173
Customer-Replaceable Units
FIGURE: Customer-Replaceable Units (CRUs)
Figure Legend
Figure Legend
Figure Legend
1 IDPROM 4 Motherboard
2 Front Control Panel 5 Battery
3 Front I/O Board 6 Service Processor
Related Information
■ “Servicing the Service Processor” on page 120
■ “Servicing the IDPROM” on page 123
■ “Servicing the Battery” on page 125
■ “Servicing the Power Distribution Board” on page 126
179
configuration External I/O Expansion Unit
device addresses, 156 fault detected by show faulty command, 35
connecting to ALOM CMT, 22 faults detection in, 15
console command, 29, 53, 108
consolehistory command, 54 F
Fan Fault (system LED)
D interpreting to diagnose faults, 31
DC OK (power supply LED), 91 fan module
determining fault state, 31
device identifiers
Fault LED, 31
CMP/memory modules, 103
fan tray, 84 fan module LEDs
FB-DIMMs, 112 using to identify faults, 31
hard drive, 79 fan tray, 83
PCIe card, 96 device identifiers, 84
power supply, 91 installing, 82, 84
diag_level parameter, 27, 56 removing, 81, 83
diag_mode parameter, 26, 56 fan tray carriage
installing, 131
diag_trigger parameter, 27, 57
removing, 129
diag_verbosity parameter, 27, 57
fan tray LEDs
diagnostics
about, 84
about, 9
fan trays
flowchart, 11
about, 81
low level, 19
running remotely, 15 Fault (hard drive LED), 31
using SunVTS, 18 Fault (power supply LED), 86, 92
disablecomponent command, 52 fault manager daemon, fmd(1M), 18
displaying FRU status, 25 fault records, 49
dmesg command, 36 fault recovery
DVD-ROM drive I/O device, 158
installing, 119 fault recovery, CMP module, 155
removing, 118 faults
clearing POST-detected faults, 48
E detected by POST, 12, 33, 35
ejector tabs, FB-DIMM, 105 detected by PSH, 12, 34
electrostatic discharge (ESD) diagnosing with LEDs, 30 to 32
preventing, 69 environmental, 12, 13, 33
preventing using an antistatic mat, 61 environmental, displayed by show faulty
preventing using an antistatic wrist strap, 61 command, 34
safety measures, 61 FB-DIMM, 106
forwarded to ILOM, 15
emergency shutdown, 65
recovery, 16
using Power button, 5
repair, 16
enablecomponent command, 48 types of, 33
environmental faults, 12, 13, 16, 33 FB-DIMM fault button, 113
event log, checking the PSH, 45 FB-DIMM Fault LEDs, 32
EVENT_ID, FRU, 45 FB-DIMMS
exercising the system with SunVTS, 38
Index 181
CMP/memory module, 100 hard drive, 80
DVD-ROM drive, 119 network management port, 8
fan tray, 82, 84 rear panel, 7
fan tray carriage, 131 Service Required (system LED), 32
FB-DIMMs, 105 using to diagnose faults, 30
flex cable assembly, 142 using to identify device state, 30
front bezel, 117 Locator LED and button, 3, 4, 5, 7
front control panel, 145 log files, viewing, 36
front I/O board, 148 logical domains
hard drive, 75, 78 guest configuration, 162
hard drive backplane, 133
IDPROM, 124
motherboard, 138
M
PCIe card, 94 MAC addresses, stored on SCC module, 2
power distribution board, 128 maintenance position, 65, 67
power supply, 87, 90 memory
service processor, 122 also see FB-DIMMs
top cover, 150 fault handling, 21
Integrated Lights Out Manager memory modules
and fault detection in External I/O Expansion see CMP/memory modules
Unit, 15 message ID, 18
iostat -E command, 77 messages file, 35
motherboard
L about, 1
latch fastener locations, 139
power supply, 86, 89 installing, 138
slide rail, 66 removing, 135
LED
AC Present (power supply LED), 12, 92 N
DC OK (power supply LED), 91 network management port
Fan Fault (system LED), 31 LEDs, 8
Fault (fan module LED), 31 pinouts, 168
Fault (hard drive LED), 31 node reconfiguration, 155
Fault (power supply LED), 31, 86, 92 and I/O services, 156
FB-DIMM Fault (motherboard LEDs), 32 I/O device nodes, 158
Gigabit Ethernet port, 8 PCIe, 158
Locator, 4, 7 Normal mode (virtual keyswitch position), 108
Overtemp (system LED), 5, 31 also see setkeyswitch command.
Power OK (system LED), 12
Power Supply Fault (system LED), 5, 31, 88, 92
O
Ready-to-Remove (hard drive LED), 74, 76
Overtemp (system LED), 5, 31
Service Required (system LED), 4, 31, 92
Top (system LED), 5 overtemperature condition, 31
LEDs
about, 30 P
fan module, 31 PCIe card
fan tray, 84 adding, 94
front panel, 4 configuration guidelines, 97
Index 183
power distribution board, 126 replacement, 108
power supply, 86, 89 showcomponent command, 24, 51
server from rack, 67 showenvironment command, 55
service processor, 120 showfaults command
reset command, 55 syntax, 55
reset, system showfru command, 25, 56
using ILOM, 28 showkeyswitch command, 56
using POST commands, 28
showlocator command, 56
resetsc command, 55
showlogs command, 56
showplatform command, 56, 62
S
shutdown
safety information, 59
triggered by top cover removal (emergency
safety symbols, 60
shutdown), 150
sanity check for hardware components, 19 using Power button (emergency shutdown), 5
SCC module using Power button (graceful shutdown), 5
and host ID, 2 using powercycle command (graceful
and MAC addresses, 2 shutdown), 54
serial management port using powercycle -f command (emergency
pinouts, 167 shutdown), 54
serial number, chassis, 62 using poweroff command, 54
serial port (DB-9) slide rail latch, 66
pinouts, 169 Solaris log files, 12
service processor Solaris log files as diagnostic tool, 12
installing, 122 Solaris OS
removing, 120 checking log files for fault information, 12
Service Required (system LED), 32 collecting diagnostic information from, 35
about, 4 message buffer, checking, 36
cleared by enablecomponent command, 48 message log files, viewing, 36
interpreting to diagnose faults, 31 Solaris Predictive Self-Healing, 17
triggered by ILOM, 15 SunVTS, 18
triggered by power supply fault, 92 as fault diagnosis tool, 12
set command browser environment, 40
and component_state property, 48 Component Stress parameter, 40
setkeyswitch parameter, 28, 55, 56, 107 exercising the system with, 38
setlocator command, 4, 7, 55, 66 software packages, 41
show faulty command, 31, 43, 55 System Excerciser, 40
and faults detected by POST, 35 tests, 42
and PSH faults, 34 user interfaces, 38, 39, 41, 42
and Service Required LED, 32 using for fault diagnosis, 12
description and examples, 32 verifying installation, 38
environmental fault, 34 syslogd daemon, 36
reasons to use, 33 system bus topology, 162
use in detecting faults in an External I/O system console, 23
Expansion Unit, 35 system console, switching to, 23
using to check for faults, 12
system controller, 10
using to diagnose FB-DIMMs, 106
using to verify successful FB-DIMM
U
UltraSPARC T2+ multicore processor, 18
Universal Unique Identifier (UUID), 18, 45
USB ports
pinouts, 169
USB ports (front), 3
V
virtual keyswitch, 28, 107
X
XAUI card
about, 1
configuration guidelines, see PCIe configuration
guidelines
Index 185
186 SPARC Enterprise T5440 Server Service Manual • July 2009