Ibm x3550 M3 Server
Ibm x3550 M3 Server
Chapter 2. Introduction . . . . . . . . . . . . . . . . . . . . . . 5
Related documentation . . . . . . . . . . . . . . . . . . . . . . 5
Notices and statements in this document . . . . . . . . . . . . . . . . 6
Features and specifications . . . . . . . . . . . . . . . . . . . . . 7
Server controls, LEDs, and power . . . . . . . . . . . . . . . . . . 9
Front view . . . . . . . . . . . . . . . . . . . . . . . . . . 9
Operator information panel . . . . . . . . . . . . . . . . . . . 10
Light path diagnostics panel . . . . . . . . . . . . . . . . . . . 11
Rear view . . . . . . . . . . . . . . . . . . . . . . . . . . 12
Server power features . . . . . . . . . . . . . . . . . . . . . 14
Internal LEDs, connectors, and jumpers. . . . . . . . . . . . . . . . 16
System-board internal connectors . . . . . . . . . . . . . . . . . 16
System-board external connectors . . . . . . . . . . . . . . . . . 17
System-board switches and jumpers . . . . . . . . . . . . . . . . 18
System-board LEDs . . . . . . . . . . . . . . . . . . . . . . 21
System-board optional device connectors . . . . . . . . . . . . . . 22
Chapter 3. Diagnostics . . . . . . . . . . . . . . . . . . . . . 23
Diagnostic tools . . . . . . . . . . . . . . . . . . . . . . . . 23
Event logs . . . . . . . . . . . . . . . . . . . . . . . . . . 24
Viewing event logs from the Setup utility . . . . . . . . . . . . . . 24
Viewing event logs without restarting the server . . . . . . . . . . . . 25
Clearing the error logs . . . . . . . . . . . . . . . . . . . . . 26
POST . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
POST error codes . . . . . . . . . . . . . . . . . . . . . . . 27
System event log . . . . . . . . . . . . . . . . . . . . . . . . 40
Integrated management module (IMM) error messages . . . . . . . . . 40
Checkout procedure . . . . . . . . . . . . . . . . . . . . . . . 99
About the checkout procedure . . . . . . . . . . . . . . . . . . 99
Performing the checkout procedure . . . . . . . . . . . . . . . . 99
Troubleshooting tables . . . . . . . . . . . . . . . . . . . . . 101
DVD drive problems . . . . . . . . . . . . . . . . . . . . . 101
General problems . . . . . . . . . . . . . . . . . . . . . . 102
Hard disk drive problems. . . . . . . . . . . . . . . . . . . . 102
Hypervisor problems . . . . . . . . . . . . . . . . . . . . . 104
Intermittent problems . . . . . . . . . . . . . . . . . . . . . 105
Keyboard, mouse, or pointing-device problems. . . . . . . . . . . . 106
Memory problems . . . . . . . . . . . . . . . . . . . . . . 107
Microprocessor problems. . . . . . . . . . . . . . . . . . . . 109
Monitor and video problems. . . . . . . . . . . . . . . . . . . 110
Optional-device problems . . . . . . . . . . . . . . . . . . . 112
Power problems . . . . . . . . . . . . . . . . . . . . . . . 113
Serial-device problems . . . . . . . . . . . . . . . . . . . . 116
Chapter 4. Parts listing, System x3550 M3 Types 4254 and 7944 . . . . . 177
Replaceable server components . . . . . . . . . . . . . . . . . . 177
Consumable parts . . . . . . . . . . . . . . . . . . . . . . 182
Power cords . . . . . . . . . . . . . . . . . . . . . . . . . 184
iv IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Removing an optional ServeRAID adapter advanced feature key . . . . . 226
Installing an optional ServeRAID adapter advanced feature key . . . . . 228
Removing a USB embedded hypervisor flash device . . . . . . . . . 229
Installing a USB embedded hypervisor flash device . . . . . . . . . . 230
Removing a hot-swap ac power supply . . . . . . . . . . . . . . 231
Installing a hot-swap ac power supply . . . . . . . . . . . . . . . 233
Removing a hot-swap fan assembly. . . . . . . . . . . . . . . . 235
Installing a hot-swap fan assembly . . . . . . . . . . . . . . . . 237
Removing the virtual media key . . . . . . . . . . . . . . . . . 238
Installing the virtual media key. . . . . . . . . . . . . . . . . . 238
Removing the optional two-port Ethernet adapter . . . . . . . . . . . 239
Installing the optional two-port Ethernet adapter . . . . . . . . . . . 240
Removing a remotely installed RAID adapter battery . . . . . . . . . 242
Installing a RAID adapter battery remotely in the server . . . . . . . . 243
Removing the system battery . . . . . . . . . . . . . . . . . . 246
Installing the system battery . . . . . . . . . . . . . . . . . . 248
Removing and replacing Tier 2 CRUs . . . . . . . . . . . . . . . . 250
Removing the bezel . . . . . . . . . . . . . . . . . . . . . 250
Installing the bezel . . . . . . . . . . . . . . . . . . . . . . 250
Removing an optional optical drive cage . . . . . . . . . . . . . . 251
Installing an optional optical drive cage . . . . . . . . . . . . . . 251
Removing the DVD cable . . . . . . . . . . . . . . . . . . . 252
Installing the DVD cable . . . . . . . . . . . . . . . . . . . . 253
Removing a SAS/SATA 4 Pac HDD option . . . . . . . . . . . . . 255
Installing a SAS/SATA 4 Pac HDD option . . . . . . . . . . . . . . 255
Removing a PCI riser-card assembly . . . . . . . . . . . . . . . 256
Installing a PCI riser-card assembly . . . . . . . . . . . . . . . . 257
Removing the PCI riser-card bracket from the riser card . . . . . . . . 258
Installing the PCI riser-card bracket to the riser card. . . . . . . . . . 258
Removing the hot-swap SAS/SATA hard disk drive backplane . . . . . . 259
Installing the hot-swap SAS/SATA hard disk drive backplane . . . . . . 260
Removing the simple-swap SATA hard disk drive backplate assembly . . . 261
Installing the simple-swap SATA hard disk drive backplate assembly. . . . 262
Removing the operator information panel assembly . . . . . . . . . . 263
Installing the operator information panel assembly . . . . . . . . . . 264
Removing and replacing FRUs . . . . . . . . . . . . . . . . . . 265
Removing the 240 VA safety cover . . . . . . . . . . . . . . . . 265
Installing the 240 VA safety cover . . . . . . . . . . . . . . . . 266
Removing a microprocessor and heat sink . . . . . . . . . . . . . 267
Installing a microprocessor and heat sink . . . . . . . . . . . . . . 269
Removing the heat sink retention module. . . . . . . . . . . . . . 274
Installing a heat sink retention module . . . . . . . . . . . . . . . 275
Removing the system board . . . . . . . . . . . . . . . . . . 275
Installing the system board . . . . . . . . . . . . . . . . . . . 277
Contents v
Configuring the Gigabit Ethernet controller . . . . . . . . . . . . . 295
Using the LSI Configuration Utility program . . . . . . . . . . . . . 295
IBM Advanced Settings Utility program. . . . . . . . . . . . . . . . 297
Updating IBM Systems Director . . . . . . . . . . . . . . . . . . 298
Updating the Universal Unique Identifier (UUID) . . . . . . . . . . . . 298
Updating the DMI/SMBIOS data . . . . . . . . . . . . . . . . . . 300
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . 315
vi IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Safety
Before installing this product, read the Safety Information.
Ennen kuin asennat tämän tuotteen, lue turvaohjeet kohdasta Safety Information.
Consider the following conditions and the safety hazards that they present:
v Electrical hazards, especially primary power. Primary voltage on the frame can
cause serious or fatal electrical shock.
v Explosive hazards, such as a damaged CRT face or a bulging capacitor.
v Mechanical hazards, such as loose or missing hardware.
To inspect the product for potential unsafe conditions, complete the following steps:
1. Make sure that the power is off and the power cord is disconnected.
2. Make sure that the exterior cover is not damaged, loose, or broken, and
observe any sharp edges.
3. Check the power cord:
v Make sure that the third-wire ground connector is in good condition. Use a
meter to measure third-wire ground continuity for 0.1 ohm or less between
the external ground pin and the frame ground.
v Make sure that the power cord is the correct type, as specified in “Power
cords” on page 184.
v Make sure that the insulation is not frayed or worn.
4. Remove the cover.
5. Check for any obvious non-IBM alterations. Use good judgment as to the safety
of any non-IBM alterations.
6. Check inside the server for any obvious unsafe conditions, such as metal filings,
contamination, water or other liquid, or signs of fire or smoke damage.
7. Check for worn, frayed, or pinched cables.
8. Make sure that the power-supply cover fasteners (screws or rivets) have not
been removed or tampered with.
viii IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Do not touch the reflective surface of a dental mirror to a live electrical circuit.
The surface is conductive and can cause personal injury or equipment damage if
it touches a live electrical circuit.
v Some rubber floor mats contain small conductive fibers to decrease electrostatic
discharge. Do not use this type of mat to protect yourself from electrical shock.
v Do not work alone under hazardous conditions or near equipment that has
hazardous voltages.
v Locate the emergency power-off (EPO) switch, disconnecting switch, or electrical
outlet so that you can turn off the power quickly in the event of an electrical
accident.
v Disconnect all power before you perform a mechanical inspection, work near
power supplies, or remove or install main units.
v Before you work on the equipment, disconnect the power cord. If you cannot
disconnect the power cord, have the customer power-off the wall box that
supplies power to the equipment and lock the wall box in the off position.
v Never assume that power has been disconnected from a circuit. Check it to
make sure that it has been disconnected.
v If you have to work on equipment that has exposed electrical circuits, observe
the following precautions:
– Make sure that another person who is familiar with the power-off controls is
near you and is available to turn off the power if necessary.
– When you are working with powered-on electrical equipment, use only one
hand. Keep the other hand in your pocket or behind your back to avoid
creating a complete circuit that could cause an electrical shock.
– When you use a tester, set the controls correctly and use the approved probe
leads and accessories for that tester.
– Stand on a suitable rubber mat to insulate you from grounds such as metal
floor strips and equipment frames.
v Use extreme care when you measure high voltages.
v To ensure proper grounding of components such as power supplies, pumps,
blowers, fans, and motor generators, do not service these components outside of
their normal operating locations.
v If an electrical accident occurs, use caution, turn off the power, and send another
person to get medical aid.
Safety ix
Safety statements
Important:
Each caution and danger statement in this document is labeled with a number. This
number is used to cross reference an English-language caution or danger
statement with translated versions of the caution or danger statement in the Safety
Information document.
For example, if a caution statement is labeled "Statement 1," translations for that
caution statement are in the Safety Information document under "Statement 1."
Be sure to read all caution and danger statements in this document before you
perform the procedures. Read any additional safety information that comes with the
server or optional device before you install the device.
x IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Statement 1:
DANGER
To Connect: To Disconnect:
Safety xi
Statement 2:
CAUTION:
When replacing the lithium battery, use only IBM Part Number 33F8354 or an
equivalent type battery recommended by the manufacturer. If your system has
a module containing a lithium battery, replace it only with the same module
type made by the same manufacturer. The battery contains lithium and can
explode if not properly used, handled, or disposed of.
Do not:
v Throw or immerse into water
v Heat to more than 100°C (212°F)
v Repair or disassemble
xii IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Statement 3:
CAUTION:
When laser products (such as CD-ROMs, DVD drives, fiber optic devices, or
transmitters) are installed, note the following:
v Do not remove the covers. Removing the covers of the laser product could
result in exposure to hazardous laser radiation. There are no serviceable
parts inside the device.
v Use of controls or adjustments or performance of procedures other than
those specified herein might result in hazardous radiation exposure.
DANGER
Some laser products contain an embedded Class 3A or Class 3B laser
diode. Note the following.
Laser radiation when open. Do not stare into the beam, do not view directly
with optical instruments, and avoid direct exposure to the beam.
Safety xiii
Statement 4:
CAUTION:
Use safe practices when lifting.
Statement 5:
CAUTION:
The power control button on the device and the power switch on the power
supply do not turn off the electrical current supplied to the device. The device
also might have more than one power cord. To remove all electrical current
from the device, ensure that all power cords are disconnected from the power
source.
2
1
xiv IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Statement 8:
CAUTION:
Never remove the cover on a power supply or any part that has the following
label attached.
Hazardous voltage, current, and energy levels are present inside any
component that has this label attached. There are no serviceable parts inside
these components. If you suspect a problem with one of these parts, contact
a service technician.
Statement 26:
CAUTION:
Do not place any object on top of rack-mounted devices.
Important: This product is not suitable for use with visual display workplace devices
according to Clause 2 of the German Ordinance for Work with Visual Display Units.
Safety xv
xvi IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Chapter 1. Start here
You can solve many problems without outside assistance by following the
troubleshooting procedures in this Problem Determination and Service Guide and
on the World Wide Web. This document describes the diagnostic tests that you can
perform, troubleshooting procedures, and explanations of error messages and error
codes. The documentation that comes with your operating system and software
also contains troubleshooting information.
Diagnosing a problem
Before you contact IBM or an approved warranty service provider, follow these
procedures in the order in which they are presented to diagnose a problem with
your server:
1. Determine what has changed.
Determine whether any of the following items were added, removed, replaced,
or updated before the problem occurred:
v Hardware components
v Device drivers and firmware
v System software
v UEFI firmware
v System input power or network connections
If possible, return the server to the condition it was in before the problem
occurred.
2. View the light path diagnostics LEDs and event logs.
The server is designed for ease of diagnosis of hardware and software
problems.
v Light path diagnostics LEDs: See “Light path diagnostics” on page 119 for
information about light path diagnostics LEDs that are lit and actions that you
should take.
v Event logs: See“System event log” on page 40 for information about
notification events and diagnosis.
v Software or operating-system error codes: See the documentation for the
software or operating system for information about a specific error code. See
the manufacturer's website for documentation.
3. Run IBM Dynamic System Analysis (DSA) and collect system data.
Run Dynamic System Analysis (DSA) to collect information about the hardware,
firmware, software, and operating system. Have this information available when
you contact IBM or an approved warranty service provider. For instructions for
running DSA, see the Dynamic System Analysis Installation and User's Guide.
To download the latest version of DSA code and the Dynamic System Analysis
Installation and User's Guide, go to [Link]
docdisplay?brand=5000008&lndocid=SERV-DSA.
4. Check for and apply code updates.
Fixes or workarounds for many problems might be available in updated UEFI
firmware, device firmware, or device drivers.
2 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
For problems with operating systems or IBM software or devices, go to
[Link]
7. Check for troubleshooting procedures and RETAIN tips.
Troubleshooting procedures and RETAIN tips document known problems and
suggested solutions. To search for troubleshooting procedures and RETAIN tips,
go to [Link]
8. Use the troubleshooting tables.
See “Troubleshooting tables” on page 101 to find a solution to a problem that
has identifiable symptoms.
A single problem might cause multiple symptoms. Follow the troubleshooting
procedure for the most obvious symptom. If that procedure does not diagnose
the problem, use the procedure for another symptom, if possible.
If the problem remains, contact IBM or an approved warranty service provider
for assistance with additional problem determination and possible hardware
replacement. To open an online service request, go to [Link]
support/electronic/. Be prepared to provide information about any error codes
and collected data.
Undocumented problems
If you have completed the diagnostic procedure and the problem remains, the
problem might not have been previously identified by IBM. After you have verified
that all code is at the latest level, all hardware and software configurations are valid,
and no light path diagnostics LEDs or log entries indicate a hardware component
failure, contact IBM or an approved warranty service provider for assistance. To
open an online service request, go to [Link] Be
prepared to provide information about any error codes and collected data and the
problem determination procedures that you have used.
For information about the terms of the warranty, see the Warranty Information
document that comes with the server. For information about getting service and
assistance, see Appendix A, “Getting help and technical assistance,” on page 305.
Related documentation
In addition to this document, the following documentation also comes with the
server:
v Environmental Notices and User Guide
This document is in PDF format on the IBM System x Documentation CD. It
contains translated environmental notices.
v IBM License Agreement for Machine Code
This document is in PDF. It contains translated versions of the IBM License
Agreement for Machine code for your server.
v IBM Warranty Information
This printed document contains the warranty terms and a pointer to the IBM
Statement of Limited Warranty on the IBM website.
v Installation and User’s Guide
This document is in Portable Document Format (PDF) on the IBM System x
Documentation CD. It provides general information about setting up and cabling
the server, including information about features, and how to configure the server.
It also contains detailed instructions for installing, removing, and connecting
some optional devices that the server supports.
v Licenses and Attributions Documents
This document is in PDF. It contains information about the open-source notices.
v Rack Installation Instructions
This printed document contains instructions for installing the server in a rack.
v Safety Information
This document is in PDF on the IBM System x Documentation CD. It contains
translated caution and danger statements. Each caution and danger statement
that appears in the documentation has a number that you can use to locate the
corresponding statement in your language in the Safety Information document.
The ToolsCenter for System x and BladeCenter is an online information center that
contains information about tools for updating, managing, and deploying firmware,
device drivers, and operating systems. The ToolsCenter for System x and
BladeCenter is at [Link]
6 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Features and specifications
The following information is a summary of the features and specifications of the
server. Depending on the server model, some features might not be available, or
some specifications might not apply.
Table 1. Features and specifications
Microprocessor: Hard disk drive expansion bays Power supply:
v Supports up to two Intel Xeon™ (depending on the model): v Maximum of two hot-swap power
multi-core microprocessors (one v Up to eight 2.5-inch hot-swap SAS or supplies for redundancy support
installed) hot-swap SATA hard disk drive bays – 460-watt ac
v Level-3 cache v Up to four 2.5-inch hot-swap – 675-watt ac
v QuickPath Interconnect (QPI) links SAS/SATA or simple-swap SATA hard
disk drive bays (on server models with – 675-watt high-efficiency ac
speed up to 6.4 GT per second
an optional optical drive bay) – 675-watt dc
Note:
PCI expansion slots: Note: You cannot mix 460-watt and
v Do not install an Intel Xeon™ 5500 675-watt power supplies, high-efficiency and
series microprocessor and an Xeon™ non-high-efficiency power supplies, or ac
Supports two PCI riser slots:
5600 series microprocessor in the same and dc power supplies in the server.
v Slot 1 supports low-profile cards (PCI
server.
Express Gen2 x16 or PCI-X 1.0a
v Use the Setup utility program to 64-bit/133 MHz ). Environment:
determine the type and speed of the v Slot 2 supports half-length, full-height v Air temperature:
microprocessors. cards (PCI Express Gen2 x16 or – Server on: 10°C to 35°C (50°F to
v For a list of supported microprocessors, PCI-X 1.0a 64-bit/133 MHz). 95°F); altitude: 0 to 914.4 m (3000 ft),
see [Link] decrease system temperature by
serverproven/compat/us/. Video controller (integrated into IMM): 1.0°C for every 1000-foot increase in
v Matrox G200eV (two analog ports - altitude
Memory: one front and one rear that can be – Server off: 5°C to 45°C (41°F to
v Minimum: 2 GB connected at the same time) 113°F); maximum altitude: 3048 m
v Maximum: 192 GB Note: The maximum video resolution (10000 ft)
– 48 GB using unbuffered DIMMs is 1600 x 1200 at 75 Hz. – Shipment: -40°C to 60°C
(UDIMMs) – SVGA compatible video controller (-40°F to 140°F); maximum altitude:
– 192 GB using registered DIMMs – DDR2 250 MHz SDRAM video 3048 m (10000 ft)
(RDIMMs) memory controller v Humidity:
v Type: PC3-10600R-999, 800, 1066, and – Avocent Digital Video Compression – Server on: 20% to 80%; maximum
1333 MHz, ECC, DDR3 registered or – 16 MB of video memory (not dew point: 21°C; maximum rate of
unbuffered SDRAM DIMMs expandable) change: 5 °C/hr
v Slots: 18 dual inline – Server off: 8% to 80%; maximum dew
v Supports (depending on the model): point: 27°C
– 2 GB and 4 GB unbuffered DIMMs v Particulate contamination:
– 2 GB, 4 GB, 8 GB, and 16 GB Attention: Airborne particulates and
registered DIMMs reactive gases acting alone or in
combination with other environmental
SATA optical drives (optional): factors such as humidity or temperature
v DVD-ROM might pose a risk to the server. For
information about the limits for
v Multi-burner
particulates and gases, see “Particulate
Hot-swap fans: contamination” on page 309.
Chapter 2. Introduction 7
Table 1. Features and specifications (continued)
Integrated functions: RAID controllers (depending on the Electrical input:
v Integrated Management Module (IMM), model): v Sine-wave input (47 - 63 Hz) required
which provides service processor v A ServeRAID-BR10il v2 SAS/SATA v Input voltage low range:
control and monitoring functions, video adapter that provides RAID levels 0, 1, – Minimum: 100 V ac
controller, and (when the optional virtual and 1E (comes standard on some – Maximum: 127 V ac
media key is installed) remote hot-swap models). v Input voltage high range:
keyboard, video, mouse, and remote – Minimum: 200 V ac
v An optional ServeRAID-BR10i
hard disk drive capabilities – Maximum: 240 V ac
SAS/SATA adapter that provides RAID
v Broadcom BCM5709 Gb Ethernet v Input kilovolt-amperes (kVA),
levels 0, 1, and 1E can be ordered.
controller with TCP/IP Offload Engine approximately:
(TOE) and Wake on LAN support v An optional ServeRAID-MR10i – Minimum: 0.090 kVA
v Five Universal Serial Bus (USB) 2.0 SAS/SATA adapter that provides RAID – Maximum: 0.700 kVA
ports (two front and two rear of the levels 0, 1, 5, 6, 10, 50, and 60 can be
ordered. Notes:
chassis, and one on the SAS/SATA
RAID riser card in which the optional v An optional ServerRAID M1015 1. Power consumption and heat output
USB flash device with embedded SAS/SATA adapter that provides RAID vary depending on the number and type
hypervisor software is installed) levels 0, 1, and 10 with optional RAID of optional features installed and the
v Four Ethernet ports (two on system 5/50 and SED (Self Encrypting Drive) power-management optional features in
board and two additional ports when the upgrade. use.
optional IBM Dual-Port 1 Gb Ethernet 2. The sound levels were measured in
v An optional ServeRAID M5014
Daughter Card is installed) controlled acoustical environments
SAS/SATA adapter that provides RAID
v One System Management RJ-45 on the according to the procedures specified
levels 0, 1, 5, 10 and 50 with optional
rear to connect to a systems by the American National Standards
RAID 6/60 and SED upgrade.
management network. This system Institute (ANSI) S12.10 and ISO 7779
management connector is dedicated to v An optional ServeRAID M5015
and are reported in accordance with
the IMM functions. This connector is SAS/SATA adapter that provides RAID
ISO 9296. Actual sound-pressure levels
active with or without the optional IBM levels 0, 1, 5, 10, and 50 with optional
in a given location might exceed the
Virtual Media Key installed. RAID 6/60 and SED upgrade.
average values stated because of room
v One serial port Size: reflections and other nearby noise
v 1U sources. The noise emission level
v Height: 43 mm (1.7 inches) stated in the declared (upper limit)
v Depth: 711 mm (28.0 inches) sound-power level, in bels, for a random
v Width: 440 mm (17.3 inches) sample of system.
v Weight: approximately 15.9 kg (35.1
lb) when fully configured
8 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Server controls, LEDs, and power
This section describes the controls and light-emitting diodes (LEDs) and how to turn
the server on and off. For the location of the LEDs on the system board, see
“System-board LEDs” on page 21.
Front view
The following illustration shows the controls, LEDs, and connectors on the front of
the server.
Hard disk drive
activity LED (green)
Hard disk drive USB 1
status LED (amber) connector
Optional Optional
optical drive optical drive USB 2 Video
Drive bay 0 Drive bay 2 activity LED eject button connector connector
Rack Rack
release release
latch latch
The following illustration shows the front of the simple-swap server model.
v Rack release latches: Press the latches on each front side of the server to
remove the server from the rack.
v Hard disk drive activity LEDs: This LED is used on hot-swap SAS or SATA
hard disk drives. Each hot-swap hard disk drive has an activity LED, and when
this LED is flashing, it indicates that the drive is in use.
v Hard disk drive status LEDs: This LED is used on hot-swap SAS or SATA hard
disk drives. When this LED is lit, it indicates that the drive has failed. If an
optional IBM ServeRAID controller is installed in the server, when this LED is
flashing slowly (one flash per second), it indicates that the drive is being rebuilt.
When the LED is flashing rapidly (three flashes per second), it indicates that the
controller is identifying the drive.
v Optional DVD eject button: Press this button to release a DVD or CD from the
DVD drive.
v Optional DVD drive activity LED: When this LED is lit, it indicates that the DVD
drive is in use.
v Operator information panel: This panel contains controls and LEDs that provide
information about the status of the server.
v Operator information panel release latch: Slide the blue release latch to the
left to pull out the light path diagnostics panel and view the light path diagnostics
LEDs and buttons. See “Light path diagnostics panel” on page 11 for more
information about the light path diagnostics.
Chapter 2. Introduction 9
v Video connector: Connect a monitor to this connector. The video connectors on
the front and rear of the server can be used simultaneously.
v USB connectors: Connect a USB device, such as a USB mouse or keyboard to
any of these connectors.
v Power-control button and power-on LED: Press this button to turn the server
on and off manually or to wake the server from a reduced-power state. The
states of the power-on LED are as follows:
Off: AC power is not present, or the power supply or the LED itself has failed.
Flashing rapidly (4 times per second): The server is turned off and is not
ready to be turned on. The power-control button is disabled. This will last
approximately 20 to 40 seconds.
Flashing slowly (once per second): The server is turned off and is ready to
be turned on. You can press the power-control button to turn on the server.
Lit: The server is turned on.
Fading on and off: The server is in a reduced-power state. To wake the
server, press the power-control button or use the IMM web interface. See
“Logging on to the Web interface” on page 294 for information on logging on
to the IMM web interface.
v Ethernet activity LEDs: When any of these LEDs is lit, they indicate that the
server is transmitting to or receiving signals from the Ethernet LAN that is
connected to the Ethernet port that corresponds to that LED.
v System-locator button/LED: Use this blue LED to visually locate the server
among other servers. This LED is also used as a presence detection button. You
can use IBM Systems Director to light this LED remotely. This LED is controlled
by the IMM. When you press the System-locator button, the LED will blink and it
will continue to blink until you press it again to turn it off. The locator button is
pressed to visually locate the server among the others servers.
v System-information LED: When this amber LED is lit, it indicates that a
noncritical event has occurred. Check the error log for additional information. See
“Event logs” on page 24 for information about the error logs.
v System-error LED: When this amber LED is lit, it indicates that a system error
has occurred. A system-error LED is also on the rear of the server. An LED on
the light path diagnostics panel on the operator information panel is also lit to
help isolate the error. This LED is controlled by the IMM.
v Hard drive activity LED: When this green LED is lit, it indicates that one of the
hard disk drives is in use.
10 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Notes:
1. For a SAS drive, a hard disk drive activity LED is shown in two places: on the
hard disk drive and on the operator information panel.
2. For a SATA drive, hard disk drive activity is indicated only by the hard disk
drive activity LED on the operator information panel.
Note: The system service label on the underside of the cover also provides
information about the location of the light path diagnostics LEDs.
To access the light path diagnostics panel, slide the blue release latch on the
operator panel to the left. Pull forward on the panel until the hinge of the operator
panel is free of the server chassis. Then pull down on the panel, so that you can
view the light path diagnostics panel information.
Note: When you slide the light path diagnostics panel out of the server to check
the LEDs or checkpoint codes, do not run the server continuously with light path
diagnostics panel outside of the server. The panel should only be outside of the
server a short time. The light path diagnostics panel must remain in the server
when the server is running to ensure proper cooling.
Operator information
panel
Light path
diagnostics LEDs
Release latch
The following illustration shows the LEDs and controls on the light path diagnostics
panel.
Chapter 2. Introduction 11
v Remind button: This button places the system-error LED on the front panel into
Remind mode. In Remind mode, the system-error LED flashes every 2 seconds
until the problem is corrected, the system is restarted, or a new problem occurs.
By placing the system-error LED indicator in Remind mode, you acknowledge
that you are aware of the last failure but will not take immediate action to correct
the problem. The remind function is controlled by the IMM.
v NMI button: Press this button to force a nonmaskable interrupt to the
microprocessor. It allows you to blue screen the server and take a memory dump
(use this button only when directed by the IBM service support).
v Checkpoint code display: This display provides a checkpoint code that
indicates the point at which the system stopped during the boot block and POST.
A checkpoint code is either a byte or a word value that is produced by UEFI. The
display does not provide error codes or suggest components to be replaced.
v Reset button: Press this button to reset the server and run the power-on
self-test (POST). You might have to use a pen or the end of a straightened paper
clip to press the button. The Reset button is in the lower right-hand corner of the
light path diagnostics panel.
For additional information about the light path diagnostics panel LEDs, see “Light
path diagnostics LEDs” on page 122.
Rear view
The following illustration shows the connectors and LEDs on the rear of the server.
v PCI slot 1: Insert a low-profile PCI Express or PCI-X adapter into this slot. You
can purchase an optional PCI Express or PCI-X riser-card assembly with bracket
if you want to install a PCI adapter in this slot.
v PCI slot 2: Insert a half-length, full-height PCI Express or PCI-X adapter into this
slot. Standard models of the server come with one PCI Express riser-card
assembly installed in this slot. You can purchase an optional PCI-X riser-card
assembly with bracket if you want to install a PCI-X adapter in this slot.
v Power cord connector: Connect the power cord to this connector.
v Video connector: Connect a monitor to this connector. The video connectors on
the front and rear of the server can be used simultaneously.
12 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Systems-management Ethernet connector: Use this connector to connect the
server to a network for full systems-management information control.
v Ethernet connectors: Use either of these connectors to connect the server to a
network. When you use the Ethernet 1 connector, the network can be shared
with the IMM through a single network cable.
The following illustration shows the LEDs on the rear of the server.
v Ethernet activity LEDs: When these LEDs are lit, they indicate that the server is
transmitting to or receiving signals from the Ethernet LAN that is connected to
the Ethernet port.
v Ethernet link LEDs: When these LEDs are lit, they indicate that there is an
active link connection on the 10BASE-T, 100BASE-TX, or 1000BASE-TX
interface for the Ethernet port.
v AC power LED: Each hot-swap ac power supply has an ac power LED and a dc
power LED. When the ac power LED is lit, it indicates that sufficient power is
coming into the power supply through the power cord. During typical operation,
both the ac and dc power LEDs are lit. For any other combination of LEDs, see
“Power-supply LEDs” on page 129.
v IN OK power LED: Each hot-swap dc power supply has an IN OK power LED
and an OUT OK power LED. When the IN OK power LED is lit, it indicates that
sufficient power is coming into the power supply through the power cord. During
typical operation, both the IN OK and OUT OK power LEDs are lit. For any other
combination of LEDs, see “Power-supply LEDs” on page 129.
v DC power LED: Each hot-swap power supply has a dc power LED and an ac
power LED. When the dc power LED is lit, it indicates that the power supply is
supplying adequate dc power to the system. During typical operation, both the ac
and dc power LEDs are lit. For any other combination of LEDs, see
“Power-supply LEDs” on page 129.
v OUT OK power LED: Each hot-swap dc power supply has an IN OK power LED
and an OUT OK power LED. When the OUT OK power LED is lit, it indicates that
Chapter 2. Introduction 13
the power supply is supplying adequate dc power to the system. During typical
operation, both the IN OK and OUT OK power LEDs are lit. For any other
combination of LEDs, see “Power-supply LEDs” on page 129.
v Power-supply error LED: When the power-supply error LED is lit, it indicates
that the power supply has failed.
14 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
operating system. The amount of memory that is reserved for system resources
depends on the operating system, the configuration of the server, and the
configured PCI options.
Some operating systems require an orderly shutdown before you turn off the server.
See your operating-system documentation for information about shutting down the
operating system.
Statement 5:
CAUTION:
The power control button on the device and the power switch on the power
supply do not turn off the electrical current supplied to the device. The device
also might have more than one power cord. To remove all electrical current
from the device, ensure that all power cords are disconnected from the power
source.
2
1
The server can be turned off in any of the following ways:
v You can turn off the server from the operating system, if your operating system
supports this feature. After an orderly shutdown of the operating system, the
server will turn off automatically.
v You can press the power-control button to start an orderly shutdown of the
operating system and turn off the server, if your operating system supports this
feature.
v If the operating system stops functioning, you can press and hold the
power-control button for more than 4 seconds to turn off the server.
v The server can be turned off by Wake on LAN feature with the following
limitation:
– To install any PCI adapter, the power cords must be removed from the power
source before you remove the PCI Express riser assembly and the PCI-X
riser assembly. Otherwise, this will cause the active power management event
signal to become disabled by the system board logic and Wake on LAN might
not work. However, after the server is powered-on locally, the active power
management event signal will be enabled by the system board logic.
v The integrated management module (IMM) can turn off the server as an
automatic response to a critical system failure.
Chapter 2. Introduction 15
Internal LEDs, connectors, and jumpers
The illustrations in this section show the connectors, LEDs, and jumpers on the
internal boards. The illustrations might differ slightly from your hardware.
16 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
System-board external connectors
The following illustration shows the external connectors on the system board:
Chapter 2. Introduction 17
System-board switches and jumpers
The following illustration shows the location and description of the switches and
jumpers.
Note: If there is a clear protective sticker on the top of the switch blocks, you must
remove and discard it to access the switches.
uEFI boot recovery IMM recovery
jumper (J29) jumper (J147)
18 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 2. System board jumpers (continued)
Jumper number Jumper name Jumper setting
Notes:
1. If no jumper is present, the server responds as if the pins are set to 1 and 2.
2. Changing the position of the UEFI boot recovery jumper from pins 1 and 2 to pins 2 and
3 before the server is turned on alters which flash ROM page is loaded. Do not change
the jumper pin position after the server is turned on. This can cause an unpredictable
problem.
The following table describes the functions of the SW3 switch block on the system
board.
Table 3. System board SW3 switch block definition
Switch
number Default position Description
1 Off Clear CMOS memory. When this
switch is toggled to On, it clears the
data in CMOS memory.
2 Off Trust Platform Module (TPM)
physical presence. Turning this
switch to the on position indicates a
physical presence to the TPM.
3 Off Reserved.
4 Off Reserved.
The following table describes the functions of the SW4 switch block on the system
board.
Table 4. System board SW4 switch block definition
Switch
number Default position Description
1 Off Power-on password override.
Changing the position of this switch
bypasses the power-on password
check the next time the server is
turned on and starts the Setup utility
so that you can change or delete the
power-on password. You do not have
to move the switch back to the
default position after the power-on
password in overridden.
Chapter 2. Introduction 19
Table 4. System board SW4 switch block definition (continued)
Switch
number Default position Description
2 Off When you toggle this switch On and
then Off, you force a power-on,
which overrides the power-on and
power-off button on the server and
they become nonfunctional.
3 Off (Trained service technician only)
Forced power permission. Changing
the position of this switch overrides
the IMM power-on checking process.
4 Off Reserved.
Important:
1. Before you change any switch settings or move any jumpers, turn off the server;
then, disconnect all power cords and external cables. Review the information in
“Safety” on page vii, “Installation guidelines” on page 187, “Handling
static-sensitive devices” on page 189, and “Turning off the server” on page 15.
2. Any system-board switch or jumper blocks that are not shown in the illustrations
in this document are reserved.
20 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
System-board LEDs
The following illustration shows the light-emitting diodes (LEDs) on the system
board.
Chapter 2. Introduction 21
System-board optional device connectors
The following illustration shows the connectors for user-installable options:
22 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Chapter 3. Diagnostics
This chapter describes the diagnostic tools that are available to help you solve
problems that might occur in the server.
If you cannot locate and correct a problem by using the information in this chapter,
see Appendix A, “Getting help and technical assistance,” on page 305 for more
information.
Diagnostic tools
The following tools are available to help you diagnose and solve hardware-related
problems:
v Light path diagnostics
Use light path diagnostics to diagnose system errors quickly. See “Light path
diagnostics” on page 119 for more information.
v Dynamic System Analysis (DSA) Preboot diagnostic programs
The DSA Preboot diagnostic programs provide problem isolation, configuration
analysis, and error log collection. The diagnostic programs are the primary
method of testing the major components of the server and are stored in
integrated USB memory. The diagnostic programs collect the following
information about the server:
– System configuration
– Network interfaces and settings
– Installed hardware
– Light path diagnostics status
– Service processor status and configuration
– Vital product data, firmware, and UEFI configuration
– Hard disk drive health
– RAID controller configuration
– Controller and service processor event logs, including the following
information:
- System error logs
- Temperature, voltage, and fan speed information
- Self-monitoring Analysis, and Reporting Technology (SMART) data
- Machine check registers
- USB information
- Monitor configuration information
- PCI slot information
The diagnostic programs create a merged log that includes events from all
collected logs. The information is collected into a file that you can send to IBM
service and support. Additionally, you can view the server information locally
through a generated text report file. You can also copy the log to removable
media and view the log from a web browser. See “Running the diagnostic
programs” on page 135 for more information.
v Troubleshooting tables
These tables list problem symptoms and actions to correct the problems. See
“Troubleshooting tables” on page 101 for more information.
v IBM Electronic Service Agent
IBM Electronic Service Agent is a software tool that monitors the server for
hardware error events and automatically submits electronic service requests to
IBM service and support. In addition, it can collect and transmit system
Event logs
Error codes and messages are displayed in the following types of event logs. Some
of the error codes and messages in the logs are abbreviated. When you are
troubleshooting PCI-X slots, note that the event logs report the PCI-X buses
numerically. The numerical assignments vary depending on the configuration. You
can check the assignments by running the Setup utility (see “Using the Setup utility”
on page 285 for more information).
v POST event log: This log contains the three most recent error codes and
messages that were generated during POST. You can view the contents of the
POST event log through the Setup utility.
v System-event log: This log contains messages that were generated during
POST and all system status messages from the service processor. You can view
the contents of the system-event log from the Setup utility.
The system-event log is limited in size. When it is full, new entries will not
overwrite existing entries; therefore, you must periodically clear the system-event
log through the Setup utility. When you are troubleshooting an error, be sure to
clear the system-event log so that you can find current errors more easily.
Each system-event log entry is displayed on its own page. Messages are listed
on the left side of the screen, and details about the selected message are
displayed on the right side of the screen. To move from one entry to the next,
use the Up Arrow (↑) and Down Arrow (↓) keys.
The system-event log indicates an assertion event when an event has occurred.
It indicates a deassertion event when the event is no longer occurring.
v Integrated management module (IMM) event log: This log contains a filtered
subset of all IMM, POST, and system management interrupt (SMI) events. You
can view the IMM event log through the IMM web interface and through the
Dynamic System Analysis (DSA) program (as the ASM event log).
v DSA log: This log is generated by the Dynamic System Analysis (DSA) program,
and it is a chronologically ordered merge of the system-event log (as the IPMI
event log), the IMM chassis-event log (as the ASM event log), and the
operating-system event logs. You can view the DSA log through the DSA
program.
24 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
2. When the prompt <F1> Setup is displayed, press F1. If you have set both a
power-on password and an administrator password, you must type the
administrator password to view the error logs.
3. Select System Event Logs and use one of the following procedures:
v To view the POST error log, select POST Event Viewers.
v To view the IMM system-event log, select System Event Log.
If you have installed Dynamic System Analysis (DSA) Portable or DSA Installable,
you can use it to view the system-event log (as the IPMI event log), the IMM event
log (as the ASM event log), the operating-system event logs, or the merged DSA
log. You can also use DSA Preboot to view these logs, although you must restart
the server to use DSA Preboot.
If IPMItool is installed in the server, you can use it to view the system-event log.
Most recent versions of the Linux operating system come with a current version of
IPMItool.
You can view the IMM system event log through the Event Log link in the
integrated management module (IMM) web interface. For more information, see
“Logging on to the Web interface” on page 294.
The following table describes the methods that you can use to view the event logs,
depending on the condition of the server. The first three conditions generally do not
require that you restart the server.
Table 5. Methods for viewing event logs
Condition Action
The server is not hung and is connected to a Use any of the following methods:
network. v Run DSA Portable or DSA Installable to
view the event logs or create an output file
that you can send to a support
representative.
v In a web browser, type the IP address of
the IMM and go to the Event Log page.
v Use IPMItool to view the system-event log.
The server is not hung and is not connected Use IPMItool locally to view the system-event
to a network. log.
Chapter 3. Diagnostics 25
Table 5. Methods for viewing event logs (continued)
Condition Action
The server is not hung and the integrated In a web browser, type the IP address for the
management module (IMM) is connected to IMM and go to the Event Log page. For
a network. more information, see “Obtaining the IP
address for the IMM” on page 293 and
“Logging on to the Web interface” on page
294.
The server is hung. v If DSA Preboot is installed, restart the
server and press F2 to start DSA Preboot
and view the event logs.
v If DSA Preboot is not installed, insert the
DSA Preboot CD and restart the server to
start DSA Preboot and view the event
logs.
v Alternatively, you can restart the server
and press F1 to start the Setup utility and
view the POST event log or system-event
log. For more information, see “Viewing
event logs from the Setup utility” on page
24.
Note: The POST error log is automatically cleared each time the server is
restarted.
1. Turn on the server.
2. When the prompt <F1> Setup is displayed, press F1. If you have set both a
power-on password and an administrator password, you must type the
administrator password to view the error logs.
3. Use one of the following procedures:
v To clear the IMM system-event log, select System Event Logs --> System
Event Log. Select Clear System Event Log; then, press Enter twice.
POST
When you turn on the server, it performs a series of tests to check the operation of
the server components and some optional devices in the server. This series of tests
is called the power-on self-test, or POST.
If a power-on password is set, you must type the password and press Enter, when
you are prompted, for POST to run.
26 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
POST error codes
The following table describes the POST error codes and suggested actions to
correct the detected problems. These errors can appear as severe, warning, or
informational.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support website at [Link] to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Error code Description Action
0010002 Microprocessor not supported. 1. Reseat the following components one at a time, in
the order shown, restarting the server each time.
a. (Trained service technician only)
Microprocessor 1.
b. (Trained service technician only)
Microprocessor 2 (if installed.)
2. (Trained service technician only) Remove
microprocessor 2 and restart the server.
3. (Trained service technician only) Remove
microprocessor 1 and install microprocessor 2 in
the microprocessor 1 connector. Restart the
server. If the error is corrected, then
microprocessor 1 is bad and must be replaced.
4. Replace the following components one at a time,
in the order shown, restarting the server each
time.
a. (Trained service technician only)
Microprocessor 1.
b. (Trained service technician only)
Microprocessor 2.
c. (Trained service technician only) System
board.
0011000 Invalid microprocessor type. 1. Update the server firmware to the latest level (see
“Updating the firmware” on page 281).
2. (Trained service technician only) Remove and
replace the affected microprocessor (error LED is
lit) with a supported type (see “Installing a
microprocessor and heat sink” on page 269).
0011002 Microprocessor mismatch. 1. Run the Setup utility and select System
Information → System Summary → Processor
Details to view the microprocessor information to
compare the installed microprocessor
specifications.
2. (Trained service technician only) Remove and
replace one of the microprocessors so that they
both match.
Chapter 3. Diagnostics 27
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support website at [Link] to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Error code Description Action
0011004 Microprocessor failed BIST. 1. Update the server firmware to the latest level (see
“Updating the firmware” on page 281).
2. (Trained service technician only) Reseat
microprocessor 2.
3. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. (Trained service technician only)
Microprocessor
b. (Trained service technician only) System
board
001100A Microcode updated failed. 1. Update the server firmware to the latest level (see
“Updating the firmware” on page 281).
2. (Trained service technician only) Replace the
microprocessor.
0018010 Microprocessors of the same model have 1. Run the Setup utility and select System
mismatched stepping ID. Information → System Summary → Processor
Details to view the microprocessor information to
compare the installed microprocessor
specifications.
2. (Trained service technician only) Remove and
replace one of the microprocessors so that they
both match.
0018009 Microprocessors have mismatched core 1. Run the Setup utility and select System
speed. Information → System Summary → Processor
Details to view the microprocessor information to
compare the installed microprocessor
specifications.
2. (Trained service technician only) Remove and
replace one of the microprocessors so that they
both match.
001800B Microprocessors have one or more cache 1. Run the Setup utility and select System
levels with mismatched size. Information → System Summary → Processor
Details to view the microprocessor information to
compare the installed microprocessor
specifications.
2. (Trained service technician only) Remove and
replace one of the microprocessors so that they
both match.
28 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support website at [Link] to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Error code Description Action
0018005 Microprocessors have mismatched number of 1. Run the Setup utility and select System
COREs. Information → System Summary → Processor
Details to view the microprocessor information to
compare the installed microprocessor
specifications.
2. (Trained service technician only) Remove and
replace one of the microprocessors so that they
both match.
0018006 Microprocessors have mismatched QPI 1. Run the Setup utility and select System
speed. Information → System Summary → Processor
Details to view the microprocessor information to
compare the installed microprocessor
specifications.
2. (Trained service technician only) Remove and
replace one of the microprocessors so that they
both match.
0018007 Microprocessors have mismatched power 1. Run the Setup utility and select System
segments. Information → System Summary → Processor
Details to view the microprocessor information to
compare the installed microprocessor
specifications.
2. (Trained service technician only) Remove and
replace one of the microprocessors so that they
both match.
0018008 Microprocessors have mismatched internal 1. Run the Setup utility and select System
DDR3 frequency. Information → System Summary → Processor
Details to view the microprocessor information to
compare the installed microprocessor
specifications.
2. (Trained service technician only) Remove and
replace one of the microprocessors so that they
both match.
Chapter 3. Diagnostics 29
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support website at [Link] to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Error code Description Action
0050001 DIMM disabled. Note: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Make sure the DIMM is installed correctly (see
“Installing a memory module” on page 216).
2. If the DIMM was disabled because of a memory
fault, follow the suggested actions for that error
event and restart the server.
3. Check the IBM support website for an applicable
retain tip or firmware update that applies to this
memory event. If no memory fault is recorded in
the logs and no DIMM connector error LED is lit,
you can re-enable the DIMM through the Setup
utility or the Advanced Settings Utility (ASU).
30 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support website at [Link] to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Error code Description Action
0051003 Uncorrectable DIMM error Note: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Check the IBM support website for an applicable
retain tip or firmware update that applies to this
memory error.
2. Manually re-enable all affected DIMMs if the
server firmware version is older than UEFI v1.10.
If the server firmware version is UEFI v1.10 or
newer, disconnect and reconnect the server to the
power source and restart the server.
3. If the problem remains, replace the failing DIMM
(see “Removing a memory module” on page 215
and “Installing a memory module” on page 216).
4. (Trained service technician only) If the problem
occurs on the same DIMM connector, check the
DIMM connector. If the connector contains any
foreign material or is damaged, replace the
system board (see “Removing the system board”
on page 275 and “Installing the system board” on
page 277).
5. (Trained service technician only) Remove the
affected microprocessor and check the
microprocessor socket pins for any damaged pins.
If a damage is found, replace the system board
(see “Removing the system board” on page 275
and “Installing the system board” on page 277).
6. (Trained Service technician only) Replace the
affected microprocessor (see “Removing a
microprocessor and heat sink” on page 267 and
“Installing a microprocessor and heat sink” on
page 269).
0051004 DIMM presence detected read/write failure Note: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Update the server firmware to the latest level (see
“Updating the firmware” on page 281).
2. Reseat the DIMMs.
3. Install DIMMs in the correct sequence (see
“Installing a memory module” on page 216).
4. Replace the failing DIMM.
5. (Trained service technician only) Replace the
system board.
Chapter 3. Diagnostics 31
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support website at [Link] to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Error code Description Action
0051006 DIMM mismatch detected Note: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the
[Link] sure that the DIMMs match and are
installed in the correct sequence (see “Installing a
memory module” on page 216).
0051009 No memory detected. Note: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Make sure one or more DIMMs are installed in the
server.
2. Reseat the DIMMs and restart the server (see
“Removing a memory module” on page 215 and
“Installing a memory module” on page 216).
3. Make sure that the DIMMs are installed in the
correct sequence (see “Installing a memory
module” on page 216 for more information).
4. (Trained service technician only) Replace the
microprocessor that controls the failing DIMMs
(see “Removing a microprocessor and heat sink”
on page 267 and “Installing a microprocessor and
heat sink” on page 269).
5. (Trained service technician only) Replace the
system board (see “Removing the system board”
on page 275 and “Installing the system board” on
page 277).
005100A No usable memory detected. Note: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Make sure one or more DIMMs are installed in the
server.
2. Reseat the DIMMs and restart the server (see
“Removing a memory module” on page 215 and
“Installing a memory module” on page 216).
3. Make sure that the DIMMs are installed in the
correct sequence (see “Installing a memory
module” on page 216 for more information).
4. Clear CMOS memory to ensure that all DIMM
connectors are enabled (see “Removing the
system battery” on page 246 and “Installing the
system battery” on page 248). Note that all
firmware settings will be reset to the default
settings.
32 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support website at [Link] to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Error code Description Action
0058001 PFA threshold exceeded Note: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Check the IBM support website for an applicable
retain tip or firmware update that applies to this
memory error.
2. Swap the affected DIMMs (as indicated by the
error LEDs on the system board or the event logs)
to a different memory channel or microprocessor
(see “Installing a memory module” on page 216
for memory population).
3. If the error still occurs on the same DIMM, replace
the affected DIMM.
4. (Trained service technician only) If the problem
occurs on the same DIMM connector, check the
DIMM connector. If the connector contains any
foreign material or is damaged, replace the
system board (see “Removing the system board”
on page 275 and “Installing the system board” on
page 277).
5. (Trained service technician only) Remove the
affected microprocessor and check the
microprocessor socket pins for any damaged pins.
If a damage is found, replace the system board
(see “Removing the system board” on page 275
and “Installing the system board” on page 277).
6. (Trained Service technician only) Replace the
affected microprocessor (see “Removing a
microprocessor and heat sink” on page 267 and
“Installing a microprocessor and heat sink” on
page 269).
0058007 Unsupported DIMM population. Note: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Reseat the DIMMs and restart the server (see
“Removing a memory module” on page 215 and
“Installing a memory module” on page 216).
2. Make sure that the DIMMs are installed in the
proper sequence (see “Installing a memory
module” on page 216).
Chapter 3. Diagnostics 33
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support website at [Link] to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Error code Description Action
0058008 DIMM failed memory test. Note: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Check the IBM support website for an applicable
retain tip or firmware update that applies to this
memory error.
2. Manually re-enable all affected DIMMs if the
server firmware version is older than UEFI v1.10.
If the server firmware version is UEFI v1.10 or
newer, disconnect and reconnect the server to the
power source and restart the server.
3. Swap the affected DIMMs (as indicated by the
error LEDs on the system board or the event logs)
to a different memory channel or microprocessor
(see “Installing a memory module” on page 216
for memory population).
4. If the problem is related to a DIMM, replace the
failing DIMM (see “Removing a memory module”
on page 215 and “Installing a memory module” on
page 216).
5. (Trained service technician only) If the problem
occurs on the same DIMM connector, check the
DIMM connector. If the connector contains any
foreign material or is damaged, replace the
system board (see “Removing the system board”
on page 275 and “Installing the system board” on
page 277).
6. (Trained service technician only) Remove the
affected microprocessor and check the
microprocessor socket pins for any damaged pins.
If a damage is found, replace the system board
(see “Removing the system board” on page 275
and “Installing the system board” on page 277).
7. (Trained service technician only) If the problem is
related to microprocessor socket pins, replace the
system board (see “Removing the system board”
on page 275 and “Installing the system board” on
page 277).
8. (Trained Service technician only) Replace the
affected microprocessor (see “Removing a
microprocessor and heat sink” on page 267 and
“Installing a microprocessor and heat sink” on
page 269).
0058015 Start to activate spare memory channel Information only. A failed DIMM has been detected to
activate the memory online-spare feature. Check the
event log for uncorrected DIMM failure events.
34 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support website at [Link] to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Error code Description Action
00580A1 Invalid DIMM population for mirroring mode Note: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. If a fault LED is lit, resolve the failure.
2. Install the DIMMs in the correct sequence (see
“Installing a memory module” on page 216).
00580A4 Memory population changed. Information only. Memory has been added, moved, or
changed.
00580A5 Mirror failover complete Information only. Memory redundancy has been lost.
Check the event log for uncorrected DIMM failure
events (see “Event logs” on page 24).
00580A6 Spare memory channel activated Information only. Memory online-spare channel has
been activated to back up a failed DIMM. Check the
event log for uncorrected DIMM failure events.
0068002 CMOS battery cleared. 1. Reseat the battery.
2. Clear the CMOS memory (see “System-board
switches and jumpers” on page 18).
3. Replace the following components one at a time,
in the following order, restarting the server after
each one:
a. Battery
b. (Trained service technician only) System
board.
2011001 PCI-X SERR 1. Check the riser-card LEDs.
2. Reseat all affected adapters and riser cards.
3. Update the PCI adapter firmware.
4. Remove the adapter from the riser card.
5. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. Riser card
b. (Trained service technician only) System
board
Chapter 3. Diagnostics 35
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support website at [Link] to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Error code Description Action
2018001 PCI Express uncorrected or uncorrected error 1. Check the riser-card LEDs.
2. Reseat all affected adapters and riser cards.
3. Update the PCI adapter firmware.
4. Remove the adapter from the riser card.
5. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. Riser card
b. (Trained service technician only) System
board
2018002 Option ROM resource allocation failure Informational message that some devices might not
be initialized.
1. If possible, rearrange the order of the adapters in
the PCI slots to change the load order of the
optional-device ROM code.
2. Run the Setup utility, select Startup Options, and
change the boot priority to change the load order
of the optional-device ROM code.
3. Run the Setup utility and disable some other
resources, if their functions are not being used, to
make more space available.
a. Select Startup Options, then Planar
Ethernet (PXE/DHCP) to disable the
integrated Ethernet controller ROM.
b. Select Advanced Functions, then PCI Bus
Control, then PCI ROM Control Execution to
disable the ROM of the adapter in the PCI
slots.
c. Select Devices and I/O Ports to disable any
of the integrated devices.
4. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. Each adapter
b. (Trained service technician only) System
board
3xx0007 (xx Firmware fault detected, system halted 1. Recover the server firmware to the latest level.
can be 00 -
2. Undo any recent configuration changes, or clear
19)
CMOS memory to restore the settings to the
default values (see “System-board switches and
jumpers” on page 18.
3. Remove any recently installed hardware.
36 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support website at [Link] to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Error code Description Action
3038003 Firmware corrupted 1. Run the Setup utility, select Load Default
Settings, and save the settings to recover the
server firmware.
2. (Trained service technician only) Replace the
system board.
3048005 Booted secondary (backup) UEFI Image Information only. The backup switch was used to boot
the secondary bank.
3048006 Booted secondary (backup) UEFI image 1. Run the Setup utility, select Load Default
because of ABR Settings, and save the settings to recover the
primary UEFI settings.
2. Turn off the server and remove it from the power
source.
3. Reconnect the server to the power source, then
turn on the server.
305000A RTC date/time is incorrect 1. Adjust the date and time settings in the Setup
utility, and then restart the server.
2. Reseat the battery.
3. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. Battery
b. (Trained service technician only) System
board
3058001 System configuration invalid 1. Run the Setup utility, and select Save Settings.
2. Run the Setup utility, select Load Default
Settings, and save the settings.
3. Reseat the following components one at a time in
the order shown, restarting the server each time:
a. Battery
b. Failing device (if the device is a FRU, then it
must be reseated by a trained service
technician only)
4. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. Battery
b. Failing device (if the device is a FRU, then it
must be replaced by a trained service
technician only)
c. (Trained service technician only) System board
Chapter 3. Diagnostics 37
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support website at [Link] to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Error code Description Action
3058004 Three boot failure 1. Undo any recent system changes, such as new
settings or newly installed devices.
2. Make sure that the server is attached to a reliable
power source.
3. Remove all hardware that is not listed on the
ServerProven web site at [Link]
servers/eserver/serverproven/compat/us/.
4. Make sure that the operating system is not
corrupted.
5. Run the Setup utility, save the configuration, and
then restart the server.
6. See “Problem determination tips” on page 174.
3108007 System configuration restored to default Information only. This is message is usually
settings associated with the CMOS battery clear event.
3138002 Boot configuration error 1. Remove any recent configuration changes made
to the Setup utility.
2. Run the Setup utility, select Load Default
Settings, and save the settings.
3808000 IMM communication failure 1. Shut down the system and remove the power
cords from the server for 30 seconds; then,
reconnect the server to power and restart it.
2. Update the IMM firmware to the latest level (see
“Updating the firmware” on page 281).
3. Make sure that the virtual media key is seated
and not damaged.
4. (Trained service technician only) Replace the
system board.
3808002 Error updating system configuration to IMM 1. Shut down the system and remove the power
cords from the server; then, reconnect the server
to power and restart it.
2. Run the Setup utility and select Save Settings.
3. Update the IMM firmware to the latest level (see
“Updating the firmware” on page 281).
3808003 Error retrieving system configuration from 1. Shut down the system and remove the power
IMM cords from the server; then, reconnect the server
to power and restart it.
2. Run the Setup utility and select Save Settings.
3. Update the IMM firmware to the latest level (see
“Updating the firmware” on page 281).
38 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support website at [Link] to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Error code Description Action
3808004 IMM system event log full v When using out-of-band, use the IMM Web
interface or IPMItool to clear the logs from the
operating system.
v When using the local console:
1. Run the Setup utility.
2. Select System Event Log.
3. Select Clear System Event Log.
4. Restart the server.
3818001 Core Root of Trust Measurement (CRTM) 1. Run the Setup utility, select Load Default
update failed Settings, and save the settings.
2. (Trained service technician only) Replace the
system board.
3818002 Core Root of Trust Measurement (CRTM) 1. Run the Setup utility, select Load Default
update aborted Settings, and save the settings.
2. (Trained service technician only) Replace the
system board.
3818003 Core Root of Trust Measurement (CRTM) 1. Run the Setup utility, select Load Default
flash lock failed Settings, and save the settings.
2. (Trained service technician only) Replace the
system board.
3818004 Core Root of Trust Measurement (CRTM) 1. Run the Setup utility, select Load Default
system error Settings, and save the settings.
2. (Trained service technician only) Replace the
system board.
3818005 Current Bank Core Root of Trust 1. Run the Setup utility, select Load Default
Measurement (CRTM) capsule signature Settings, and save the settings.
invalid
2. (Trained service technician only) Replace the
system board.
3818006 Opposite bank CRTM capsule signature 1. Switch the server firmware bank to the backup
invalid bank (see “Starting the backup server firmware”
on page 291).
2. Run the Setup utility, select Load Default
Settings, and save the settings.
3. Switch the bank back to the primary bank.
4. (Trained service technician only) Replace the
system board.
Chapter 3. Diagnostics 39
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support website at [Link] to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Error code Description Action
3818007 CRTM update capsule signature invalid 1. Run the Setup utility, select Load Default
Settings, and save the settings.
2. (Trained service technician only) Replace the
system board.
3828004 AEM power capping disabled 1. Check the settings and the event logs.
2. Make sure that the Active Energy Manager feature
is enabled in the Setup utility. Select System
Settings, Power, Active Energy, and Capping
Enabled.
3. Update the server firmware to the latest level (see
“Updating the firmware” on page 281).
4. Update the IMM firmware to the latest level (see
“Updating the firmware” on page 281).
Each message contains date and time information, and it indicates the source of
the message (POST or the IMM).
40 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Message Severity Description Action
Numeric sensor Ambient Temp going Error An upper critical sensor 1. Reduce the ambient
high (upper critical) has asserted. going high has asserted. temperature.
2. Check the server
airflow. Make sure that
nothing is blocking the
air from coming into or
preventing the air from
exiting the server.
Numeric sensor Ambient Temp going Error An upper nonrecoverable 1. Reduce the ambient
high (upper non-recoverable) has sensor going high has temperature.
asserted. asserted.
2. Check the server
airflow. Make sure that
nothing is blocking the
air from coming into or
preventing the air from
exiting the server.
Numeric sensor Planar 3.3V going low Error A lower critical sensor (Trained service technician
(lower critical) has asserted. going low has asserted. only) Replace the system
board.
Numeric sensor Planar 3.3V going high Error An upper critical sensor (Trained service technician
(upper critical) has asserted. going high has asserted. only) Replace the system
board.
Numeric sensor Planar 5V going low Error A lower critical sensor (Trained service technician
(lower critical) has asserted. going low has asserted. only) Replace the system
board.
Numeric sensor Planar 5V going high Error An upper critical sensor (Trained service technician
(upper critical) has asserted. going high has asserted. only) Replace the system
board.
Numeric sensor Planar VBAT going low Error A lower critical sensor Replace the 3 V battery.
(lower critical) has asserted. going low has asserted.
Numeric sensor Fan n A Tach going low Error A lower critical sensor 1. Reseat the failing fan
(lower critical) has asserted. going low has asserted. n, which is indicated by
(n = fan number) a lit LED near the fan
connector on the
system board.
2. Replace the failing fan.
(n = fan number)
Chapter 3. Diagnostics 41
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Numeric sensor Fan n B Tach going low Error A lower critical sensor 1. Reseat the failing fan
(lower critical) has asserted. going low has asserted. n, which is indicated by
(n = fan number) a lit LED near the fan
connector on the
system board.
2. Replace the failing fan.
(n = fan number)
The connector System board has Error An interconnect Reseat the front video
encountered a configuration error. configuration error has cable on the system
occurred. board.
42 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
The Processor CPU nStatus has Failed Error A processor failed - IERR 1. Make sure that the
with IERR. condition has occurred. latest levels of
(n = microprocessor number) firmware and device
drivers are installed for
all adapters and
standard devices, such
as Ethernet, SCSI, and
SAS.
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
2. Update the firmware
(UEFI and IMM) to the
latest level “Updating
the firmware” on page
281).
3. Run the DSA program.
4. Reseat the adapter.
5. Replace the adapter.
6. (Trained service
technician only)
Replace
microprocessor n.
7. (Trained service
technician only)
Replace the system
board.
(n = microprocessor
number)
Chapter 3. Diagnostics 43
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
An Over-Temperature Condition has Error An overtemperature 1. Make sure that the
been detected on the Processor CPU condition has occurred for fans are operating, that
nStatus. microprocessor n. there are no
(n = microprocessor number) (n = microprocessor obstructions to the
number) airflow (front and rear
of the server), that the
air baffles are in place
and correctly installed,
and that the server
cover is installed and
completely closed.
2. Make sure that the
heat sink for
microprocessor nis
installed correctly.
3. (Trained service
technician only)
Replace
microprocessor n.
(n = microprocessor
number)
44 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
The Processor CPU nStatus has Failed Error A processor failed - 1. Check for a server
with FRB1/BIST condition. FRB1/BIST condition has firmware update.
(n = microprocessor number) occurred. Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
2. Make sure that the
installed
microprocessors are
compatible with each
other (see “Installing a
microprocessor and
heat sink” on page 269
for information about
microprocessor
requirements).
3. (Trained service
technician only) Reseat
microprocessor n.
4. (Trained service
technician only)
Replace
microprocessor n.
(n = microprocessor
number)
Chapter 3. Diagnostics 45
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
The Processor CPU nStatus has a Error A processor configuration 1. Make sure that the
Configuration Mismatch. mismatch has occurred. installed
(n = microprocessor number) microprocessors are
compatible with each
other (see “Installing a
microprocessor and
heat sink” on page 269
for information about
microprocessor
requirements).
2. Update the server
firmware to the latest
level (see “Updating
the firmware” on page
281).
3. (Trained service
technician only)
Replace the
incompatible
microprocessor.
46 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
An SM BIOS Uncorrectable CPU Error An SMBIOS uncorrectable 1. Check for a server
complex error for Processor CPU CPU complex error has firmware update.
nStatus has asserted. asserted. Important: Some
(n = microprocessor number) cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
2. Make sure that the
installed
microprocessors are
compatible with each
other (see “Installing a
microprocessor and
heat sink” on page 269
for information about
microprocessor
requirements).
3. (Trained service
technician only) Reseat
microprocessor n.
4. (Trained service
technician only)
Replace
microprocessor n.
(n = microprocessor
number)
Chapter 3. Diagnostics 47
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Sensor CPU nOverTemp has Error A sensor has changed to 1. Make sure that the
transitioned to critical from a less severe Critical state from a less fans are operating, that
state. severe state. there are no
(n = microprocessor number) obstructions to the
airflow (front and rear
of the server), that the
air baffles are in place
and correctly installed,
and that the server
cover is installed and
completely closed.
2. Check the ambient
temperature. You must
be operating within the
specifications.
3. Make sure that the
heat sink for
microprocessor n is
installed correctly.
4. (Trained service
technician only)
Replace
microprocessor n.
(n = microprocessor
number)
Sensor CPU nOverTemp has Error A sensor has changed to 1. Make sure that the
transitioned to non-recoverable from a Nonrecoverable state from fans are operating, that
less severe state. a less severe state. there are no
(n = microprocessor number) obstructions to the
airflow (front and rear
of the server), that the
air baffles are in place
and correctly installed,
and that the server
cover is installed and
completely closed.
2. Make sure that the
heat sink for
microprocessor n is
installed correctly.
3. (Trained service
technician only)
Replace
microprocessor n.
(n = microprocessor
number)
48 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Sensor CPU nOverTemp has Error A sensor has changed to 1. Make sure that the
transitioned to critical from a Critical state from fans are operating, that
non-recoverable state. Nonrecoverable state. there are no
(n = microprocessor number) obstructions to the
airflow (front and fear
of the server), that the
air baffles are in place
and correctly installed,
and that the server
cover is installed and
completely closed.
2. Check the ambient
temperature. You must
be operating within the
specifications.
3. Make sure that the
heat sink for
microprocessor nis
installed correctly.
4. (Trained service
technician only)
Replace
microprocessor n.
(n = microprocessor
number)
Chapter 3. Diagnostics 49
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Sensor CPU nOverTemp has Error A sensor has changed to 1. Make sure that the
transitioned to non-recoverable. Nonrecoverable state. fans are operating, that
(n = microprocessor number) there are no
obstructions to the
airflow (front and rear
of the server), that the
air baffles are in place
and correctly installed,
and that the server
cover is installed and
completely closed.
2. Check the ambient
temperature. You must
be operating within the
specifications.
3. Make sure that the
heat sink for
microprocessor nis
installed correctly.
4. (Trained service
technician only)
Replace
microprocessor n.
(n = microprocessor
number)
50 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Processor n is operating in a Degraded Warning Throttling has occurred for 1. Make sure that the
State. microprocessor n. (n = fans are operating, that
(n = microprocessor number) microprocessor number) there are no
obstructions to the
airflow (front and rear
of the server), that the
air baffles are in place
and correctly installed,
and that the server
cover is installed and
completely closed.
2. Check the ambient
temperature. You must
be operating within the
specifications.
3. Make sure that the
heat sink for
microprocessor nis
installed correctly.
4. (Trained service
technician only)
Replace
microprocessor n.
(n = microprocessor
number)
A bus timeout has occurred on system Error A bus timeout has 1. Remove the adapter
%1. occurred. from the PCI slot that
(%1 = is indicated by a lit
CIM_ComputerSystem.ElementName) LED.
2. Replace the riser-card
assembly.
3. Remove all PCI
adapters.
4. (Trained service
technicians only)
Replace the system
board.
Chapter 3. Diagnostics 51
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
A software NMI has occurred on system Error A software NMI has 1. Check the device
%1. occurred. driver.
(%1 =
2. Reinstall the device
CIM_ComputerSystem.ElementName)
driver.
3. Update all device
drives to the latest
level.
4. Update the firmware
(UEFI and IMM) (see
“Updating the
firmware” on page
281).
52 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
The System %1 encountered a POST Error A POST error has 1. Make sure the server
Error. occurred. meets the minimum
(%1 = (Sensor = ABR Status) configuration to start
CIM_ComputerSystem.ElementName)
(see “Power-supply
LEDs” on page 129).
2. Recover the server
firmware from the
backup page:
a. Restart the server.
b. At the prompt,
press F3 to recover
the firmware.
3. Update the server
firmware to the latest
level (see “Updating
the firmware” on page
281).
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
4. Remove components
one at a time,
restarting the server
each time, to see if the
problem goes away.
5. If the problem remains,
(trained service
technician) replace the
system board.
Chapter 3. Diagnostics 53
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
The System %1 encountered a POST Error A POST error has 1. Make sure the server
Error. occurred. meets the minimum
(%1 = (Sensor = Firmware Error) configuration to start
CIM_ComputerSystem.ElementName)
(see “Power-supply
LEDs” on page 129).
2. Update the server
firmware on the
primary page.
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
3. (Trained service
technician only)
Replace the system
board.
54 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
An Uncorrectable Bus Error has Error A bus uncorrectable error 1. Check the
occurred on system %1. has occurred. system-event log.
(%1 = CIM_ComputerSystem. (Sensor = Critical Int PCI)
2. Check the PCI error
ElementName)
LEDs.
3. Remove the adapter
from the indicated PCI
slot.
4. Check for a server
firmware update.
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
5. (Trained service
technician only)
Replace the system
board.
Chapter 3. Diagnostics 55
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
An Uncorrectable Bus Error has Error A bus uncorrectable error 1. Check the
occurred on system %1. has occurred. system-event log.
(%1 = (Sensor = Critical Int CPU)
2. Check the
CIM_ComputerSystem.ElementName)
microprocessor error
LEDs.
3. Remove the failing
microprocessor from
the system board.
4. Check for a server
firmware update.
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
5. Make sure that the two
microprocessors are
matching.
6. (Trained service
technician only)
Replace the system
board.
56 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
An Uncorrectable Bus Error has Error A bus uncorrectable error 1. Check the
occurred on system %1. has occurred. system-event log.
(%1 = (Sensor = Critical Int
2. Check the DIMM error
CIM_ComputerSystem.ElementName) DIMM)
LEDs.
3. Remove the failing
DIMM from the system
board.
4. Check for a server
firmware update.
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
5. Make sure that the
installed DIMMs are
supported and
configured correctly.
6. (Trained service
technician only)
Replace the system
board.
Chapter 3. Diagnostics 57
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Sensor Sys Board Fault has transitioned Error A sensor has changed to 1. Check the
to critical from a less severe state. Critical state from a less system-event log.
severe state.
2. Check for an error LED
on the system board.
3. Replace any failing
device.
4. Check for a server
firmware update.
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
5. (Trained service
technician only)
Replace the system
board.
The Power Supply (Power Supply: n) Error Power supply nhas failed. 1. If the power-on LED is
has Failed. (n = power supply number) lit, complete the
(n = power supply number) following steps:
a. Reduce the server
to the minimum
configuration (see
“Power-supply
LEDs” on page
129).
b. Reinstall the
components one at
a time, restarting
the server each
time.
c. If the error recurs,
replace the
component that you
just reinstalled.
2. Reseat power supply
n.
3. Replace power supply
n.
(n = power supply number)
58 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Sensor PS n Fan Fault has transitioned Error A sensor has changed to 1. Make sure that there
to critical from a less severe state. Critical state from a less are no obstructions,
(n = power supply number) severe state. such as bundled
cables, to the airflow
from the power-supply
fan.
2. Replace power supply
n.
(n = power supply number)
Sensor VT Fault has transitioned to Error A sensor has changed to 1. Check the
non-recoverable. Nonrecoverable state. power-supply LEDs.
2. Follow the actions in
“Power-supply LEDs”
on page 129 and
“Power problems” on
page 113.
3. Replace the failing
power supply.
4. (Trained service
technician only)
Replace the system
board.
Sensor Pwr Rail A Fault has transitioned Error A sensor has changed to 1. Turn off the server and
to non-recoverable. Nonrecoverable state. disconnect it from
power.
2. Remove the optical
drive, fans, hard disk
drives, and hard disk
drive backplane.
3. Reinstall each device,
one at a time, starting
the server each time to
isolate the failing
device.
4. Replace the failing
device.
5. (Trained service
technician only)
Replace the system
board.
Chapter 3. Diagnostics 59
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Sensor Pwr Rail B Fault has transitioned Error A sensor has changed to 1. Turn off the server and
to non-recoverable. Nonrecoverable state. disconnect it from
power.
2. Remove the optical
drive, fans, hard disk
drives, and hard disk
drive backplane.
3. Reinstall each device,
one at a time, starting
the server each time to
isolate the failing
device.
4. Replace the failing
device.
5. (Trained service
technician only)
Replace the system
board.
Sensor Pwr Rail C Fault has Error A sensor has changed to 1. Turn off the server and
transitioned to non-recoverable. Nonrecoverable state. disconnect it from
power.
2. (Trained service
technician only)
Remove the SAS/SATA
RAID riser card, the
DIMMs in connectors 1
through 8, and the
microprocessor in
socket 1.
3. Reinstall each device,
one at a time, starting
the server each time to
isolate the failing
device.
4. Replace the failing
device.
5. (Trained service
technician only)
Replace the system
board.
60 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Sensor Pwr Rail D Fault has Error A sensor has changed to 1. Turn off the server and
transitioned to non-recoverable. Nonrecoverable state. disconnect it from
power.
2. (Trained service
technician only)
Remove the
microprocessor from
socket 1.
3. (Trained service
technician only)
Reinstall the
microprocessor in
socket 1 and restart
the server.
4. (Trained service
technician only)
Replace the failing
microprocessor.
5. (Trained service
technician only)
Replace the system
board.
Sensor Pwr Rail E Fault has transitioned Error A sensor has changed to 1. Turn off the server and
to non-recoverable. Nonrecoverable state. disconnect it from
power.
2. (Trained service
technician only)
Remove the PCI riser
card from PCI
riser-card connector 2
and the
microprocessor from
socket 2.
3. Reinstall each device,
one at a time, starting
the server each time to
isolate the failing
device.
4. Replace the failing
device.
5. (Trained service
technician only)
Replace the system
board.
Chapter 3. Diagnostics 61
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Sensor PS n Therm Fault has Error A sensor has changed to 1. Make sure that there
transitioned to critical from a less severe Critical state from a less are no obstructions,
state. severe state. such as bundled
(n = power supply number) cables, to the airflow
from the power-supply
fan.
2. Replace power supply
n.
(n = power supply number)
Redundancy Power Unit has been Error Redundancy has been lost 1. Check the LEDs for
reduced. and is insufficient to both power supplies.
continue operation.
2. Follow the actions in
“Power-supply LEDs”
on page 129.
Redundancy Cooling Zone 1 has been Error Redundancy has been lost 1. Make sure that the
reduced. and is insufficient to connectors on fans 1
continue operation. and 2 are not
damaged.
2. Make sure that the fan
1 and 2 connectors on
the system board are
not damaged.
3. Make sure that the
fans are correctly
installed.
4. Reseat the fans.
5. Replace the fans.
Sensor RAID Error has transitioned to Error A sensor has changed to 1. Check the hard disk
critical from a less severe state. Critical state from a less drive amber status
severe state. LEDs to identify the
failed hard disk drive
or check the RAID
controller system
management software
event logs.
2. Reseat the hard disk
drive for which the
status LED is lit.
3. Replace the defective
hard disk drive.
62 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
The Drive n Status has been removed Error A drive has been removed. 1. Reseat hard disk drive
from unit Drive 0 Status. n.
(n = hard disk drive number) (n = hard disk drive
number). Wait 1 minute
or more before
reinstalling the drive.
2. Replace the hard disk
drive.
3. Make sure that the
disk firmware and
RAID controller
firmware is at the latest
level.
4. Check the SAS cable.
The Drive n Status has been disabled Error A drive has been disabled 1. Run the hard disk drive
due to a detected fault. because of a fault. diagnostic test on drive
(n = hard disk drive number) n.
2. Reseat the following
components:
a. Hard disk drive
(wait 1 minute or
more before
reinstalling the
drive).
b. Cable from the
system board to
the backplane
3. Replace the following
components one at a
time, in the order
shown, restarting the
server each time:
a. Hard disk drive
b. Cable from the
system board to
the backplane
c. Hard disk drive
backplane
(n = hard disk drive
number)
Chapter 3. Diagnostics 63
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Array %1 is in critical condition. Error An array is in Critical state. 1. Make sure that the
(%1 = (Sensor = Drive n Status) RAID controller
CIM_ComputerSystem.ElementName) (n = hard disk drive firmware and hard disk
number) drive firmware is at the
latest level.
2. Make sure that the
SAS cable is
connected correctly.
3. Replace the SAS
cable.
4. Replace the controller.
5. Replace the hard disk
drive that is indicated
by a lit status LED.
Array %1 has failed. Error An array is in Failed state. 1. Make sure that the
(%1 = (Sensor = Drive n Status) RAID controller
CIM_ComputerSystem.ElementName) (n = hard disk drive firmware and hard disk
number) drive firmware is at the
latest level.
2. Make sure that the
SAS cable is
connected correctly.
3. Replace the SAS
cable.
4. Replace the controller.
5. Replace the hard disk
drive that is indicated
by a lit status LED.
64 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory uncorrectable error detected for Error A memory uncorrectable 1. Check the IBM support
DIMM All DIMMs on Memory Subsystem error has occurred. website for an
All DIMMs. applicable retain tip or
firmware update that
applies to this memory
error.
2. Manually re-enable all
affected DIMMs if the
server firmware version
is older than UEFI
v1.10. If the server
firmware version is
UEFI v1.10 or newer,
disconnect and
reconnect the server to
the power source and
restart the server.
3. Swap the affected
DIMMs (as indicated
by the error LEDs on
the system board or
the event logs) to a
different memory
channel or
microprocessor (see
“Installing a memory
module” on page 216
for memory
population).
4. If the problem follows
the DIMM, replace the
failing DIMM (see
“Removing a memory
module” on page 215
and “Installing a
memory module” on
page 216).
(Continued on the next
page)
Chapter 3. Diagnostics 65
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory uncorrectable error detected for Error A memory uncorrectable 5. (Trained service
DIMM All DIMMs on Memory Subsystem error has occurred. technician only) If the
All DIMMs. problem occurs on the
same DIMM connector,
check the DIMM
connector. If the
connector contains any
foreign material or is
damaged, replace the
system board (see
“Removing the system
board” on page 275
and “Installing the
system board” on page
277).
6. (Trained service
technician only)
Remove the affected
microprocessor and
check the
microprocessor socket
pins for any damaged
pins. If a damage is
found, replace the
system board (see
“Removing the system
board” on page 275
and “Installing the
system board” on page
277).
7. (Trained Service
technician only)
Replace the affected
microprocessor (see
“Removing a
microprocessor and
heat sink” on page 267
and “Installing a
microprocessor and
heat sink” on page
269).
66 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory Logging Limit Reached for Error The memory logging limit 1. Check the IBM support
DIMM All DIMMs on Memory Subsystem has been reached. website for an
All DIMMs. applicable retain tip or
firmware update that
applies to this memory
error.
2. Swap the affected
DIMMs (as indicated
by the error LEDs on
the system board or
the event logs) to a
different memory
channel or
microprocessor (see
“Installing a memory
module” on page 216
for memory
population).
3. If the error still occurs
on the same DIMM,
replace the affected
DIMM.
4. (Trained service
technician only) If the
problem occurs on the
same DIMM connector,
check the DIMM
connector. If the
connector contains any
foreign material or is
damaged, replace the
system board (see
“Removing the system
board” on page 275
and “Installing the
system board” on page
277).
(Continued on the next
page)
Chapter 3. Diagnostics 67
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory Logging Limit Reached for Error The memory logging limit 5. (Trained service
DIMM All DIMMs on Memory Subsystem has been reached. technician only)
All DIMMs. Remove the affected
microprocessor and
check the
microprocessor socket
pins for any damaged
pins. If a damage is
found, replace the
system board (see
“Removing the system
board” on page 275
and “Installing the
system board” on page
277).
6. (Trained Service
technician only)
Replace the affected
microprocessor (see
“Removing a
microprocessor and
heat sink” on page 267
and “Installing a
microprocessor and
heat sink” on page
269).
Memory DIMM Configuration Error for Error A DIMM configuration error Make sure that DIMMs are
All DIMMs on Memory Subsystem All has occurred. installed in the correct
DIMMs. sequence and have the
same size, type, speed,
and technology.
68 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory DIMM disabled for All DIMMs Info DIMM disabled 1. Make sure the DIMM is
on Memory Subsystem All DIMMs. installed correctly (see
“Installing a memory
module” on page 216).
2. If the DIMM was
disabled because of a
memory fault (memory
uncorrectable error or
memory logging limit
reached), follow the
suggested actions for
that error event and
restart the server.
3. Check the IBM support
website for an
applicable retain tip or
firmware update that
applies to this memory
event. If no memory
fault is recorded in the
logs and no DIMM
connector error LED is
lit, you can re-enable
the DIMM through the
Setup utility or the
Advanced Settings
Utility (ASU).
Chapter 3. Diagnostics 69
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory uncorrectable error detected for Error A memory uncorrectable 1. Check the IBM support
DIMM One of the DIMMs on Memory error has occurred. website for an
Subsystem One of the DIMMs. applicable retain tip or
firmware update that
applies to this memory
error.
2. Manually re-enable all
affected DIMMs if the
server firmware version
is older than UEFI
v1.10. If the server
firmware version is
UEFI v1.10 or newer,
disconnect and
reconnect the server to
the power source and
restart the server.
3. Swap the affected
DIMMs (as indicated
by the error LEDs on
the system board or
the event logs) to a
different memory
channel or
microprocessor (see
“Installing a memory
module” on page 216
for memory
population).
4. If the problem follows
the DIMM, replace the
failing DIMM (see
“Removing a memory
module” on page 215
and “Installing a
memory module” on
page 216).
(Continued on the next
page)
70 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory uncorrectable error detected for Error A memory uncorrectable 5. (Trained service
DIMM One of the DIMMs on Memory error has occurred. technician only) If the
Subsystem One of the DIMMs. problem occurs on the
same DIMM connector,
check the DIMM
connector. If the
connector contains any
foreign material or is
damaged, replace the
system board (see
“Removing the system
board” on page 275
and “Installing the
system board” on page
277).
6. (Trained service
technician only)
Remove the affected
microprocessor and
check the
microprocessor socket
pins for any damaged
pins. If a damage is
found, replace the
system board (see
“Removing the system
board” on page 275
and “Installing the
system board” on page
277).
7. (Trained Service
technician only)
Replace the affected
microprocessor (see
“Removing a
microprocessor and
heat sink” on page 267
and “Installing a
microprocessor and
heat sink” on page
269).
Chapter 3. Diagnostics 71
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory Logging Limit Reached for Error The memory logging limit 1. Check the IBM support
DIMM One of the DIMMs on Memory has been reached. website for an
Subsystem One of the DIMMs. applicable retain tip or
firmware update that
applies to this memory
error.
2. Swap the affected
DIMMs (as indicated
by the error LEDs on
the system board or
the event logs) to a
different memory
channel or
microprocessor (see
“Installing a memory
module” on page 216
for memory
population).
3. If the error still occurs
on the same DIMM,
replace the affected
DIMM.
4. (Trained service
technician only) If the
problem occurs on the
same DIMM connector,
check the DIMM
connector. If the
connector contains any
foreign material or is
damaged, replace the
system board (see
“Removing the system
board” on page 275
and “Installing the
system board” on page
277).
(Continued on the next
page)
72 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory Logging Limit Reached for Error The memory logging limit 5. (Trained service
DIMM One of the DIMMs on Memory has been reached. technician only)
Subsystem One of the DIMMs. Remove the affected
microprocessor and
check the
microprocessor socket
pins for any damaged
pins. If a damage is
found, replace the
system board (see
“Removing the system
board” on page 275
and “Installing the
system board” on page
277).
6. (Trained Service
technician only)
Replace the affected
microprocessor (see
“Removing a
microprocessor and
heat sink” on page 267
and “Installing a
microprocessor and
heat sink” on page
269).
Memory DIMM Configuration Error for Error A DIMM configuration error Make sure that DIMMs are
One of the DIMMs on Memory has occurred. installed in the correct
Subsystem One of the DIMMs. sequence and have the
same size, type, speed,
and technology.
Chapter 3. Diagnostics 73
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory DIMM disabled for One of the Info DIMM disabled 1. Make sure the DIMM is
DIMMs on Memory Subsystem One of installed correctly (see
the DIMMs. “Installing a memory
module” on page 216).
2. If the DIMM was
disabled because of a
memory fault (memory
uncorrectable error or
memory logging limit
reached), follow the
suggested actions for
that error event and
restart the server.
3. Check the IBM support
website for an
applicable retain tip or
firmware update that
applies to this memory
event. If no memory
fault is recorded in the
logs and no DIMM
connector error LED is
lit, you can re-enable
the DIMM through the
Setup utility or the
Advanced Settings
Utility (ASU).
74 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory uncorrectable error detected for Error A memory uncorrectable 1. Check the IBM support
DIMM n Status on Memory Subsystem error has occurred. website for an
DIMM n Status. applicable retain tip or
(n = DIMM number) firmware update that
applies to this memory
error.
2. Manually re-enable all
affected DIMMs if the
server firmware version
is older than UEFI
v1.10. If the server
firmware version is
UEFI v1.10 or newer,
disconnect and
reconnect the server to
the power source and
restart the server.
3. Swap the affected
DIMMs (as indicated
by the error LEDs on
the system board or
the event logs) to a
different memory
channel or
microprocessor (see
“Installing a memory
module” on page 216
for memory
population).
4. If the problem follows
the DIMM, replace the
failing DIMM (see
“Removing a memory
module” on page 215
and “Installing a
memory module” on
page 216).
(Continued on the next
page)
Chapter 3. Diagnostics 75
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory uncorrectable error detected for Error A memory uncorrectable 5. (Trained service
DIMM n Status on Memory Subsystem error has occurred. technician only) If the
DIMM n Status. problem occurs on the
(n = DIMM number) same DIMM connector,
check the DIMM
connector. If the
connector contains any
foreign material or is
damaged, replace the
system board (see
“Removing the system
board” on page 275
and “Installing the
system board” on page
277).
6. (Trained service
technician only)
Remove the affected
microprocessor and
check the
microprocessor socket
pins for any damaged
pins. If a damage is
found, replace the
system board (see
“Removing the system
board” on page 275
and “Installing the
system board” on page
277).
7. (Trained Service
technician only)
Replace the affected
microprocessor (see
“Removing a
microprocessor and
heat sink” on page 267
and “Installing a
microprocessor and
heat sink” on page
269).
76 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory Logging Limit Reached for Error The memory logging limit 1. Check the IBM support
DIMM nStatus on Memory Subsystem has been reached. website for an
DIMMnStatus. applicable retain tip or
(n = DIMM number) firmware update that
applies to this memory
error.
2. Swap the affected
DIMMs (as indicated
by the error LEDs on
the system board or
the event logs) to a
different memory
channel or
microprocessor (see
“Installing a memory
module” on page 216
for memory
population).
3. If the error still occurs
on the same DIMM,
replace the affected
DIMM.
4. (Trained service
technician only) If the
problem occurs on the
same DIMM connector,
check the DIMM
connector. If the
connector contains any
foreign material or is
damaged, replace the
system board (see
“Removing the system
board” on page 275
and “Installing the
system board” on page
277).
(Continued on the next
page)
Chapter 3. Diagnostics 77
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory Logging Limit Reached for Error The memory logging limit 5. (Trained service
DIMM nStatus on Memory Subsystem has been reached. technician only)
DIMMnStatus. Remove the affected
(n = DIMM number) microprocessor and
check the
microprocessor socket
pins for any damaged
pins. If a damage is
found, replace the
system board (see
“Removing the system
board” on page 275
and “Installing the
system board” on page
277).
6. (Trained Service
technician only)
Replace the affected
microprocessor (see
“Removing a
microprocessor and
heat sink” on page 267
and “Installing a
microprocessor and
heat sink” on page
269).
Memory DIMM Configuration Error for Error A DIMM configuration error Make sure that DIMMs are
DIMM n Status on Memory Subsystem has occurred. installed in the correct
DIMM n Status. sequence and have the
(n = DIMM number) same size, type, speed,
and technology.
78 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory DIMM disabled for DIMM n Info DIMM disabled 1. Make sure the DIMM is
Status on Memory Subsystem DIMM n installed correctly (see
Status. “Installing a memory
(n = DIMM number) module” on page 216).
2. If the DIMM was
disabled because of a
memory fault (memory
uncorrectable error or
memory logging limit
reached), follow the
suggested actions for
that error event and
restart the server.
3. Check the IBM support
website for an
applicable retain tip or
firmware update that
applies to this memory
event. If no memory
fault is recorded in the
logs and no DIMM
connector error LED is
lit, you can re-enable
the DIMM through the
Setup utility or the
Advanced Settings
Utility (ASU).
Chapter 3. Diagnostics 79
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory DIMM scrub failure for DIMM n Error DIMM scrub failure. 1. Check the IBM support
Status on Memory Subsystem DIMM n website for an
Status. applicable retain tip or
(n = DIMM number) firmware update that
applies to this memory
error.
2. Manually re-enable all
affected DIMMs if the
server firmware version
is older than UEFI
v1.10. If the server
firmware version is
UEFI v1.10 or newer,
disconnect and
reconnect the server to
the power source and
restart the server.
3. Swap the affected
DIMMs (as indicated
by the error LEDs on
the system board or
the event logs) to a
different memory
channel or
microprocessor (see
“Installing a memory
module” on page 216
for memory
population).
4. If the problem is
related to a DIMM,
replace the failing
DIMM (see “Removing
a memory module” on
page 215 and
“Installing a memory
module” on page 216).
(Continued on the next
page)
80 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory DIMM scrub failure for DIMM n Error DIMM scrub failure. 5. (Trained service
Status on Memory Subsystem DIMM n technician only) If the
Status. problem occurs on the
(n = DIMM number) same DIMM connector,
check the DIMM
connector. If the
connector contains any
foreign material or is
damaged, replace the
system board (see
“Removing the system
board” on page 275
and “Installing the
system board” on page
277).
6. (Trained service
technician only)
Remove the affected
microprocessor and
check the
microprocessor socket
pins for any damaged
pins. If a damage is
found, replace the
system board (see
“Removing the system
board” on page 275
and “Installing the
system board” on page
277).
7. (Trained service
technician only) If the
problem is related to
microprocessor socket
pins, replace the
system board (see
“Removing the system
board” on page 275
and “Installing the
system board” on page
277).
(Continued on the next
page)
Chapter 3. Diagnostics 81
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Memory DIMM scrub failure for DIMM n Error DIMM scrub failure. 8. (Trained Service
Status on Memory Subsystem DIMM n technician only)
Status. Replace the affected
(n = DIMM number) microprocessor (see
“Removing a
microprocessor and
heat sink” on page 267
and “Installing a
microprocessor and
heat sink” on page
269).
Sensor DIMM n Temp has transitioned Error A sensor has changed to 1. Make sure that the
to critical from a less severe state. Critical state from a less fans are operating, that
(n = DIMM number) severe state. there are no
obstructions to the
airflow, that the air
baffles are in place and
correctly installed, and
that the server cover is
installed and
completely closed.
2. Make sure that
ambient temperature is
within the
specifications.
3. If a fan has failed,
complete the action for
a fan failure.
4. Replace DIMM n.
(n = DIMM number)
82 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
All PCI error Error PCI bridge (IOH) error 1. Check the riser-card
LEDs.
2. Reseat the affected
adapters and riser
card.
3. Update the server
firmware (UEFI and
IMM) and adapter
firmware.
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
4. Remove both adapters.
5. Replace the riser
cards.
6. (Trained service
technicians only)
Replace the system
board.
Chapter 3. Diagnostics 83
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
A PCI PERR has occurred on system Error A PCI PERR has occurred. 1. Check the riser-card
%1. (Sensor = All PCI Err) LEDs.
(%1 =
2. Reseat the affected
CIM_ComputerSystem.ElementName)
adapters and riser
card.
3. Update the server
firmware (UEFI and
IMM) and adapter
firmware.
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
4. Remove both adapters.
5. Replace the PCIe
adapter.
6. Replace the riser card.
84 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
A PCI SERR has occurred on system Error A PCI SERR has occurred. 1. Check the riser-card
%1. (Sensor = All PCI Err) LEDs.
(%1 =
2. Reseat the affected
CIM_ComputerSystem.ElementName)
adapters and riser
card.
3. Update the server
firmware (UEFI and
IMM) and adapter
firmware.
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
4. Make sure that the
adapter is on the
serverproven list.
5. Remove both adapters.
6. Replace the PCIe
adapter.
7. Replace the riser card.
Chapter 3. Diagnostics 85
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
A PCI PERR has occurred on system Error A PCI PERR has occurred. 1. Check the riser-card
%1. (Sensor = PCI Slot n; n = LEDs.
(%1 = PCI slot number)
2. Reseat the affected
CIM_ComputerSystem.ElementName)
adapters and riser
card.
3. Update the server
firmware (UEFI and
IMM) and adapter
firmware.
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
4. Make sure that the
adapter is on the
serverproven list.
5. Remove the adapter
from slot n.
6. Replace the PCIe
adapter.
7. Replace riser card n.
(n = PCI slot number)
86 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
A PCI SERR has occurred on system Error A PCI SERR has occurred. 1. Check the riser-card
%1. (Sensor = PCI Slot n; n = LEDs.
(%1 = PCI slot number)
2. Reseat the affected
CIM_ComputerSystem.ElementName)
adapters and riser
card.
3. Update the server
firmware (UEFI and
IMM) and adapter
firmware.
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
4. Make sure that the
adapter is on the
serverproven list.
5. Remove the adapter
from slot n.
6. Replace the PCIe
adapter.
7. Replace riser card n.
(n = PCI slot number)
Chapter 3. Diagnostics 87
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
A PCI PERR has occurred on system Error A PCI PERR has occurred. 1. Check the riser-card
%1. (Sensor = One of PCI Err) LEDs.
(%1 =
2. Reseat the affected
CIM_ComputerSystem.ElementName)
adapters and riser
card.
3. Update the server
firmware (UEFI and
IMM) and adapter
firmware.
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
4. Make sure that the
adapter is on the
serverproven list.
5. Remove both adapters.
6. Replace the PCIe
adapter.
7. Replace the riser card.
8. (Trained service
technician only)
Replace the system
board.
88 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
A PCI SERR has occurred on system Error A PCI SERR has occurred. 1. Check the riser-card
%1. (Sensor = One of PCI Err) LEDs.
(%1 =
2. Reseat the affected
CIM_ComputerSystem.ElementName)
adapters and riser
card.
3. Update the server
firmware (UEFI and
IMM) and adapter
firmware.
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
4. Make sure that the
adapter is on the
serverproven list.
5. Remove both adapters.
6. Replace the PCIe
adapter.
7. Replace the riser card.
8. (Trained service
technician only)
Replace the system
board.
Chapter 3. Diagnostics 89
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Fault in slot System board on system Error 1. Check the riser-card
%1. LEDs.
(%1 =
2. Reseat the affected
CIM_ComputerSystem.ElementName)
adapters and riser
card.
3. Update the server
firmware (UEFI and
IMM) and adapter
firmware.
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
4. Make sure that the
adapter is on the
serverproven list.
5. Remove both adapters.
6. Replace the PCIe
adapter.
7. Replace the riser card.
8. (Trained service
technician only)
Replace the system
board.
Redundancy Bckup Mem Status has Error Redundancy has been lost 1. Check the
been reduced. and is insufficient to system-event log for
continue operation. DIMM failure events
(uncorrectable or PFA)
and correct the
failures.
2. Re-enable mirroring in
the Setup utility.
Sensor Planar Fault has transitioned to Error A sensor has changed to (Trained service technician
critical from a less severe state. Critical state from a less only) Replace the system
severe state. board.
IMM Network Initialization Complete. Info An IMM network has No action; information only.
completed initialization.
90 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Certificate Authority %1 has detected a Error A problem has occurred 1. Make sure that the
%2 Certificate Error. with the SSL Server, SSL certificate that you are
(%1 = IBM_CertificateAuthority. Client, or SSL Trusted CA importing is correct.
CADistinguishedName; %2 = certificate that has been
2. Try importing the
CIM_PublicKeyCertificate. imported into the IMM. The
certificate again.
ElementName) imported certificate must
contain a public key that
corresponds to the key pair
that was previously
generated by the Generate
a New Key and
Certificate Signing
Request link.
Ethernet Data Rate modified from %1 to Info A user has modified the No action; information only.
%2 by user %3. Ethernet port data rate.
(%1 = CIM_EthernetPort.Speed; %2 =
CIM_EthernetPort.Speed; %3 = user ID)
Ethernet Duplex setting modified from Info A user has modified the No action; information only.
%1 to %2 by user %3. Ethernet port duplex
(%1 = CIM_EthernetPort.FullDuplex; %2 setting.
= CIM_EthernetPort.FullDuplex; %3 =
user ID)
Ethernet MTU setting modified from %1 Info A user has modified the No action; information only.
to %2 by user %3. Ethernet port MTU setting.
(%1 = CIM_EthernetPort.
ActiveMaximumTransmissionUnit;
%2 = CIM_EthernetPort.
ActiveMaximumTransmissionUnit;
%3 = user ID)
Ethernet Duplex setting modified from Info A user has modified the No action; information only.
%1 to %2 by user %3. Ethernet port MAC address
(%1 = setting.
CIM_EthernetPort.NetworkAddresses;
%2 =
CIM_EthernetPort.NetworkAddresses;
%3 = user ID)
Ethernet interface %1 by user %2. Info A user has enabled or No action; information only.
(%1 = CIM_EthernetPort.EnabledState; disabled the Ethernet
%2 = user ID) interface.
Hostname set to %1 by user %2. Info A user has modified the No action; information only.
(%1 = host name of the IMM.
CIM_DNSProtocolEndpoint.Hostname;
%2 = user ID)
Chapter 3. Diagnostics 91
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
IP address of network interface modified Info A user has modified the IP No action; information only.
from %1 to %2 by user %3. address of the IMM.
(%1 =
CIM_IPProtocolEndpoint.IPv4Address;
%2 =
CIM_StaticIPAssignmentSettingData.
IPAddress; %3 = user ID)
IP subnet mask of network interface Info A user has modified the IP No action; information only.
modified from %1 to %2 by user %3s. subnet mask of the IMM.
(%1 =
CIM_IPProtocolEndpoint.SubnetMask;
%2 =
CIM_StaticIPAssignmentSettingData.
SubnetMask; %3 = user ID)
IP address of default gateway modified Info A user has modified the No action; information only.
from %1 to %2 by user %3s. default gateway IP address
(%1 = CIM_IPProtocolEndpoint. of the IMM.
GatewayIPv4Address; %2 =
CIM_StaticIPAssignmentSettingData.
DefaultGatewayAddress;
%3 = user ID)
OS Watchdog response %1 by %2. Info A user has enabled or No action; information only.
(%1 = Enabled or Disabled; %2 = user disabled an OS Watchdog.
ID)
DHCP[%1] failure, no IP address Info A DHCP server has failed 1. Make sure that the
assigned. to assign an IP address to network cable is
(%1 = IP address, [Link]) the IMM. connected.
2. Make sure that there is
a DHCP server on the
network that can
assign an IP address
to the IMM.
Remote Login Successful. Login ID: %1 Info A user has successfully No action; information only.
from %2 at IP address %3. logged in to the IMM.
(%1 = user ID; %2 =
ValueMap(CIM_ProtocolEndpoint.
ProtocolIFType;
%3 = IP address, [Link])
Attempting to %1 server %2 by user Info A user has used the IMM No action; information only.
%3. to perform a power
(%1 = Power Up, Power Down, Power function on the server.
Cycle, or Reset; %2 =
IBM_ComputerSystem.ElementName;
%3 = user ID)
92 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Security: Userid: '%1' had %2 login Error A user has exceeded the 1. Make sure that the
failures from WEB client at IP address maximum number of correct login ID and
%3. unsuccessful login password are being
(%1 = user ID; %2 = attempts from a Web used.
MaximumSuccessiveLoginFailures browser and has been
2. Have the system
(currently set to 5 in the firmware); %3 = prevented from logging in
administrator reset the
IP address, [Link]) for the lockout period.
login ID or password.
Security: Login ID: '%1' had %2 login Error A user has exceeded the 1. Make sure that the
failures from CLI at %3. maximum number of correct login ID and
(%1 = user ID; %2 = unsuccessful login password are being
MaximumSuccessiveLoginFailures attempts from the used.
(currently set to 5 in the firmware); %3 = command-line interface
2. Have the system
IP address, [Link]) and has been prevented
administrator reset the
from logging in for the
login ID or password.
lockout period.
Remote access attempt failed. Invalid Error A user has attempted to 1. Make sure that the
userid or password received. Userid is log in from a Web browser correct login ID and
'%1' from WEB browser at IP address by using an invalid login ID password are being
%2. or password. used.
(%1 = user ID; %2 = IP address,
2. Have the system
[Link])
administrator reset the
login ID or password.
Remote access attempt failed. Invalid Error A user has attempted to 1. Make sure that the
userid or password received. Userid is log in from a Telnet correct login ID and
'%1' from TELNET client at IP address session by using an invalid password are being
%2. login ID or password. used.
(%1 = user ID; %2 = IP address,
2. Have the system
[Link])
administrator reset the
login ID or password.
The Chassis Event Log (CEL) on Info A user has cleared the No action; information only.
system %1 cleared by user %2. IMM event log.
(%1 =
CIM_ComputerSystem.ElementName;
%2 = user ID)
IMM reset was initiated by user %1. Info A user has initiated a reset No action; information only.
(%1 = user ID) of the IMM.
Chapter 3. Diagnostics 93
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
ENET[0] DHCP-HSTN=%1, DN=%2, Info The DHCP server has No action; information only.
IP@=%3, SN=%4, GW@=%5, assigned an IMM IP
DNS1@=%6. address and configuration.
(%1 =
CIM_DNSProtocolEndpoint.Hostname;
%2 =
CIM_DNSProtocolEndpoint.DomainName;
%3 =
CIM_IPProtocolEndpoint.IPv4Address;
%4 =
CIM_IPProtocolEndpoint.SubnetMask;
%5 = IP address, [Link]; %6 =
IP address, [Link])
ENET[0] IP-Cfg:HstName=%1, IP@%2, Info An IMM IP address and No action; information only.
NetMsk=%3, GW@=%4. configuration have been
(%1 = assigned using client data.
CIM_DNSProtocolEndpoint.Hostname;
%2 =
CIM_StaticIPSettingData.IPv4Address;
%3 =
CIM_StaticIPSettingData.SubnetMask;
%4 = CIM_StaticIPSettingData.
DefaultGatewayAddress)
LAN: Ethernet[0] interface is no longer Info The IMM Ethernet interface No action; information only.
active. has been disabled.
LAN: Ethernet[0] interface is now active. Info The IMM Ethernet interface No action; information only.
has been enabled.
DHCP setting changed to by user %1. Info A user has changed the No action; information only.
(%1 = user ID) DHCP mode.
IMM: Configuration %1 restored from a Info A user has restored the No action; information only.
configuration file by user %2. IMM configuration by
(%1 = CIM_ConfigurationData. importing a configuration
ConfigurationName; %2 = user ID) file.
94 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Watchdog %1 Screen Capture Error An operating-system error 1. Reconfigure the
Occurred. has occurred, and the watchdog timer to a
(%1 = OS Watchdog or Loader screen capture was higher value.
Watchdog) successful.
2. Make sure that the
IMM Ethernet over
USB interface is
enabled.
3. Reinstall the RNDIS or
cdc_ether device driver
for the operating
system.
4. Disable the watchdog.
5. Check the integrity of
the installed operating
system.
Watchdog %1 Failed to Capture Screen. Error An operating-system error 1. Reconfigure the
(%1 = OS Watchdog or Loader has occurred, and the watchdog timer to a
Watchdog) screen capture failed. higher value.
2. Make sure that the
IMM Ethernet over
USB interface is
enabled.
3. Reinstall the RNDIS or
cdc_ether device driver
for the operating
system.
4. Disable the watchdog.
5. Check the integrity of
the installed operating
system.
6. Update the IMM
firmware.
Important: Some
cluster solutions
require specific code
levels or coordinated
code updates. If the
device is part of a
cluster solution, verify
that the latest level of
code is supported for
the cluster solution
before you update the
code.
Chapter 3. Diagnostics 95
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Running the backup IMM main Error The IMM has resorted to Update the IMM firmware.
application. running the backup main Important: Some cluster
application. solutions require specific
code levels or coordinated
code updates. If the device
is part of a cluster solution,
verify that the latest level
of code is supported for
the cluster solution before
you update the code.
Please ensure that the IMM is flashed Error The server does not Update the IMM firmware
with the correct firmware. The IMM is support the installed IMM to a version that the server
unable to match its firmware to the firmware version. supports.
server. Important: Some cluster
solutions require specific
code levels or coordinated
code updates. If the device
is part of a cluster solution,
verify that the latest level
of code is supported for
the cluster solution before
you update the code.
IMM reset was caused by restoring Info The IMM has been reset No action; information only.
default values. because a user has
restored the configuration
to its default settings.
IMM clock has been set from NTP Info The IMM clock has been No action; information only.
server %1. set to the date and time
(%1 = IBM_NTPService.ElementName) that is provided by the
Network Time Protocol
server.
SSL data in the IMM configuration data Error There is a problem with the 1. Make sure that the
is invalid. Clearing configuration data certificate that has been certificate that you are
region and disabling SSL+H25. imported into the IMM. The importing is correct.
imported certificate must
2. Try to import the
contain a public key that
certificate again.
corresponds to the key pair
that was previously
generated through the
Generate a New Key and
Certificate Signing
Request link.
96 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Flash of %1 from %2 succeeded for Info A user has successfully No action; information only.
user %3. updated one of the
(%1 = following firmware
CIM_ManagedElement.ElementName; components:
%2 = Web or LegacyCLI; %3 = user ID) v IMM main application
v IMM boot ROM
v Server firmware (UEFI)
v Diagnostics
v System power
backplane
v Remote expansion
enclosure power
backplane
v Integrated service
processor
v Remote expansion
enclosure processor
Flash of %1 from %2 failed for user %3. Info An attempt to update a Try to update the firmware
(%1 = firmware component from again.
CIM_ManagedElement.ElementName; the interface and IP
%2 = Web or LegacyCLI; %3 = user ID) address has failed.
The Chassis Event Log (CEL) on Info The IMM event log is 75% To avoid losing older log
system %1 is 75% full. full. When the log is full, entries, save the log as a
(%1 = older log entries are text file and clear the log.
CIM_ComputerSystem.ElementName) replaced by newer ones.
The Chassis Event Log (CEL) on Info The IMM event log is full. To avoid losing older log
system %1 is 100% full. When the log is full, older entries, save the log as a
(%1 = log entries are replaced by text file and clear the log.
CIM_ComputerSystem.ElementName) newer ones.
%1 Platform Watchdog Timer expired for Error A Platform Watchdog Timer 1. Reconfigure the
%2. Expired event has watchdog timer to a
(%1 = OS Watchdog or Loader occurred. higher value.
Watchdog; %2 = OS Watchdog or
2. Make sure that the
Loader Watchdog)
IMM Ethernet over
USB interface is
enabled.
3. Reinstall the RNDIS or
cdc_ether device driver
for the operating
system.
4. Disable the watchdog.
5. Check the integrity of
the installed operating
system.
Chapter 3. Diagnostics 97
Table 6. IMM error messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
IMM Test Alert Generated by %1. Info A user has generated a No action; information only.
(%1 = user ID) test alert from the IMM.
Security: Userid: '%1' had %2 login Error A user has exceeded the 1. Make sure that the
failures from an SSH client at IP maximum number of correct login ID and
address %3. unsuccessful login password are being
(%1 = user ID; %2 = attempts from SSH and used.
MaximumSuccessiveLoginFailures has been prevented from
2. Have the system
(currently set to 5 in the firmware); %3 = logging in for the lockout
administrator reset the
IP address, [Link]) period.
login ID or password.
98 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Checkout procedure
The checkout procedure is the sequence of tasks that you should follow to
diagnose a problem in the server.
Important: If the server is part of a shared hard disk drive cluster, run one test
at a time. Do not run any suite of tests, such as “quick” or “normal” tests,
because this might enable the hard disk drive diagnostic tests.
v If the server is halted and a POST error code is displayed, see “Event logs” on
page 24. If the server is halted and no error message is displayed, see
“Troubleshooting tables” on page 101 and “Solving undetermined problems” on
page 173.
v For information about power-supply problems, see “Solving power problems” on
page 172.
v For intermittent problems, check the error log; see “Event logs” on page 24 and
“Diagnostic programs and messages” on page 134.
Chapter 3. Diagnostics 99
v No: Go to step 2.
v Yes: Shut down all failing servers that are related to the cluster. Go to step 2.
2. Complete the following steps:
a. Check the power supply LEDs (see “Power-supply LEDs” on page 129).
b. Turn off the server and all external devices.
c. Check all internal and external devices for compatibility at
[Link]
d. Check all cables and power cords.
e. Set all display controls to the middle positions.
f. Turn on all external devices.
g. Turn on the server. If the server does not start, see “Troubleshooting tables”
on page 101.
h. Check the system-error LED on the operator information panel. If it is lit or
flashing, check the light path diagnostics LEDs (see “Light path diagnostics”
on page 119).
Note: When you slide the light path diagnostics panel out of the server to
check the LEDs or checkpoint codes, do not run the server continuously
with light path diagnostics panel outside of the server. The panel should only
be outside of the server a short time. The light path diagnostics panel must
remain in the server when the server is running to ensure proper cooling.
i. Check for the following results:
v Successful completion of POST (see “POST” on page 26 for more
information)
v Successful completion of startup
100 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Troubleshooting tables
Use the troubleshooting tables to find solutions to problems that have identifiable
symptoms.
If you cannot find a problem in these tables, see “Running the diagnostic programs”
on page 135 for information about testing the server.
If you have just added new software or a new optional device and the server is not
working, complete the following steps before you use the troubleshooting tables:
1. Check the system-error LED on the operator information panel; if it is lit, check
the light path diagnostics LEDs (see “Light path diagnostics” on page 119).
2. Remove the software or device that you just added.
3. Run the diagnostic tests to determine whether the server is running correctly.
4. Reinstall the new software or new device.
102 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at [Link] to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
Symptom Action
A newly installed hard disk drive 1. Make sure that the installed hard disk drive or RAID adapter is supported. For
is not recognized. a list of supported optional devices, see [Link]
serverproven/compat/us/.
2. Observe the associated amber hard disk drive status LED. If the LED is lit, it
indicates a drive fault.
3. If the LED is lit, remove the drive from the bay, wait 45 seconds, and reinsert
the drive, making sure that the drive assembly connects to the hard disk drive
backplane.
4. Observe the associated green hard disk drive activity LED and the amber
status LED:
v If the green activity LED is flashing and the amber status LED is not lit, the
drive is recognized by the controller and is working correctly. Run the DSA
diagnostics program to determine whether the drive is detected.
v If the green activity LED is flashing and the amber status LED is flashing
slowly, the drive is recognized by the controller and is rebuilding.
v If neither LED is lit or flashing, check the hard disk drive backplane (go to
step 5).
v If the green activity LED is flashing and the amber status LED is lit, replace
the drive. If the activity of the LEDs remains the same, go to step 5. If the
activity of the LEDs changes, return to step 2.
5. Make sure that the hard disk drive backplane is correctly seated. When it is
correctly seated, the drive assemblies correctly connect to the backplane
without bowing or causing movement of the backplane.
6. Reseat the backplane power cable and repeat steps 2 through 4.
7. Reseat the backplane signal cable and repeat steps 2 through 4.
8. Suspect the backplane signal cable or the backplane:
v If the server has eight hot-swap bays:
a. Replace the affected backplane signal cable.
b. Replace the affected backplane.
9. See “Problem determination tips” on page 174.
Multiple hard disk drives fail. Make sure that the hard disk drive, SAS/SATA adapter, and server device drivers
and firmware are at the latest level.
Important: Some cluster solutions require specific code levels or coordinated
code updates. If the device is part of a cluster solution, verify that the latest level of
code is supported for the cluster solution before you update the code.
Multiple hard disk drives are 1. Review the storage subsystem logs for indications of problems within the
offline. storage subsystem, such as backplane or cable problems.
2. See “Problem determination tips” on page 174.
Hypervisor problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at [Link] to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
Symptom Action
If an optional embedded 1. Make sure that the optional embedded hypervisor flash device is selected on
hypervisor flash device is not the boot manager <F12> Select Boot Device) at startup.
listed in the expected boot
2. Make sure that the embedded hypervisor flash device is seated in the
order, does not appear in the
connector correctly (see “Removing a USB embedded hypervisor flash device”
list of boot devices, or a similar
on page 229 and “Installing a USB embedded hypervisor flash device” on page
problem has occurred.
230).
3. See the documentation that comes with the optional embedded hypervisor flash
device for setup and configuration information.
4. Make sure that other software works on the server.
104 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Intermittent problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at [Link] to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
Symptom Action
A problem occurs only 1. Make sure that:
occasionally and is difficult to v All cables and cords are connected securely to the rear of the server and
diagnose. attached devices.
v When the server is turned on, air is flowing from the fan grille. If there is no
airflow, the fan is not working. This can cause the server to overheat and
shut down.
2. Check the system-error log or IMM system event log (see “Event logs” on page
24).
The server resets (restarts) 1. If the reset occurs during POST and the POST watchdog timer is enabled (click
occasionally. System Settings --> Integrated Management Module --> POST Watchdog
Timer in the Setup utility to see the POST watchdog setting), make sure that
sufficient time is allowed in the watchdog timeout value (POST Watchdog
Timer). If the server continues to reset during POST, see “POST” on page 26
and “Diagnostic programs and messages” on page 134.
2. If the reset occurs after the operating system starts, disable any automatic
server restart (ASR) utilities, such as the IBM Automatic Server Restart IPMI
Application for Windows, or any ASR devices that are be installed.
Note: ASR utilities operate as operating-system utilities and are related to the
IPMI device driver. If the reset continues to occur after the operating system
starts, the operating system might have a problem; see “Software problems” on
page 118.
3. If neither condition applies, check the system-error log or IMM system-event log
(see “Event logs” on page 24).
106 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Memory problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v For additional memory troubleshooting information, refer to the "Troubleshooting Memory - IBM
BladeCenter and System x" document at [Link]
docdisplay?brand=5000020&lndocid=MIGR-5081319.
v Go to the IBM support Web site at [Link] to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
Symptom Action
The amount of system memory Note: Each time you install or remove a DIMM, you must disconnect the server
that is displayed is less than the from the power source; then, wait 10 seconds before restarting the server.
amount of installed physical 1. Make sure that:
memory.
v No error LEDs are lit on the operator information panel.
v No DIMM error LEDs are lit on the system board.
v Memory mirroring does not account for the discrepancy.
v The memory modules are seated correctly.
v You have installed the correct type of memory.
v If you changed the memory, you updated the memory configuration in the
Setup utility.
v All banks of memory are enabled. The server might have automatically
disabled a memory bank when it detected a problem, or a memory bank
might have been manually disabled.
v There is no memory mismatch when the server is at the minimum memory
configuration.
2. Reseat the DIMMs, and then restart the server.
3. Check the POST error log:
v If a DIMM was disabled by a systems-management interrupt (SMI), replace
the DIMM.
v If a DIMM was disabled by the user or by POST, reseat the DIMM; then, run
the Setup utility and enable the DIMM.
4. Check that all DIMMs are initialized in the Setup utility; then, run memory
diagnostics (see “Running the diagnostic programs” on page 135).
5. Reverse the DIMMs between the channels (of the same microprocessor), and
then restart the server. If the problem is related to a DIMM, replace the failing
DIMM.
6. Re-enable all DIMMs using the Setup utility, and then restart the server.
7. (Trained service technician only) Install the failing DIMM into a DIMM connector
for microprocessor 2 (if installed) to verify that the problem is not the
microprocessor or the DIMM connector.
8. (Trained service technician only) Replace the system board.
108 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Microprocessor problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at [Link] to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
Symptom Action
The server goes directly to the 1. Correct any errors that are indicated by the light path diagnostics LEDs (see
POST Event Viewer when it is “Light path diagnostics” on page 119).
turned on.
2. Make sure that the server supports all the microprocessors and that the
microprocessors match in speed and cache size. To view the microprocessor
information, run the Setup utility and select System Information → System
Summary → Processor Details.
3. (Trained service technician only) Make sure that microprocessor 1 is seated
correctly.
4. (Trained service technician only) Remove microprocessor 2 and restart the
server.
5. Replace the following components one at a time, in the order shown, restarting
the server each time:
a. (Trained service technician only) Microprocessor
b. (Trained service technician only) System board
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at [Link] to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
Symptom Action
Testing the monitor. 1. Make sure that the monitor cables are firmly connected.
2. Try using a different monitor on the server, or try using the monitor that is being
tested on a different server.
3. Run the diagnostic programs. If the monitor passes the diagnostic programs,
the problem might be a video device driver.
4. (Trained service technician only) Replace the system board.
The screen is blank. 1. If the server is attached to a KVM switch, bypass the KVM switch to eliminate it
as a possible cause of the problem: connect the monitor cable directly to the
correct connector on the rear of the server.
2. The IMM remote presence function is disabled if you install an optional video
adapter. To use the IMM remote presence function, remove the optional video
adapter.
3. Make sure that:
v The server is turned on. If there is no power to the server, see “Power
problems” on page 113.
v The monitor cables are connected correctly.
v The monitor is turned on and the brightness and contrast controls are
adjusted correctly.
4. Make sure that the correct server is controlling the monitor, if applicable.
5. Make sure that damaged server firmware is not affecting the video; see
“Updating the firmware” on page 281.
6. Observe the checkpoint LEDs on the system board; if the codes are changing,
go to step 6.
7. Replace the following components one at a time, in the order shown, restarting
the server each time:
a. Monitor
b. Video adapter (if one is installed)
c. (Trained service technician only) System board.
8. See “Solving undetermined problems” on page 173.
110 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at [Link] to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
Symptom Action
The monitor works when you 1. Make sure that:
turn on the server, but the
v The application program is not setting a display mode that is higher than the
screen goes blank when you
capability of the monitor.
start some application
programs. v You installed the necessary device drivers for the application.
2. Run video diagnostics (see “Running the diagnostic programs” on page 135).
v If the server passes the video diagnostics, the video is good; see “Solving
undetermined problems” on page 173.
v (Trained service technician only) If the server fails the video diagnostics,
replace the system board.
The monitor has screen jitter, or 1. If the monitor self-tests show that the monitor is working correctly, consider the
the screen image is wavy, location of the monitor. Magnetic fields around other devices (such as
unreadable, rolling, or distorted. transformers, appliances, fluorescents, and other monitors) can cause screen
jitter or wavy, unreadable, rolling, or distorted screen images. If this happens,
turn off the monitor.
Attention: Moving a color monitor while it is turned on might cause screen
discoloration.
Move the device and the monitor at least 305 mm (12 in.) apart, and turn on
the monitor.
Notes:
a. To prevent diskette drive read/write errors, make sure that the distance
between the monitor and any external diskette drive is at least 76 mm (3
in.).
b. Non-IBM monitor cables might cause unpredictable problems.
2. Reseat the monitor cable.
3. Replace the components listed in step 2 one at a time, in the order shown,
restarting the server each time:
a. Monitor cable
b. Video adapter (if one is installed)
c. Monitor
d. (Trained service technician only) System board.
Wrong characters appear on the 1. If the wrong language is displayed, update the server firmware to the latest
screen. level (see “Updating the firmware” on page 281) with the correct language.
2. Reseat the monitor cable.
3. Replace the components listed in step 2 one at a time, in the order shown,
restarting the server each time:
a. Monitor cable
b. Video adapter (if one is installed)
c. Monitor
d. (Trained service technician only) System board.
112 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Power problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at [Link] to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
Symptom Action
The power-control button does 1. Make sure that the power-control button is working correctly:
not work, and the reset button
a. Disconnect the server power cords.
does work (the server does not
start). b. Reconnect the power cords.
Note: The power-control button c. (Trained service technician only) Reseat the operator information panel
will not function until cables, and then repeat steps 1a and 1b.
approximately 20 to 40 seconds v (Trained service technician only) If the server starts, reseat the operator
after the server has been information panel. If the problem remains, replace the operator
connected to power. information panel.
v If the server does not start, bypass the power-control button by using the
force power-on jumper (see “System-board switches and jumpers” on
page 18). If the server starts, reseat the operator information panel. If the
problem remains, replace the operator information panel.
2. Make sure that the reset button is working correctly:
a. Disconnect the server power cords.
b. Reconnect the power cords.
c. (Trained service technician only) Reseat the light path diagnostics panel
cable, and then repeat steps 1a and 1b.
v (Trained service technician only) If the server starts, replace the light path
diagnostics panel.
v If the server does not start, go to step 3.
3. Make sure that both power supplies installed in the server are of the same
type. Mixing different power supplies in the server will cause a system error
(the system-error LED on the front panel turns on and the PS and CNFG LEDs
on the operator information panel are lit).
4. Make sure that:
v The power cords are correctly connected to the server and to a working
electrical outlet.
v The type of memory that is installed is correct.
v The DIMMs are fully seated.
v The LEDs on the power supply do not indicate a problem.
v The microprocessors are installed in the correct sequence.
5. Reseat the following components:
a. DIMMs
b. Power supplies
c. (Trained service technician only) Power switch connector
6. Replace the components listed in step 5 one at a time, in the order shown,
restarting the server each time.
(Continued on the next page)
114 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at [Link] to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
Symptom Action
The OVER SPEC LED on the 1. Turn off the server and disconnect the power from the server.
light path diagnostics panel is
2. (Trained service technician only) Remove the microprocessor in socket 1.
lit, and the 12v channel D LED
Note: The server will not power-on when the microprocessor in socket 1 or 2
on the system board is lit.
is not installed in the server. (Trained service technician only) Toggle the switch
block (SW4), bit 3 to allow the server to power-on. See “System-board switches
and jumpers” on page 18 for the location of the SW4 switch block on the
system board.
3. Restart the server to see whether the problem remains.
4. (Trained service technician only) Reinstall the microprocessor in socket 1 and
restart the server.
5. Replace any failing device.
6. (Trained service technician only) Replace the system board.
The OVER SPEC LED on the 1. Turn off the server and disconnect the power from the server.
light path diagnostics panel is
2. Remove the PCI riser card from PCI riser connector 2 and the microprocessor
lit, and the 12v channel E LED
in socket 2.
on the system board is lit.
3. Restart the server to see whether the problem remains.
4. Reinstall each device that you removed in step 2 one at a time, starting the
server each time, to isolate the failing device.
5. Replace any failing device.
6. (Trained service technician only) Replace the system board.
The OVER SPEC LED on the 1. Turn off the server and disconnect the power from the server.
light path diagnostics panel is
2. Remove all PCI Express and PCI-X cards, all PCI riser cards, the operator
lit, and the 12v channel AUX
information panel, and the Ethernet adapter (if installed).
LED on the system board is lit.
Note: The server will not power-on when the microprocessor in socket 1 or 2
is not installed in the server. (Trained service technician only) Toggle the switch
block (SW4), bit 3 to allow the server to power-on. See “System-board switches
and jumpers” on page 18 for the location of the SW4 switch block on the
system board.
3. Restart the server to see whether the problem remains.
4. Reinstall each device that was removed in step 2 one at a time, starting the
server each time, to isolate the failing device.
5. Replace any failing device.
6. (Trained service technician only) Replace the system board.
Serial-device problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at [Link] to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
Symptom Action
The number of serial ports that 1. Make sure that:
are identified by the operating v Each port is assigned a unique address in the Setup utility and none of the
system is less than the number serial ports is disabled.
of installed serial ports. v The serial-port adapter (if one is present) is seated correctly.
2. Reseat the serial port adapter.
3. Replace the serial port adapter.
116 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at [Link] to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
Symptom Action
A serial device does not work. 1. Make sure that:
v The device is compatible with the server.
v The serial port is enabled and is assigned a unique address.
v The device is connected to the correct connector (see “Internal LEDs,
connectors, and jumpers” on page 16).
2. Reseat the following components:
a. Failing serial device
b. Serial cable
3. Replace the components listed in step 2 one at a time, in the order shown,
restarting the server each time.
4. (Trained service technician only) Replace the system board.
ServerGuide problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at [Link] to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
Symptom Action
The ServerGuide Setup and 1. Make sure that the server supports the ServerGuide program and has a
Installation CD will not start. startable (bootable) CD or DVD drive.
2. If the startup (boot) sequence settings have been changed, make sure that the
CD or DVD drive is first in the startup sequence.
3. If more than one CD or DVD drive is installed, make sure that only one drive is
set as the primary drive. Start the CD from the primary drive.
The MegaRAID Storage 1. Make sure that the hard disk drive is connected correctly.
Manager program cannot view
2. Make sure that the SAS/SATA hard disk drive cables are securely connected.
all installed drives, or the
operating system cannot be
installed.
The operating-system Make more space available on the hard disk.
installation program
continuously loops.
Software problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at [Link] to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
Symptom Action
You suspect a software 1. To determine whether the problem is caused by the software, make sure that:
problem. v The server has the minimum memory that is needed to use the software. For
memory requirements, see the information that comes with the software. If
you have just installed an adapter or memory, the server might have a
memory-address conflict.
v The software is designed to operate on the server.
v Other software works on the server.
v The software works on another server.
2. If you received any error messages when using the software, see the
information that comes with the software for a description of the messages and
suggested solutions to the problem.
3. Contact the software vendor.
118 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Universal Serial Bus (USB) port problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at [Link] to check for technical
information, hints, tips, and new device drivers or to submit a request for information.
Symptom Action
A USB device does not work. 1. Make sure that:
v The correct USB device driver is installed.
v The operating system supports USB devices.
2. Make sure that the USB configuration options are set correctly in the Setup
utility (see “Using the Setup utility” on page 285 for more information).
3. If you are using a USB hub, disconnect the USB device from the hub and
connect it directly to the server.
Video problems
See “Monitor and video problems” on page 110.
When LEDs are lit to indicate an error, they remain lit when the server is turned off,
provided that the server is still connected to power and the power supply is
operating correctly.
Before you work inside the server to view light path diagnostics LEDs, read the
safety information that begins on page “Safety” on page vii and “Handling
static-sensitive devices” on page 189.
If an error occurs, view the light path diagnostics LEDs in the following order:
1. Look at the operator information panel on the front of the server.
v If the information LED is lit, it indicates that information about a suboptimal
condition in the server is available in the IMM system-event log or in the
system-error log.
v If the system-error LED is lit, it indicates that an error has occurred; go to
step 2.
The following illustration shows the operator information panel:
Note: When you slide the light path diagnostics panel out of the server to
check the LEDs or checkpoint codes, do not run the server continuously with
the light path diagnostics panel outside of the server. The panel should only be
outside of the server a short time. The light path diagnostics panel must remain
in the server when the server is running to ensure proper cooling.
The following illustration shows the light path diagnostics panel:
Note any LEDs that are lit, and then reinstall the light path diagnostics panel in
the server.
Note: When you slide the light path diagnostics panel out of the server to
check the LEDs or checkpoint codes, do not run the server continuously with
the light path diagnostics panel outside of the server. The panel should only be
outside of the server a short time. The light path diagnostics panel must remain
in the server when the server is running to ensure proper cooling.
Look at the system service label inside the server cover, which gives an
overview of internal components that correspond to the LEDs on the light path
diagnostics panel. This information and the information in “Light path diagnostics
LEDs” on page 122 can often provide enough information to diagnose the error.
3. Remove the server cover and look inside the server for lit LEDs. Certain
components inside the server have LEDs that are lit to indicate the location of a
problem.
The following illustration shows the LEDs and connectors on the system board.
120 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Remind button: Press this button to place the system-error LED on the front
information panel into Remind mode. By placing the system-error LED indicator
in Remind mode, you acknowledge that you are aware of the last failure but will
not take immediate action to correct the problem. In Remind mode, the
system-error LED flashes rapidly until one of the following conditions occurs:
– All known errors are corrected.
– The server is restarted.
– A new error occurs, causing the system-error LED to be lit again.
v NMI button: The NMI button on the front panel will come on when this button is
pressed. Press this button to force a nonmaskable interrupt to the
microprocessor. You might have to use a pen or the end of a straightened paper
clip to press the button. It allows you to blue screen the server and take a
memory dump (use this button only when directed by the IBM service support).
v Checkpoint code display: This display provides a checkpoint code that
indicates the point at which the system stopped during the boot block and POST.
A checkpoint code is either a byte or a word value that is produced by UEFI. The
display does not provide error codes or suggest components to be replaced.
v Reset button: Press this button to reset the server and run the power-on
self-test (POST). You might have to use a pen or the end of a straightened paper
clip to press the button. The Reset button is in the lower-right corner of the light
path diagnostics panel.
Note: Check the system-error log or system event log for additional information
before replacing a FRU.
Table 7. Light path diagnostics panel LEDs
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by "(Trained service technician only)," that step must be performed only by a
trained service technician.
LED Description Action
None, but the An error has occurred and cannot Use the Setup utility to check the system error log for
system error LED be isolated. The error is not information about the error.
is lit. represented by a path.
OVER SPEC The power supplies are using If the OVER SPEC LED on the light path diagnostics panel is
more power than their maximum lit, or any of the six 12 V channel error LEDs (A, B, C, D, E,
rating. or AUX) on the system board are lit, use one of the following
procedures.
122 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 7. Light path diagnostics panel LEDs (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by "(Trained service technician only)," that step must be performed only by a
trained service technician.
LED Description Action
OVER SPEC The power supplies are using If the 12 V channel C error LED is lit, complete the following
(Continued) more power than their maximum steps:
rating. 1. Turn off the server and disconnect the power from the
server.
2. Remove the SAS/SATA RAID riser card, DIMMs in
connectors 1 through 9, and the microprocessor in socket
1.
Note: The server will not power-on when the
microprocessor in socket 1 or 2 is not installed in the
server. (Trained service technician only) Toggle the switch
block (SW4), bit 3 to allow the server to power-on. See
“System-board switches and jumpers” on page 18 for the
location of the SW4 switch block on the system board.
3. Restart the server to see whether the problem remains.
4. Reinstall each device that you removed in step 2 one at a
time, starting the server each time, to isolate the failing
device.
5. Replace any failing device.
6. (Trained service technician only) Replace the system
board.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by "(Trained service technician only)," that step must be performed only by a
trained service technician.
LED Description Action
OVER SPEC The power supplies are using If the 12 V channel E error LED is lit, complete the following
(Continued) more power than their maximum steps:
rating. 1. Turn off the server and disconnect the power from the
server.
2. Remove the PCI riser card from PCI riser connector 2 and
the microprocessor in socket 2.
3. Restart the server to see whether the problem remains.
4. Reinstall each device that you removed in step 2 one at a
time, starting the server each time, to isolate the failing
device.
5. Replace any failing device.
6. (Trained service technician only) Replace the system
board.
124 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 7. Light path diagnostics panel LEDs (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by "(Trained service technician only)," that step must be performed only by a
trained service technician.
LED Description Action
PS Power supply 1 or 2 has failed. 1. Check the power-supply that has an lit amber LED (see
When both the PS and CNFG “Power-supply LEDs” on page 129).
LEDs are lit, the power supply
2. Make sure that the power supplies are seated correctly.
configuration is invalid.
3. Remove one of the power supplies to isolate the failed
power supply.
4. Make sure that both power supplies installed in the server
are of the same type.
5. Replace the failed power supply.
PCI An error has occurred on a PCI 1. Check the LEDs on the PCI slots to identify the
bus or on the system board. An component that caused the error.
additional LED is lit next to a
2. Check the system-error log for information about the error.
failing PCI slot.
3. If you cannot isolate the failing adapter by using the LEDs
and the information in the system-error log, remove one
adapter at a time from the failing PCI bus; and restart the
server after each adapter is removed.
4. Replace the following components, in the order shown,
restarting the server each time:
v PCI riser card
v (Trained service technician only) Replace the system
board.
5. For more information, go to [Link]
support/[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
SP A service processor error has 1. Shut down the system and remove the power cords from
been detected. the server; then, reconnect the server to power and restart
the server.
2. Update the IMM firmware (see “Updating the firmware” on
page 281).
3. (Trained service technician only) Replace the system
board, if the problem remains.
4. For more information, go to [Link]
support/[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
FAN A fan has failed, is operating too 1. Reseat the failing fan, which is indicated by a lit LED near
slowly, or has been removed. The the fan connector on the system board.
TEMP LED might also be lit.
2. Replace the failing fan.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by "(Trained service technician only)," that step must be performed only by a
trained service technician.
LED Description Action
TEMP The system temperature has 1. Make sure that the heat sink is seated correctly.
exceeded a threshold level. A
2. Determine whether a fan has failed. If it has, replace it.
failing fan can cause the TEMP
LED to be lit. 3. Make sure that the room temperature is not too high. See
Table 1 on page 7 for the server temperature information.
4. Make sure that the air vents are not blocked.
5. For more information, go to [Link]
support/[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
MEM When only the MEM LED is lit, a Note: Each time you install or remove a DIMM, you must
memory error has occurred. disconnect the server from the power source; then, wait 10
When both the MEM and CNFG seconds before restarting the server.
LEDs are lit, the memory 1. If the MEM LED and the CNFG LED are lit, complete the
configuration is invalid or the PCI following step:
Option ROM is out of resource.
a. Check the system-event log in the Setup utility or IMM
error messages. Follow steps indicated in “POST error
codes” on page 27 and “Integrated management
module (IMM) error messages” on page 40.
2. If the CNFG LED is not lit, the system might detect a
memory error. Complete the following steps to correct the
problem:
a. Update the server firmware to the latest level (see
“Updating the firmware” on page 281).
b. Reseat or swap the DIMMs.
c. Check the system-event log in the Setup utility or IMM
error messages. Follow steps indicated in “POST error
codes” on page 27 and “Integrated management
module (IMM) error messages” on page 40.
NMI A nonmaskable interrupt has Check the system-error log for information about the error.
occurred, or the NMI button was
pressed.
126 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 7. Light path diagnostics panel LEDs (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by "(Trained service technician only)," that step must be performed only by a
trained service technician.
LED Description Action
CNFG A hardware configuration error 1. If the CNFG LED and the CPU LED are lit, complete the
has occurred. following steps to correct the problem:
a. Check the microprocessors that were just installed to
make sure that they are compatible with each other
(see “Installing a microprocessor and heat sink” on
page 269 for additional information about
microprocessor requirements).
b. (Trained service technician only) Replace the
incompatible microprocessor.
c. Check the system-error logs for information about the
error. Replace any components that are identified in
the error log.
2. If the CNFG LED and the MEM LED are lit, complete the
following step:
a. Check the system-event log in the Setup utility or IMM
error messages. Follow steps indicated in “POST error
codes” on page 27 and “Integrated management
module (IMM) error messages” on page 40.
3. If the CNFG LED and the PS LED are lit, the system
issues an invalid power configuration error. Make sure that
both power supplies installed in the server are of the
same type.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by "(Trained service technician only)," that step must be performed only by a
trained service technician.
LED Description Action
CPU An invalid microprocessor 1. If the CNFG LED is lit, the system issues an invalid
configuration or a microprocessor microprocessor configuration error. Complete the following
has failed (both the CPU LED and steps to correct the problem:
the CNFG LED might be lit).
a. Check the microprocessors that were just installed to
make sure that they are compatible with each other
(see “Installing a microprocessor and heat sink” on
page 269 for additional information about
microprocessor requirements) and use the Setup utility
and select System Information → System Summary
→ Processor Details to verify the microprocessors
information.
b. (Trained service technician only) Replace the
incompatible microprocessor.
c. Check the system-error logs for information about the
error. Replace any components that are identified in
the error log.
2. If a microprocessor failure occurs, complete the following
steps:
a. (Trained service technician only) Make sure that the
failing microprocessor, which is indicated by a lit LED
on the system board, is installed correctly. See
“Installing a microprocessor and heat sink” on page
269 for information about installation and
requirements.
b. For more information, go to [Link]
systems/support/[Link]/
docdisplay?brandind=5000008&lndocid=SERV-CALL.
VRM Reserved.
128 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 7. Light path diagnostics panel LEDs (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which
components are customer replaceable units (CRU) and which components are field replaceable units
(FRU).
v If an action step is preceded by "(Trained service technician only)," that step must be performed only by a
trained service technician.
LED Description Action
DASD A hard disk drive has failed or is 1. Check the LEDs on the hard disk drives for the drive with
missing. a lit status LED and reseat the hard disk drive.
2. Reseat the hard disk drive backplane.
3. For more information, see “Hard disk drive problems” on
page 102.
4. If the error remains, replace the following components in
the order listed, restarting the server after each:
a. Replace the hard disk drive.
b. Replace the hard disk drive backplane.
5. If the error remains, replace the following components one
at a time, in the order listed, restarting the server after
each:
a. Replace the hard disk drive.
b. Replace the hard disk drive backplane.
6. If the problem remains, go to [Link]
support/[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
RAID Reserved.
BRD An error has occurred on the 1. Check the LEDs on the system board to identify the
system board. component that caused the error. The BRD LED can be lit
due to any of the following reasons:
v Battery
v Missing PCI riser-card assembly
v Failed voltage regulator
2. Check the system-error log for information about the error.
3. Replace any failed or missing replacement components,
such as the battery or PCI riser-card assembly.
4. If a voltage regulator has failed, (trained service technician
only) replace the system board.
Power-supply LEDs
The following minimum configuration is required for the DC LED on the power
supply to be lit:
v Power supply
v Power cord
The following illustration shows the locations of the power-supply LEDs on the ac
power supply.
The following table describes the problems that are indicated by various
combinations of the power-supply LEDs on an ac power supply and suggested
actions to correct the detected problems.
130 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
AC power-supply LEDs
AC DC Error (!) Description Action Notes
On On Off Normal operation
Off Off Off No ac power to the 1. Check the ac power to the This is a normal
server or a problem server. condition when no ac
with the ac power power is present.
2. Make sure that the power
source.
cord is connected to a
functioning power source.
3. Restart the server. If the error
remains, check the
power-supply LEDs.
4. Replace the power-supply.
Off Off On No ac power to the v Make sure that the power cord This happens only
server or a problem is connected to a functioning when a second power
with the ac power power source. supply is providing
source and the power to the server.
v Replace the power supply.
power-supply had
detected an internal
problem.
Off On Off Faulty Replace the power supply.
power-supply
Off On On Faulty Replace the power supply.
power-supply
On Off Off Power-supply not 1. (Trained service technician Typically indicates a
fully seated, faulty only) Reseat the power power-supply is not
system board, or supply. fully seated.
faulty power-supply
2. If a power channel error LED
on the system board is not lit,
replace the power-supply
(see the documentation that
comes with the power supply
for instructions).
3. If a power channel error LED
on the system board is lit,
(trained service technician
only) replace the system
board.
On Off On Faulty Replace the power supply.
power-supply
On On On Power-supply is Replace the power supply.
faulty but still
operational
The following illustration shows the locations of the power-supply LEDs on the dc
power supply.
The following table describes the problems that are indicated by various
combinations of the power-supply LEDs on a dc power supply and suggested
actions to correct the detected problems.
132 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
DC power-supply LEDs
IN OK OUT OK Error (!) Description Action Notes
On On Off Normal operation
Off Off Off No dc power to the 1. Check the dc power to the This is a normal
server or a problem server. condition when no dc
with the dc power power is present.
2. Make sure that the power
source.
cord is connected to a
functioning power source.
3. Restart the server. If the error
remains, check the
power-supply LEDs.
4. Replace the power-supply.
Off Off On No dc power to the v Make sure that the power cord
server or a problem is connected to a functioning
with the dc power power source.
source and the
v Replace the power supply
power-supply had
(see the documentation that
detected an internal
comes with the power supply
problem.
for instructions).
Off On Off Faulty Replace the power supply.
power-supply
Off On On Faulty Replace the power supply.
power-supply
On Off Off Power-supply not 1. Reseat the power supply. Typically indicates a
fully seated, faulty power-supply is not
2. If a power channel error LED
system board, or fully seated.
on the system board is not lit,
faulty power-supply.
replace the power-supply
(see the documentation that
comes with the power supply
for instructions).
3. If a power channel error LED
on the system board is lit,
(trained service technician
only) replace the system
board.
On Off On Faulty Replace the power supply.
power-supply
On On On Power-supply is Replace the power supply.
faulty but still
operational
Make sure that the server has the latest version of the diagnostic programs. To
download the latest version, go to [Link]
134 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Utilities are available to reset and update the diagnostics code on the integrated
USB flash device, if the diagnostic partition becomes damaged and does not start
the diagnostic programs. For more information and to download the utilities, go to
[Link]
5072294&brandind=5000008.
Note: After you exit from the stand-alone memory diagnostic environment, you
must restart the server to access the stand-alone memory diagnostic
environment again.
5. Type gui to display the graphical user interface, or type cmd to display the DSA
interactive menu.
6. Follow the instructions on the screen to select the diagnostic test to run.
If the diagnostic programs do not detect any hardware errors but the problem
remains during normal server operation, a software error might be the cause. If you
suspect a software problem, see the information that comes with your software.
A single problem might cause more than one error message. When this happens,
correct the cause of the first error message. The other error messages usually will
not occur the next time you run the diagnostic programs.
If the server stops during testing and you cannot continue, restart the server and try
running the diagnostic programs again. If the problem remains, replace the
component that was being tested when the server stopped.
Diagnostic messages
The following table describes the messages that the diagnostic programs might
generate and suggested actions to correct the detected problems. Follow the
suggested actions in the order in which they are listed in the column.
Table 9. DSA Preboot messages
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
089-801-xxx CPU CPU Stress Aborted Internal 1. Turn off and restart the system.
Test program error.
2. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
3. Run the test again.
4. Make sure that the system firmware is at the
latest level. The installed firmware level is shown
in the DSA event log in the Firmware/VPD
section for this component. For more information,
see “Updating the firmware” on page 281.
5. Run the test again.
6. Turn off and restart the system if necessary to
recover from a hung state.
7. Run the test again.
8. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
136 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
089-802-xxx CPU CPU Stress Aborted System 1. Turn off and restart the system.
Test resource
availability 2. Make sure that the DSA code is at the latest
error. level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
3. Run the test again.
4. Make sure that the system firmware is at the
latest level. The installed firmware level is
shown in the DSA event log in the
Firmware/VPD section for this component. For
the latest level of firmware, go to
[Link]
[Link]?uid=psg1 MIGR-4JTS2T and
select your system to display a matrix of
available firmware.
5. Run the test again.
6. Turn off and restart the system if necessary to
recover from a hung state.
7. Run the test again.
8. Make sure that the system firmware is at the
latest level. The installed firmware level is
shown in the DSA event log in the
Firmware/VPD section for this component. For
more information, see “Updating the firmware”
on page 281.
9. Run the test again.
10. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
138 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-802-xxx IMM IMM I2C Test Aborted IMM I2C test 1. Turn off the system and disconnect it from the
aborted: the power source. You must disconnect the system
test cannot be from ac power to reset the IMM.
completed for
an unknown 2. After 45 seconds, reconnect the system to the
reason. power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in the
DSA event log in the Firmware/VPD section for
this component. For more information, see
“Updating the firmware” on page 281.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
166-803-xxx IMM IMM I2C Test Aborted IMM I2C test 1. Turn off the system and disconnect it from the
aborted: the power source. You must disconnect the system
node is busy; from ac power to reset the IMM.
try later.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in the
DSA event log in the Firmware/VPD section for
this component. For more information, see
“Updating the firmware” on page 281.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
140 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-806-xxx IMM IMM I2C Test Aborted IMM I2C test 1. Turn off the system and disconnect it from the
aborted: power source. You must disconnect the system
timeout while from ac power to reset the IMM.
processing the
command. 2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in the
DSA event log in the Firmware/VPD section for
this component. For more information, see
“Updating the firmware” on page 281.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
166-807-xxx IMM IMM I2C Test Aborted IMM I2C test 1. Turn off the system and disconnect it from the
aborted: out of power source. You must disconnect the system
space. from ac power to reset the IMM.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in the
DSA event log in the Firmware/VPD section for
this component. For more information, see
“Updating the firmware” on page 281.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
142 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-810-xxx IMM IMM I2C Test Aborted IMM I2C test 1. Turn off the system and disconnect it from the
aborted: power source. You must disconnect the system
request data from ac power to reset the IMM.
length is
invalid. 2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in the
DSA event log in the Firmware/VPD section for
this component. For more information, see
“Updating the firmware” on page 281.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
166-811-xxx IMM IMM I2C Test Aborted IMM I2C test 1. Turn off the system and disconnect it from the
aborted: power source. You must disconnect the system
request data from ac power to reset the IMM.
field length limit
is exceeded. 2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in the
DSA event log in the Firmware/VPD section for
this component. For more information, see
“Updating the firmware” on page 281.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
144 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-814-xxx IMM IMM I2C Test Aborted IMM I2C test 1. Turn off the system and disconnect it from the
aborted: power source. You must disconnect the system
requested from ac power to reset the IMM.
sensor, data, or
record is not 2. After 45 seconds, reconnect the system to the
present. power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in the
DSA event log in the Firmware/VPD section for
this component. For more information, see
“Updating the firmware” on page 281.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
166-815-xxx IMM IMM I2C Test Aborted IMM I2C test 1. Turn off the system and disconnect it from the
aborted: invalid power source. You must disconnect the system
data field in the from ac power to reset the IMM.
request.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in the
DSA event log in the Firmware/VPD section for
this component. For more information, see
“Updating the firmware” on page 281.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
146 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-818-xxx IMM IMM I2C Test Aborted IMM I2C test 1. Turn off the system and disconnect it from the
aborted: cannot power source. You must disconnect the system
execute a from ac power to reset the IMM.
duplicated
request. 2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in the
DSA event log in the Firmware/VPD section for
this component. For more information, see
“Updating the firmware” on page 281.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
166-819-xxx IMM IMM I2C Test Aborted IMM I2C test 1. Turn off the system and disconnect it from the
aborted: a power source. You must disconnect the system
command from ac power to reset the IMM.
response could
not be 2. After 45 seconds, reconnect the system to the
provided; the power source and turn on the system.
SDR repository 3. Run the test again.
is in update 4. Make sure that the DSA code is at the latest
mode. level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in the
DSA event log in the Firmware/VPD section for
this component. For more information, see
“Updating the firmware” on page 281.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
148 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-822-xxx IMM IMM I2C Test Aborted IMM I2C test 1. Turn off the system and disconnect it from the
aborted: the power source. You must disconnect the system
destination is from ac power to reset the IMM.
unavailable.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in the
DSA event log in the Firmware/VPD section for
this component. For more information, see
“Updating the firmware” on page 281.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
166-823-xxx IMM IMM I2C Test Aborted IMM I2C test 1. Turn off the system and disconnect it from the
aborted: cannot power source. You must disconnect the system
execute the from ac power to reset the IMM.
command;
insufficient 2. After 45 seconds, reconnect the system to the
privilege level. power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in the
DSA event log in the Firmware/VPD section for
this component. For more information, see
“Updating the firmware” on page 281.
6. Run the test again.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
150 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-901-xxx IMM IMM I2C Test Failed The IMM 1. Turn off the system and disconnect it from the
indicates a power source. You must disconnect the system
failure in the H8 from ac power to reset the IMM.
bus (Bus 0)
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in
the DSA event log in the Firmware/VPD section
for this component. For more information, see
“Updating the firmware” on page 281.
6. Run the test again.
7. Shut down the system and remove the power
cords from the server.
8. (Trained service technician only) Reseat the
system board.
9. Reconnect the system to power and turn on the
system.
10. Run the test again.
11. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
152 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-903-xxx IMM IMM I2C Test Failed The IMM 1. Turn off the system and disconnect it from the
indicates a power source. You must disconnect the system
failure in the from ac power to reset the IMM.
DIMM bus (Bus
2). 2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in
the DSA event log in the Firmware/VPD section
for this component. For more information, see
“Updating the firmware” on page 281.
6. Run the test again.
7. Disconnect the system from the power source.
8. Replace the DIMMs one at a time, and run the
test again after replacing each DIMM.
9. Reconnect the system to the power source and
turn on the system.
10. Run the test again.
11. Turn off the system and disconnect it from the
power source.
12. Reseat all of the DIMMs.
13. Reconnect the system to the power source and
turn on the system.
14. Run the test again.
15. Turn off the system and disconnect it from the
power source.
16. (Trained service technician only) Reseat the
system board.
17. Reconnect the system to the power source and
turn on the system.
18. Run the test again.
19. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
154 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-905-xxx IMM IMM I2C Test Failed The IMM Note: Ignore the error if the hard disk drive
indicates a backplane is not installed.
failure in the 1. Turn off the system and disconnect it from the
HDD bus (Bus power source. You must disconnect the system
4). from ac power to reset the IMM.
2. After 45 seconds, reconnect the system to the
power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
5. Make sure that the IMM firmware is at the latest
level. The installed firmware level is shown in
the DSA event log in the Firmware/VPD section
for this component. For more information, see
“Updating the firmware” on page 281.
6. Run the test again.
7. Turn off the system and disconnect it from the
power source.
8. Reseat the hard disk drive backplane.
9. Reconnect the system to the power source and
turn on the system.
10. Run the test again.
11. Turn off the system and disconnect it from the
power source.
12. (Trained service technician only) Reseat the
system board.
13. Reconnect the system to the power source and
turn on the system.
14. Run the test again.
15. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
156 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
201-802-xxx Memory Memory Test Aborted Test canceled: 1. Turn off and restart the system.
the end
address in the 2. Run the test again.
E820 function 3. Make sure that all DIMMs are enabled in the
is less than 16 Setup utility.
MB. 4. Make sure that the server firmware is at the
latest level. The installed firmware level is shown
in the DSA event log in the Firmware/VPD
section for this component. For more information,
see “Updating the firmware” on page 281.
5. Run the test again.
6. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
201-803-xxx Memory Memory Test Aborted Test canceled: 1. Turn off and restart the system.
could not
enable the 2. Run the test again.
processor 3. Make sure that the server firmware is at the
cache. latest level. The installed firmware level is shown
in the DSA event log in the Firmware/VPD
section for this component. For more information,
see “Updating the firmware” on page 281.
4. Run the test again.
5. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
201-804-xxx Memory Memory Test Aborted Test canceled: 1. Turn off and restart the system.
the memory
controller buffer 2. Run the test again.
request failed. 3. Make sure that the server firmware is at the
latest level. The installed firmware level is shown
in the DSA event log in the Firmware/VPD
section for this component. For more information,
see “Updating the firmware” on page 281.
4. Run the test again.
5. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
158 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
201-808-xxx Memory Memory Test Aborted Test canceled: 1. Turn off and restart the system.
memory
controller 2. Run the test again.
display/alter 3. Make sure that the server firmware is at the
buffer execute latest level. The installed firmware level is shown
error. in the DSA event log in the Firmware/VPD
section for this component. For more information,
see “Updating the firmware” on page 281.
4. Run the test again.
5. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
201-809-xxx Memory Memory Test Aborted Test canceled 1. Turn off and restart the system.
program error:
operation 2. Run the test again.
running fast 3. Make sure that the DSA code is at the latest
scrub. level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
4. Make sure that the server firmware is at the
latest level. The installed firmware level is shown
in the DSA event log in the Firmware/VPD
section for this component. For more information,
see “Updating the firmware” on page 281.
5. Run the test again.
6. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
201-810-xxx Memory Memory Test Aborted Test stopped: 1. Turn off and restart the system.
unknown error
code xxx 2. Run the test again.
received in 3. Make sure that the DSA code is at the latest
COMMONEXIT level. For the latest level of DSA code, go to
procedure. [Link]
[Link]?uid=psg1SERV-DSA.
4. Make sure that the server firmware is at the
latest level. The installed firmware level is shown
in the DSA event log in the Firmware/VPD
section for this component. For more information,
see “Updating the firmware” on page 281.
5. Run the test again.
6. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
160 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
202-801-xxx Memory Memory Aborted Internal 1. Turn off and restart the system.
Stress Test program error.
2. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
3. Make sure that the server firmware is at the
latest level. The installed firmware level is shown
in the DSA event log in the Firmware/VPD
section for this component. For more information,
see “Updating the firmware” on page 281.
4. Run the test again.
5. Turn off and restart the system if necessary to
recover from a hung state.
6. Run the memory diagnostics to identify the
specific failing DIMM.
7. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
202-802-xxx Memory Memory Failed General error: 1. Make sure that all memory is enabled by
Stress Test memory size is checking the Available System Memory in the
insufficient to Resource Utilization section of the DSA event
run the test. log. If necessary, enable all memory in the Setup
utility (see “Using the Setup utility” on page 285).
2. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
3. Run the test again.
4. Run the standard memory test to validate all
memory.
5. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
162 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
215-802-xxx Optical Drive v Verify Aborted The media tray 1. Close the media tray and wait 15 seconds.
Media is open.
2. Run the test again.
Installed
3. Insert a new CD/DVD into the drive and wait for
v Read/ 15 seconds for the media to be recognized.
Write Test
4. Run the test again.
v Self-Test
5. Check the drive cabling at both ends for loose
Messages or broken connections or damage to the cable.
and actions Replace the cable if it is damaged.
apply to all 6. Run the test again.
three tests. 7. Make sure that the DSA code is at the latest
level. For the latest level of DSA code, go to
[Link]
[Link]?uid=psg1SERV-DSA.
8. Run the test again.
9. For additional troubleshooting information, go to
[Link]
[Link]?uid=psg1MIGR-41559.
10. Run the test again.
11. Replace the CD/DVD drive.
12. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
215-803-xxx Optical Drive v Verify Failed The disc might 1. Wait for the system activity to stop.
Media be in use by
the system. 2. Run the test again
Installed
3. Turn off and restart the system.
v Read/
4. Run the test again.
Write Test
5. Replace the DVD drive.
v Self-Test
6. If the failure remains, go to the IBM Web site for
Messages more troubleshooting information at
and actions [Link]
apply to all [Link]/docdisplay?brandind=5000008
three tests. &lndocid=SERV-CALL.
164 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
215-903-xxx Optical Drive v Verify Aborted Could not 1. Insert a CD/DVD into the DVD drive or try a
Media access the new media, and wait for 15 seconds.
Installed drive.
2. Run the test again.
v Read/ 3. Check the drive cabling at both ends for loose
Write Test or broken connections or damage to the cable.
v Self-Test Replace the cable if it is damaged.
4. Run the test again.
Messages
and actions 5. Make sure that the DSA code is at the latest
apply to all level. For the latest level of DSA code, go to
three tests. [Link]
[Link]?uid=psg1SERV-DSA.
6. Run the test again.
7. For additional troubleshooting information, go to
[Link]
[Link]?uid=psg1MIGR-41559.
8. Run the test again.
9. Replace the DVD drive.
10. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
215-904-xxx Optical Drive v Verify Failed A read error 1. Insert a CD/DVD into the DVD drive or try a new
Media occurred. media, and wait for 15 seconds.
Installed 2. Run the test again.
v Read/ 3. Check the drive cabling at both ends for loose or
Write Test broken connections or damage to the cable.
v Self-Test Replace the cable if it is damaged.
4. Run the test again.
Messages
and actions 5. For additional troubleshooting information, go to
apply to all [Link]
three tests. [Link]?uid=psg1MIGR-41559.
6. Run the test again.
7. Replace the DVD drive.
8. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
166 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
405-902-xxx BroadCom Test Failed 1. Make sure that the component firmware is at the
Ethernet EEPROM latest level. The installed firmware level is shown
Device in the DSA event log in the Firmware/VPD
section for this component. For more information,
see “Updating the firmware” on page 281.
2. Run the test again.
3. Replace the component that is causing the error.
If the error is caused by an adapter, replace the
adapter. Check the PCI Information and Network
Settings information in the DSA event log to
determine the physical location of the failing
component.
4. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
405-903-xxx BroadCom Test Internal Failed 1. Make sure that the component firmware is at the
Ethernet Memory latest level. The installed firmware level is shown
Device in the DSA event log in the Firmware/VPD
section for this component. For more information,
see “Updating the firmware” on page 281.
2. Run the test again.
3. Check the interrupt assignments in the PCI
Hardware section of the DSA event log. If the
Ethernet device is sharing interrupts, if possible,
use the Setup utility see “Using the Setup utility”
on page 285) to assign a unique interrupt to the
device.
4. Replace the component that is causing the error.
If the error is caused by an adapter, replace the
adapter. Check the PCI Information and Network
Settings information in the DSA event log to
determine the physical location of the failing
component.
5. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
168 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 9. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
v See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page 177 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a Trained
service technician.
v Go to the IBM support Web site at [Link] to check for technical information, hints, tips,
and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
405-906-xxx BroadCom Test Loop Failed 1. Check the Ethernet cable for damage and make
Ethernet back at sure that the cable type and connection are
Device Physical correct.
Layer
2. Make sure that the component firmware is at the
latest level. The installed firmware level is shown
in the DSA event log in the Firmware/VPD
section for this component. For more information,
see “Updating the firmware” on page 281.
3. Run the test again.
4. Replace the component that is causing the error.
If the error is caused by an adapter, replace the
adapter. Check the PCI Information and Network
Settings information in the DSA event log to
determine the physical location of the failing
component.
5. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
405-907-xxx BroadCom Test LEDs Failed 1. Make sure that the component firmware is at the
Ethernet latest level. The installed firmware level is shown
Device in the DSA event log in the Firmware/VPD
section for this component. For more information,
see “Updating the firmware” on page 281.
2. Run the test again.
3. Replace the component that is causing the error.
If the error is caused by an adapter, replace the
adapter. Check the PCI Information and Network
Settings information in the DSA event log to
determine the physical location of the failing
component.
4. If the failure remains, go to the IBM Web site for
more troubleshooting information at
[Link]
[Link]/docdisplay?brandind=5000008
&lndocid=SERV-CALL.
If the server firmware has become corrupted, such as from a power failure during
an update, you can recover the server firmware in one of two ways:
v In-band method: Recover server firmware, using either the boot block jumper
(Automated Boot Recovery) and a server Firmware Update Package Service
Pack.
v Out-of-band method: Use the IMM Web Interface to update the firmware, using
the latest server firmware update package.
Note: You can obtain a server update package from one of the following sources:
v Download the server firmware update from the World Wide Web.
v Contact your IBM service representative.
To download the server firmware update package from the World Wide Web, go to
[Link]
The flash memory of the server consists of a primary bank and a backup bank. It is
essential that you maintain the backup bank with a bootable firmware image. If the
primary bank becomes corrupted, you can either manually boot the backup bank
with the boot block jumper, or in the case of image corruption, this will occur
automatically with the Automated Boot Recovery function.
To recover the server firmware and restore the server operation to the primary
bank, complete the following steps:
1. Turn off the server, and disconnect all power cords and external cables.
2. Remove the server cover. See “Removing the cover” on page 195 for more
information.
3. Locate the UEFI boot recovery jumper block (J29) on the system board.
170 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
uEFI boot recovery IMM recovery
jumper (J29) jumper (J147)
4. Move the jumper from pins 1 and 2 to pins 2 and 3 to enable the UEFI
recovery mode.
5. Reinstall the server cover; then, reconnect all power cords.
6. Restart the server. The power-on self-test (POST) starts.
7. Boot the server to an operating system that is supported by the firmware
update package that you downloaded.
8. Perform the firmware update by following the instructions that are in the
firmware update package readme file.
9. Copy the downloaded firmware update package into a directory.
10. From a command line, type filename-s, where filename is the name of the
executable file that you downloaded with the firmware update package.
11. Turn off the server and disconnect all power cords and external cables, and
then remove the server cover.
12. Move the UEFI boot recovery jumper back to the primary position (pins 1 and
2).
13. Reinstall the server cover, and then reconnect all the power cables.
14. Restart the server.
172 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
6. Reconnect all ac power cords and turn on the server. If the server starts
successfully, reseat the adapters and devices one at a time until the problem is
isolated.
If the server does not start from the minimum configuration, see “Power-supply
LEDs” on page 129 to replace the components in the minimum configuration one at
a time until the problem is isolated.
If the Ethernet controller still cannot connect to the network but the hardware
appears to be working, the network administrator must investigate other possible
causes of the error.
If the power supplies are working correctly, complete the following steps:
1. Turn off the server.
2. Make sure that the server is cabled correctly.
3. Remove or disconnect the following devices, one at a time, until you find the
failure. Turn on the server and reconfigure it each time.
v Any external devices.
v Surge-suppressor device (on the server).
v Printer, mouse, and non-IBM devices.
v Each adapter.
v Hard disk drives.
v Memory modules. The minimum configuration requirement is 2 GB DIMM in
slot 3.
4. Turn on the server.
If the problem is solved when you remove an adapter from the server but the
problem recurs when you reinstall the same adapter, suspect the adapter; if the
problem recurs when you replace the adapter with a different one, suspect the riser
card.
If you suspect a networking problem and the server passes all the system tests,
suspect a network cabling problem that is external to the server.
The model number and serial number are located on the ID label on the front of the
server as shown in the following illustration.
Note: The illustrations in this document might differ slightly from your hardware.
ID label
174 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
– Is the failure repeatable?
– Has this configuration ever worked?
– What changes, if any, were made before the configuration failed?
– Is this the original reported failure?
v Diagnostic program type and version level
v Hardware configuration (print screen of the system summary)
v IMM firmware level
v Operating system software
You can solve some problems by comparing the configuration and software setups
between working and nonworking servers. When you compare servers to each
other for diagnostic purposes, consider them identical only if all the following factors
are exactly the same in all the servers:
v Machine type and model
v IMM firmware level
v Adapters and attachments, in the same locations
v Address jumpers, terminators, and cabling
v Software versions and levels
v Diagnostic program type and version level
v Configuration option settings
v Operating-system control-file setup
See Appendix A, “Getting help and technical assistance,” on page 305 for
information about calling IBM for service.
For information about the terms of the warranty, see the Warranty Information
document that comes with the server.
178 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
The following table lists the part numbers for the server components.
Table 10. Parts listing, Types 4254 and 7944
CRU part CRU part
number number FRU part
Index Description (Tier 1) (Tier 2) number
1 Top cover 59Y3927
2 Filler, PCI 59Y3969
3 PCI-X riser card 69Y4570
4 PCI Express riser card, x16, assembly 43V7066
5 Memory, 2 GB dual-rank DDR3 UDIMM 44T1573
5 Memory, 4 GB dual-rank DDR3-1333 RDIMM 1.35V 49Y1412
5 Memory, 2 GB single-rank DDR 3 RDIMM 49Y1421
5 Memory, 4 GB dual-rank DDR3 UDIMM 49Y1422
5 Memory, 2 GB single-rank DDR3 RDIMM 49Y1423
5 Memory, 4 GB single-rank DDR3 RDIMM 49Y1424
5 Memory, 4 GB dual-rank DDR3 RDIMM 49Y1425
5 Memory, 1 GB single-rank DDR3-1333 RDIMM 49Y1442
5 Memory, 2 GB dual-rank DDR3-1333 RDIMM 49Y1443
5 Memory, 2 GB single-rank DDR3-1333 RDIMM 49Y1444
5 Memory, 4 GB dual-rank DDR3-1333 RDIMM 49Y1445
5 Memory, 8 GB dual-rank DDR3-1333 RDIMM 49Y1446
5 Memory, 8 GB dual-rank PC3L-10600R-999 RDIMM 1.35V 49Y1415
5 Memory, 8 GB dual-rank PC3L-8500R-777 RDIMM 1.35V 49Y1416
5 Memory, 16 GB dual-rank PC3L-10600 DDR3-1333 RDIMM 49Y1565
1.35V
5 Memory, 16 GB quad-rank PC3-8500R-777 RDIMM 46C7489
6 Virtual media key 46C7528
7 Power supply bay filler 49Y4821
8 Power supply, 675 Watt, ac 39Y7236
8 Power supply, 675 Watt, ac 39Y7227
8 Power supply, 675 Watt, ac 39Y7225
8 Power supply, 675 Watt HE, ac 39Y7218
8 Power supply, 675 Watt, dc 39Y7215
8 Power supply, 460 Watt 39Y7229
8 Power supply, 460 Watt 39Y7231
9 Chassis assembly (without front bezel) 59Y3931
10 DVD-ROM drive 44W3254
10 DVD-RW drive 44W3256
11 Rack latch kit 49Y4815
12 Bezel, 8 hard disk bays 59Y3916
12 Bezel, 4 hard disk and 1 optical drive bays 59Y3917
13 Operator information panel assembly 44E4372
14 Filler, hard disk drive bay 59Y3925
Chapter 4. Parts listing, System x3550 M3 Types 4254 and 7944 179
Table 10. Parts listing, Types 4254 and 7944 (continued)
CRU part CRU part
number number FRU part
Index Description (Tier 1) (Tier 2) number
15 Hard disk drive, 2.5-inch simple-swap, 31.4 GB 43W7684
16 Filler, EMC 44T2248
17 Hard disk drive, 2.5-inch hot-swap, 73 GB, 10 K 43W7537
17 Hard disk drive, 2.5-inch hot-swap, 73 GB, 15 K 43W7546
17 Hard disk drive, 2.5-inch hot-swap, 146 GB, 10 K 43W7538
17 Hard disk drive, 2.5-inch hot-swap, 146 GB, 15 K 42D0678
17 Hard disk drive, 2.5-inch hot-swap, 146 GB, Slim-HS, 15 K 44W2295
17 Hard disk drive, 2.5-inch hot-swap, 300 GB, 10 K 42D0638
17 Hard disk drive, 2.5-inch hot-swap, 300 GB, Slim-HS, 10 K 44W2265
17 Hard disk drive, 2.5-inch hot-swap, 500 GB, 7.2 K 42D0708
17 Hard disk drive, 2.5-inch hot-swap, 600 GB, 10 K 49Y2004
18 Backplate assembly, simple-swap 59Y3919
19 Backplane, SAS 59Y3915
20 Fan 43V6929
21 System board 81Y6625
21 System board 69Y5082
22 ServeRAID-M1015 adapter 46M0861
23 ServeRAID-BR10il v2 adapter 49Y4737
24 SAS/SATA riser card 43V7067
25 Retention module, heat sink 49Y4822
26 Microprocessor, Intel Xeon E5506 2.13 GHz, 4 MB, 80W 46D1270
(quad core)
26 Microprocessor, Intel Xeon X5670 2.93 GHz, 12 MB, 95W 49Y7038
(six core)
26 Microprocessor, Intel Xeon X5650 2.66 GHz, 12 MB, 95W 49Y7040
(six core)
26 Microprocessor, Intel Xeon E5640 2.66 GHz, 12 MB, 80W 49Y7051
(quad core)
26 Microprocessor, Intel Xeon E5630 2.53 GHz, 12 MB, 80W 49Y7052
(quad core)
26 Microprocessor, Intel Xeon E5620 2.40 GHz, 12 MB, 80W 49Y7053
(quad core)
26 Microprocessor, Intel Xeon L5640 2.26 GHz, 12 MB, 60W 49Y7054
(six core)
26 Microprocessor, Intel Xeon L5630 2.13 GHz, 12 MB, 40W 59Y3691
(quad core)
26 Microprocessor, Intel Xeon E5507 2.26 GHz, 4 MB, 80W 69Y0782
(quad core)
26 Microprocessor, Intel Xeon L5609 1.86 GHz, 12 MB, 40W 69Y0783
(quad core)
26 Microprocessor, Intel Xeon X5680 3.33 GHz, 12 MB, 130W 69Y0849
(six core)
180 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 10. Parts listing, Types 4254 and 7944 (continued)
CRU part CRU part
number number FRU part
Index Description (Tier 1) (Tier 2) number
26 Microprocessor, Intel Xeon E5645 2.40 GHz, 12 MB, 80W 69Y4714
(six core)
26 Microprocessor, Intel Xeon E5603 1.60 GHz, 4 MB, 80W 81Y5952
(quad core)
26 Microprocessor, Intel Xeon E5606 2.13 GHz, 8 MB, 80W 81Y5953
(quad core)
26 Microprocessor, Intel Xeon E5607 2.26 GHz, 8 MB, 80W 81Y5954
(quad core)
26 Microprocessor, Intel Xeon E5649 2.53 GHz, 12 MB, 80W 81Y5955
(six core)
26 Microprocessor, Intel Xeon X5647 2.93 GHz, 12 MB, 130W 81Y5956
(quad core)
26 Microprocessor, Intel Xeon X5672 3.20 GHz, 12 MB, 95W 81Y5957
(quad core)
26 Microprocessor, Intel Xeon X5675 3.06GHz, 12MB, 95W 81Y5958
(six core)
26 Microprocessor, Intel Xeon X5687 3.60 GHz, 12 MB, 130W 81Y5959
(quad core)
26 Microprocessor, Intel Xeon X5690 3.46GHz, 12 MB, 130W 81Y5960
(six core)
27 Heat sink assembly 49Y4820
27 Heat sink assembly, 130W 69Y1207
28 Air baffle kit (Microprocessor air baffle, clamp, and DIMM 59Y3914
air duct)
Battery, 3.0 volt 33F8354
Cable, hard disk drive configuration 59Y3918
Cable, operator panel 46C4139
Cable, SAS power 59Y3920
Cable, SAS signal, 300 mm 49Y4850
Cable, SAS signal, 450 mm 59Y3921
Cable, SAS signal, 710 mm 69Y1328
Cable, SATA DVD 59Y3922
Cable, USB/video 59Y3923
Cable, line cord 39M5377
Cage, hard disk drive 59Y3968
Cage, optical drive 59Y3924
CPU extraction tool 81Y9398
Safety cover 49Y4823
Filler, DVD drive bay 49Y4868
2-port 1 Gb Ethernet card 69Y4509
2-port 1 Gb Ethernet card, mechanical kit 69Y4586
Labels, chassis 59Y3998
Chapter 4. Parts listing, System x3550 M3 Types 4254 and 7944 181
Table 10. Parts listing, Types 4254 and 7944 (continued)
CRU part CRU part
number number FRU part
Index Description (Tier 1) (Tier 2) number
Labels, system service 59Y3926
Riser-card bracket: full-height, half-length 43V6936
Riser-card bracket: low-profile 43V6939
Bracket assembly, rear I/O 43V6938
SAS adapter retainer 49Y4852
ServeRAID-MR10i adapter 43W4297
ServeRAID-M5015 adapter 46M0851
ServeRAID-M5014 adapter 46M0918
ServeRAID M5000 advanced feature key 46M0931
ServeRAID-BR10i adapter 44E8690
ServeRAID B5015 Solid State Drive (SSD) adapter 46M0970
ServeRAID-MR10i battery carrier kit 44E8763
Cable, ServeRAID battery (with interposer card) 44E8844
IBM 3Gb SAS HBA v2 44E8701
NetXtreme II 1000 Express quad port Ethernet adapter 49Y4222
QLogic 10Gb SFP+ SR optical transceiver 42C1816
QLogic 10Gb CNA for IBM System x 42C1802
Brocade 10Gb SFP+ SR optical transceiver 42C1819
Brocade 10Gb CNA for IBM System x 42C1822
Video adapter, NVIDIA FX 1700 43V5765
Video adapter, NVIDIA FX 570 43V5782
Video adapter, NVIDIA FX 580 43V5890
Slide rail kit 59Y3792
Slide rail kit 69Y5085
CMA kit 49Y4817
Slide rail kit, Gen-II 69Y4391
CMA kit, Gen-II 69Y4392
Miscellaneous parts kit 69Y5639
Screw kit 59Y4922
Thermal grease kit 41Y9292
Hypervisor, embedded USB flash device 42D0545
Alcohol wipes 59P4739
Consumable parts
Consumable parts are not covered by the IBM Statement of Limited Warranty. The
following consumable parts are available for purchase from the retail store.
182 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Table 11. Consumable parts, Types 4254 and 7944
Index Description Part number
ServeRAID M5000 battery 43W4342
If you need help with your order, call the toll-free number that is listed on the retail
parts page, or contact your local IBM representative for assistance.
Chapter 4. Parts listing, System x3550 M3 Types 4254 and 7944 183
Power cords
For your safety, IBM provides a power cord with a grounded attachment plug to use
with this IBM product. To avoid electrical shock, always use the power cord and
plug with a properly grounded outlet.
IBM power cords used in the United States and Canada are listed by Underwriter's
Laboratories (UL) and certified by the Canadian Standards Association (CSA).
For units intended to be operated at 115 volts: Use a UL-listed and CSA-certified
cord set consisting of a minimum 18 AWG, Type SVT or SJT, three-conductor cord,
a maximum of 15 feet in length and a parallel blade, grounding-type attachment
plug rated 15 amperes, 125 volts.
For units intended to be operated at 230 volts (U.S.): Use a UL-listed and
CSA-certified cord set consisting of a minimum 18 AWG, Type SVT or SJT,
three-conductor cord, a maximum of 15 feet in length and a tandem blade,
grounding-type attachment plug rated 15 amperes, 250 volts.
For units intended to be operated at 230 volts (outside the U.S.): Use a cord set
with a grounding-type attachment plug. The cord set should have the appropriate
safety approvals for the country in which the equipment will be installed.
IBM power cords for a specific country or region are usually available only in that
country or region.
184 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
IBM power cord part
number Used in these countries and regions
39M5151 Abu Dhabi, Bahrain, Botswana, Brunei Darussalam, Channel
Islands, China (Hong Kong S.A.R.), Cyprus, Dominica, Gambia,
Ghana, Grenada, Iraq, Ireland, Jordan, Kenya, Kuwait, Liberia,
Malawi, Malaysia, Malta, Myanmar (Burma), Nigeria, Oman,
Polynesia, Qatar, Saint Kitts and Nevis, Saint Lucia, Saint Vincent
and the Grenadines, Seychelles, Sierra Leone, Singapore, Sudan,
Tanzania (United Republic of), Trinidad and Tobago, United Arab
Emirates (Dubai), United Kingdom, Yemen, Zambia, Zimbabwe
39M5158 Liechtenstein, Switzerland
39M5165 Chile, Italy, Libyan Arab Jamahiriya
39M5172 Israel
39M5095 220 - 240 V
Chapter 4. Parts listing, System x3550 M3 Types 4254 and 7944 185
186 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Chapter 5. Removing and replacing server components
Replaceable components are of four types:
v Consumables: Purchase and replacement of consumables (components, such
as batteries and printer cartridges, that have depleting life) is your responsibility.
If IBM acquires or installs a consumable component at your request, you will be
charged for the service.
v Tier 1 customer replaceable unit (CRU): Replacement of Tier 1 CRUs is your
responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for
the installation.
v Tier 2 customer replaceable unit: You may install a Tier 2 CRU yourself or
request IBM to install it, at no additional charge, under the type of warranty
service that is designated for your server.
v Field replaceable unit (FRU): FRUs must be installed only by Trained service
technicians.
See Chapter 4, “Parts listing, System x3550 M3 Types 4254 and 7944,” on page
177 to determine whether a component is a Tier 1 CRU, Tier 2 CRU, or FRU.
For information about the terms of the warranty, see the Warranty Information
document that comes with the server.
Installation guidelines
Attention: Static electricity that is released to internal server components when
the server is powered-on might cause the system to halt, which might result in the
loss of data. To avoid this potential problem, always use an electrostatic-discharge
wrist strap or other grounding system when removing or installing a hot-swap
device.
188 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v You have followed the cabling instructions that come with optional adapters.
v You have replaced a failed fan within 48 hours.
v You have replaced a hot-swap fan within 30 seconds of removal.
v You have kept the preinstalled air deflector in place unless directed to remove it
in this publication or by IBM Service. See “Removing the microprocessor 2 air
baffle” on page 196 for the location of the air deflector in the server.
You might have to have the server turned on while the cover is off, to look at light
path diagnostics LEDs or replace hot swap components. Follow these guidelines
when you work inside a server that is turned on:
v Avoid wearing loose-fitting clothing on your forearms. Button long-sleeved shirts
before working inside the server; do not wear cuff links while you are working
inside the server.
v Do not allow your necktie or scarf to hang inside the server.
v Remove jewelry, such as bracelets, necklaces, rings, and loose-fitting wrist
watches.
v Remove items from your shirt pocket, such as pens and pencils, that could fall
into the server as you lean over it.
v Avoid dropping any metallic objects, such as paper clips, hairpins, and screws,
into the server.
The following illustration shows the internal routing and connector for the optional
optical drive cable.
Attention: To disconnect the optional optical drive cable, you must first press the
connector release tab, and then disconnect the cable from the connector on the
system board. Do not disconnect the cable by using excessive force.
190 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Chapter 5. Removing and replacing server components 191
The following illustration shows the internal routing and connector for the operator
information panel cable. The following notes describe additional information you
must consider when you install or remove the operator information panel cable:
v You may remove the optional optical drive cable to obtain more room before you
install or remove the operator information panel cable.
v To remove the operator information panel cable, slightly press the cable toward
the chassis; then, pull to remove the cable from the connector on the system
board. Pulling the cable out of the connector by excessive force might cause
damage to the cable or connector.
v To connect the operator information panel cable on the system board, press
evenly on the cable. Pressing on one side of the cable might cause damage to
the cable or connector.
The following illustration shows the internal routing and connector for the USB/video
cable. The following notes describes other information you must consider when you
install or remove the USB/video cable:
v You may remove the optional optical drive cable to obtain more room before you
install or remove the operator information panel cable.
v The USB/video cable is routed under the video cable and then both the USB and
video cables are routed under the cable retention tab and the top cover latch
receptacle.
v To remove the USB/video cable, slightly press the cable toward the chassis;
then, pull to remove the cable from the connector on the system board. Pulling
the cable out of the connector by excessive force might cause damage to the
cable or connector.
v To connect the USB/video cable on the system board, press evenly on the cable.
Pressing on one side of the cable might cause damage to the cable or connector.
192 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Chapter 5. Removing and replacing server components 193
The following illustration shows the internal routing for the SATA power cable and
signal cable.
194 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Removing and replacing consumable parts and Tier 1 CRUs
Replacement of consumable parts and Tier 1 CRUs is your responsibility. If IBM
installs a consumable part or Tier 1 CRU at your request, you will be charged for
the installation.
Note: The illustrations in this document might differ slightly from your hardware.
Note: When you disconnect the power source from the server, you lose the
ability to view the LEDs because the LEDs are not lit when the power source is
removed. Before you disconnect the power source, make a note of which LEDs
are lit, including the LEDs that are lit on the operation information panel, on the
light path diagnostics panel, and LEDs inside the server on the system board;
then, see “Light path diagnostics LEDs” on page 122 for information on how to
solve the problem.
Cover
Tab
3. If the server has been installed in a rack, slide the server out from the rack
enclosure.
4. Press down firmly on the blue tab on the top (near the front of the server) of the
cover and slide the cover toward the rear of the server until the cover has
disengaged from the chassis.
5. Lift the server cover off the server and set it aside.
Attention: For proper cooling and airflow, replace the server cover before you
turn on the server.
Cover
Tab
196 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Baffle tab
Microprocessor 2
air baffle
Baffle pin
198 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Installing the DIMM air baffle
To install the DIMM air baffle, complete the following steps:
1. Align the DIMM air baffle over the DIMMs so that the baffle pin on the left side
of the air baffle aligns with the pin hole next to DIMM connector 8 on the system
board, and lower it into the server.
Baffle pin
7. If you are instructed to return the adapter, follow all packaging instructions, and
use any packaging materials for shipping that are supplied to you.
200 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Installing an adapter
The following notes describe the types of adapters that the server supports and
other information that you must consider when you install an adapter:
v Locate the documentation that comes with the adapter and follow those
instructions in addition to the instructions in this section.
v The server provides one SAS/SATA RAID riser-card slot on the system board.
See “System-board optional device connectors” on page 22 for the location of the
SAS/SATA RAID riser-card slot. The ServeRAID-BR10il v2, ServeRAID-M1015,
ServeRAID-M5014, or ServeRAID-M5015 adapter comes installed on some
models of the server. You can replace the IBM ServeRAID SAS/SATA adapter
with an optional IBM ServeRAID SAS/SATA adapter in the slot. For configuration
information, see the ServeRAID documentation at [Link]
support/,
Important: To ensure that any of your ServeRAID 10i, 10is, or 10M adapters
function properly on UEFI-based servers, make sure that the adapter firmware
level is updated to at least 11.x.x-XXX , and the supporting drivers.
Attention: Some cluster solutions require specific code levels or coordinated
code updates. If the device is part of a cluster solution, verify that the latest level
of code is supported for the cluster solution before you update the code.
v Do not set the maximum digital video adapter resolution above 1600 x 1200 at
75 Hz for an LCD monitor. This is the highest resolution that is supported for any
add-on video adapter that you install in the server.
v Any high-definition video-out connector or stereo connector on any add-on video
adapter is not supported
v The server does not support full-length, full-height PCI-X adapters or legacy 5 V
PCI adapters.
v When you install any PCI adapter, the power cords must be disconnected from
the power source before you remove the PCI Express riser-card assembly and
the PCI-X riser-card assembly. Otherwise, the active power management event
signal will be disabled by the system-board logic, and the Wake on LAN feature
might not work. However, after the server is powered-on locally, the active power
manager active power management event signal will be enabled by the
system-board logic.
v If you switch the PCI adapter from slot 1 riser assembly to slot 2, you will need to
remove the screw from the standard bracket and then replace the standard
bracket with the low-profile bracket.
v The server provides two PCI riser-card slots on the system board. A PCI riser
assembly with a bracket is installed in slot 2. The following information indicates
the riser-card slots and the type of adapters that the riser cards support:
– Standard models of the server come with two PCI Express riser-card
assemblies installed. If you want to replace them with PCI-X riser-card
assemblies, you must order the PCI-X riser-card assembly option, which
includes the bracket.
– A PCI Express riser-card assembly has a black connector and supports PCI
Express adapters, and a PCI-X riser-card assembly has a white (light in color)
connector and supports PCI-X adapters.
– PCI riser slot 1 (the farthest slot from the power supplies). This slot supports
only low-profile adapters.
– PCI riser slot 2 (the closest slot to the power supplies). This slot supports only
full-height, half-length adapters. You must install a PCI riser-card assembly in
slot 2 even if you do not install an adapter.
The following table lists the supported configurations for the PCI rise-cardr slots.
Notes:
1. The instructions in this section apply to any PCI adapter (for example, video
graphics adapters or network adapters).
2. Do not set the maximum digital video adapter resolution above 1600 x 1200 at
75 Hz for an LCD monitor. This is the highest resolution that is supported for
any add-on video adapter that you install in the server.
3. Any high-definition video-out connector or stereo connector on any add-on video
adapter is not supported.
202 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Attention: When you install an adapter, make sure that the adapter is
correctly seated in the riser-card assembly and that the riser-card assembly is
securely seated in the riser-card connector on the system board before you
turn on the server. An incorrectly seated adapter might cause damage to the
system board, the riser-card assembly, or the adapter.
5. Install the riser-card assembly in the server (see “Installing a PCI riser-card
assembly” on page 257).
6. Perform any configuration tasks that are required for the adapter.
7. Install the cover (see “Installing the cover” on page 195).
8. Slide the server into the rack.
9. Reconnect the power cord and any cables that you removed.
10. Turn on the peripheral devices and the server.
6. If you are instructed to return the SAS/SATA RAID riser-card assembly, follow all
packaging instructions, and use any packaging materials for shipping that are
supplied to you.
204 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
6. Connect the signal cables to the SAS/SATA adapter (see step 9 on page 224).
7. Install the cover (see “Installing the cover” on page 195).
8. Slide the server into the rack.
9. Reconnect the power cord and any cables that you removed.
10. Turn on the peripheral devices and the server.
Drive handle
4. Grasp the handle and slide the drive out of the drive bay.
5. If you are instructed to return the drive assembly, follow all packaging
instructions, and use any packaging materials for shipping that are supplied to
you.
To install a 2.5-inch hot-swap SAS or SATA hard disk drive, complete the following
steps:
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 187.
2. Remove the filler panel from the empty drive bay.
3. Touch the static-protective package that contains the drive to any unpainted
metal surface on the server; then, remove the drive from the package and place
it on a static-protective surface.
4. Install the hard disk drive in the drive bay:
a. Make sure that the tray handle is in the open (unlocked) position.
b. Align the drive assembly with the guide rails in the bay.
206 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Drive-tray
assembly
Drive handle
Filler panel
c. Gently push the drive assembly into the bay until the drive stops.
d. Rotate the tray handle to the closed (locked) position.
e. Check the hard disk drive status LED to verify that the hard disk drive is
operating correctly. If the amber hard disk drive status LED for a drive is lit
continuously, that drive is faulty and must be replaced. If the green hard disk
drive activity LED is flashing, the drive is being accessed.
Attention:
v To avoid damage to the hard disk drive connectors, make sure that the server
cover is in place and fully closed whenever you install or remove a hard disk
drive.
Note: When you disconnect the power source from the server, you lose the
ability to view the LEDs because the LEDs are not lit when the power source is
removed. Before you disconnect the power source, make a note of which LEDs
are lit, including the LEDs that are lit on the operation information panel, on the
light path diagnostics panel, and LEDs inside the server on the system board;
then, see Chapter 3, “Diagnostics,” on page 23 for information about how to
solve the problem.
3. Remove the filler panel from the drive bay.
4. Slide the blue release latch to the right with one finger (to release the drive)
while using another finger to grasp the black drive handle and pull the hard disk
drives out of the drive bay.
5. If you are instructed to return the drive assembly, follow all packaging
instructions, and use any packaging materials for shipping that are supplied to
you.
208 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v The sequence for installing the hard disk drives is to install the drives starting
from bay 0, 1, 2, and 3.
v Simple-swap models of the server do not support the Hypervisor USB flash
device and the installation of the SAS/SATA riser card is not supported.
v The simple-swap server models are available only in non-RAID configurations.
v The electromagnetic interference (EMI) integrity and cooling of the server are
protected by having all bays and PCI and PCI Express slots covered or
occupied. When you install a drive, PCI, or PCI Express adapter, save the EMC
shield and filler panel from the bay or PCI or PCI Express adapter slot cover in
the event that you later remove the device.
v For a complete list of supported optional devices for the server, see
[Link]
To install a 2.5-inch simple-swap SATA hard disk drive, complete the following
steps:
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 187.
2. Remove the filler panel from the empty drive bay.
3. Touch the static-protective package that contains the drive to any unpainted
metal surface on the server; then, remove the drive from the package and place
it on a static-protective surface.
4. Install the hard disk drive in the drive bay:
a. Grasp the black drive handle and slide the blue release latch to the right
and align the drive assembly with the guide rails in the bay.
b. Gently push the drive assembly into the bay until the drive stops.
5. Reinstall the drive bay filler panel that you removed earlier.
6. If you are installing additional simple-swap hard disk drives, do so now.
7. Turn on the peripheral devices and the server.
5. Slide the drive retention clip from the side of the drive. Save the clip to use
when you install the replacement drive or replace the DVD drive filler panel.
6. Attach the drive retention clip to the side of the DVD drive filler panel hat you
removed in step 6 on page 211.
7. Slide the DVD drive filler panel into the DVD drive bay until the DVD drive filler
panel clicks into place.
210 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
DVD drive filler panel
8. If you are instructed to return the DVD drive, follow all packaging instructions,
and use any packaging materials for shipping that are supplied to you.
If you need to install an optional DVD drive, complete the following steps:
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 187.
2. Turn off the server and peripheral devices and disconnect the power cords and
all external cables.
3. Remove the cover (see “Removing the cover” on page 195).
4. If your server does not come with an optical drive bay, install the optical drive
cage and replace the bezel (see “Installing an optional optical drive cage” on
page 251).
5. Disconnect the DVD drive cable (see “Removing the DVD cable” on page
252).
6. Remove the DVD drive filler panel if it is installed. locate the blue release tab
on the rear of the DVD drive filler panel; then, while you press the tab, push
the DVD drive filler panel out of the drive bay.
7. Remove the retention clip from the side of the DVD drive filler panel. Save the
DVD drive filler panel for future use.
Note: If you are installing an optical drive that contains a laser, observe the
following safety precaution.
Statement 3:
CAUTION:
When laser products (such as CD-ROMs, DVD drives, fiber optic devices,
or transmitters) are installed, note the following:
v Do not remove the covers. Removing the covers of the laser product
could result in exposure to hazardous laser radiation. There are no
serviceable parts inside the device.
v Use of controls or adjustments or performance of procedures other
than those specified herein might result in hazardous radiation
exposure.
212 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
DANGER
Some laser products contain an embedded Class 3A or Class 3B laser
diode. Note the following.
Laser radiation when open. Do not stare into the beam, do not view
directly with optical instruments, and avoid direct exposure to the
beam.
8. Touch the static-protective package that contains the new optical drive to any
unpainted metal surface on the server; then, remove the drive from the
package and place it on a static-protective surface.
9. Follow the instructions that come with the drive to set any jumpers or switches.
10. Attach the drive retention clip that you removed from the DVD drive filler panel
to the side of the new DVD drive.
11. Align the DVD drive in the drive bay and slide the DVD drive into the optical
drive bay until the DVD drive clicks into place.
Note: The DVD cable should go on the top of the operation information panel
cable (in the middle) and the Video/USB cable (on the bottom) when all three
cables are installed in the server.
13. Reconnect the power cord and any cables that you removed.
14. Turn on the peripheral devices and the server.
214 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Removing a memory module
To remove a dual inline memory module (DIMM), complete the following steps:
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 187.
2. Turn off the server and peripheral devices and disconnect all power cords.
3. Remove the cover (see “Removing the cover” on page 195).
4. Remove the DIMM air baffle (see “Removing the DIMM air baffle” on page 198).
5. Carefully open the retaining clips on each end of the DIMM connector and
remove the DIMM.
Attention: To avoid breaking the retaining clips or damaging the DIMM
connectors, open and close the clips gently.
6. If you are instructed to return the DIMM, follow all packaging instructions, and
use any packaging materials for shipping that are supplied to you.
Note: To determine the type of a DIMM, see the label on the DIMM. The
information on the label is in the format xxxxx nRxxx PC3-xxxxx-xx-xx-xxx. The
numeral in the sixth numerical position indicates whether the DIMM is single-rank
(n=1), dual-rank (n=2), or quad-rank (n=4).
v The following rules apply to DDR3 DIMM speed as it relates to the number of
DIMMs in a channel:
– When you install 1 DIMM per channel, the memory runs at 1333 MHz
– When you install 2 DIMMs per channel, the memory runs at 1066 MHz
– When you install 3 DIMMs per channel, the memory runs at 800 MHz
216 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
– All channels in a server run at the fastest common frequency
– Do not install registered and unbuffered DIMMs in the same server
v The maximum memory speed is determined by the combination of the
microprocessor, DIMM speed, and the number of DIMMs installed in each
channel.
v In two-DIMM-per-channel configuration, a server with an Intel Xeon X5600 series
microprocessor automatically operates with a maximum memory speed of up to
1333 MHz when one of the following conditions is met:
– Two 1.5 V single-rank or dual-rank RDIMMs are installed in the same channel.
In the Setup utility, Memory speed is set to Max performance mode
– Two 1.35 V single-rank or dual-ranl RDIMMs are installed in the same
channel. In the Setup utility, Memory speed is set to Max performance and
LV-DIMM power is set to Enhance performance mode. The 1.35 V RDIMMs
will function at 1.5 V
v The server supports a maximum of 18 single-rank or dual-rank RDIMMs. The
server supports up to 12 single-rank or dual-rank UDIMMs or quad-rank
RDIMMs.
v The server supports three single-rank or dual-rank DIMMs per channel. The
server supports a maximum of two quad-rank RDIMMs per channel. The
following table shows an example of the maximum amount of memory that you
can install using ranked DIMMs:
Table 13. Maximum memory installation using ranked DIMMs
Number of DIMMs DIMM type DIMM size Total memory
12 Single-rank UDIMMs 2 GB 24 GB
12 Dual-rank UDIMMs 4 GB 48 GB
18 Single-rank RDIMMs 2 GB 36 GB
18 Dual-rank RDIMMs 2 GB 36 GB
18 Dual-rank RDIMMs 4 GB 72 GB
18 Dual-rank RDIMMs 8 GB 144 GB
12 Quad-rank RDIMMs 16 GB 192 GB
v The RDIMM options that are available for the server are 2 GB, 4 GB, 8 GB, and
16 GB. The server supports a minimum of 2 GB and a maximum of 192 GB of
system memory using RDIMMs.
For 32-bit operating systems only: Some memory is reserved for various
system resources and is unavailable to the operating system. The amount of
memory that is reserved for system resources depends on the operating system,
the configuration of the server, and the configured PCI devices.
v The UDIMM options that are available for the server are 2 GB and 4 GB. The
server supports a minimum of 2 GB and a maximum of 48 GB of system memory
using UDIMMs.
The following illustration shows the location of the DIMM connectors on the system
board.
218 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Memory mirroring
Memory-mirroring mode replicates and stores data on two pairs of DIMMs within
two channels simultaneously. If a failure occurs, the memory controller switches
from the primary pair of memory DIMMs to the backup pair of DIMMs. To enable
memory mirroring through the Setup utility, select System Settings → Memory. For
more information, see “Using the Setup utility” on page 285. When you use the
memory mirroring feature, consider the following information:
v When you use memory mirroring, you must install a pair of DIMMs at a time.
One DIMM must be in channel 0, and the mirroring DIMM must be in the same
slot in channel 1. The two DIMMs in each pair must be identical in size, type,
and rank (single or dual), and organization, but not in speed. The channels run at
the speed of the slowest DIMM in any of the channels.
v Channel 2, DIMM connectors 7, 8, 9, 16, 17, and 18 are not used in
memory-mirroring mode.
v The maximum available memory is reduced to half of the installed memory when
memory mirroring is enabled. For example, if you install 64 GB of memory using
RDIMMs, only 32 GB of addressable memory is available when you use memory
mirroring.
The following diagram shows the memory channel interface layout with the DIMM
installation sequence for mirroring mode. The numbers within the boxes indicate the
DIMM population sequence in pairs within the channels, and the numbers next to
the boxes indicate the DIMM connectors within the channels. For example, the
following illustration shows the first pair of DIMMs (indicated by ones (1) inside the
boxes) should be installed in DIMM connectors 1 on channel 0 and DIMM
connector 2 on channel 1. DIMM connectors 3, 6, 9, 12, 15, and 18 on channel 2
are not used in memory-mirroring mode.
1 2 3 CH0 CH0 6 5 4
3 2 1 10 11 12
CPU1 CPU2
QPI
1 2 3 CH1 CH1 6 5 4
6 5 4 13 14 15
CH2 CH2
9 8 7 16 17 18
The following table lists the DIMM connectors on each memory channel.
Table 15. Connectors on each memory channel
Memory channel DIMM connector
Channel 0 1, 2, 3, 10, 11, 12
Channel 1 4, 5, 6, 13, 14, 15
Channel 2 7, 8, 9, 16, 17, 18
The following illustration shows the memory connector layout that is associated with
each microprocessor. For example, DIMM connectors 10, 11, 12, 13, 14, 15, 16, 17,
Note: You can install DIMMs for microprocessor 2 as soon as you install
microprocessor 2; you do not have to wait until all of the DIMM slots for
microprocessor 1 are filled.
CPU2 6 5 4 6 5 4
10 11 12 13 14 15 16 17 18
1 2 3 1 2 3 CPU1
9 8 7 6 5 4 3 2 1
Figure 2. Memory connectors associated with each microprocessor for memory mirroring
The following table shows the installation sequence for installing DIMMs in
memory-mirroring mode:
Table 16. Memory mirroring mode DIMM population sequence
Number of installed
Number of DIMMs microprocessor DIMM connector
First pair of DIMMs 1 3, 6
Second pair of DIMMs 1 2, 5
Third pair of DIMMs 1 1, 4
Fourth pair of DIMMs 2 12, 15
Fifth pair of DIMMs 2 11, 14
Sixth pair of DIMMs 2 10, 13
Note: DIMM connectors 7, 8, 9, 16, 17, and 18 are not used in memory-mirroring mode.
Online-spare memory
The memory online-spare feature disables the failed memory from the system
configuration and activates an online-spare DIMM to replace the failed active DIMM.
You can enable either online-spare memory or memory mirroring in the Setup utility
(see “Using the Setup utility” on page 285). When you use the memory online-spare
feature, consider the following information:
v The memory online-spare feature is supported on server models with an Intel
Xeon™ 5600 series microprocessor.
v When you enable the memory online-spare feature, you must install three DIMMs
per microprocessor at a time. The first DIMM must be in channel 0, the second
DIMM in channel 1, and the third DIMM in channel 2. The DIMMs must be
identical in size, type, rank, and organization, but not in speed. The channels run
at the speed of the slowest DIMM in any of the channels.
220 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v The maximum available memory is reduced to 2/3 of the installed memory when
memory online-spare mode is enabled. For example, if you install 72 GB of
memory using RDIMMs, only 48 GB of addressable memory is available when
you use memory online-spare.
The following table shows the installation sequence for installing DIMMs for each
microprocessor in memory online-spare mode:
Table 17. Memory online-spare mode DIMM population
sequence
Installed
Microprocessor DIMM connector
Microprocessor 1 3, 6, 9
3, 6, 9, 2, 5, 8
3, 6, 9, 2, 5, 8, 1, 4, 7
Microprocessor 2 12, 15, 18,
12, 15, 18, 11, 14, 17,
12, 15, 18, 11, 14, 17, 10, 13, 16
Installing a DIMM
To install a DIMM, complete the following steps:
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 187.
2. Turn off the server and peripheral devices and disconnect the power cords and
all external cables, if necessary.
3. Remove the cover (see view).
4. Open the retaining clip on each end of the DIMM connector.
Attention: To avoid breaking the retaining clips or damaging the DIMM
connectors, open and close the clips gently.
5. Touch the static-protective package that contains the DIMM to any unpainted
metal surface on the outside of the server. Then, remove the DIMM from the
package.
6. Turn the DIMM so that the DIMM keys align correctly with the connector.
7. Insert the DIMM into the connector by aligning the edges of the DIMM with the
slots at the ends of the DIMM connector (see “System-board optional device
connectors” on page 22 for the locations of the DIMM connectors).
8. Firmly press the DIMM straight down into the connector by applying pressure
on both ends of the DIMM simultaneously. The retaining clips snap into the
locked position when the DIMM is firmly seated in the connector.
Depending on the server model, the server comes with a ServeRAID-BR10il v2,
ServeRAID-M1015, ServeRAID-M5014, or a ServeRAID-M5015 SAS/SATA adapter
installed.
You can replace the ServeRAID controller with another supported ServeRAID
controller. For a list of supported ServeRAID controllers, see [Link]
servers/eserver/serverproven/compat/us/.
To remove the SAS/SATA adapter from a SAS/SATA RAID riser card, complete the
following steps:
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 187.
2. Turn off the server and peripheral devices and disconnect all power cords; then,
remove the cover (see “Removing the cover” on page 195).
3. Locate the SAS/SATA RAID riser card and controller assembly near the left-front
corner of the server.
4. Disconnect the SAS signal cables from the connectors on the SAS/SATA
adapter and note their locations.
5. Remove the SAS/SATA RAID riser card from the server (see “Removing the
SAS/SATA RAID riser-card assembly” on page 203).
222 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Release tab
6. Pull the SAS/SATA adapter horizontally out of the connector on the SAS/SATA
RAID riser card.
Note: If you have installed the optional ServeRAID adapter advanced feature
key, remove it and keep it in future use (see “Removing an optional ServeRAID
adapter advanced feature key” on page 226).
7. If you are instructed to return the SAS/SATA adapter, follow all packaging
instructions, and use any packaging materials for shipping that are supplied to
you.
Note: If you have the optional ServeRAID adapter advanced feature key,
install it first (see “Installing an optional ServeRAID adapter advanced feature
key” on page 228).
5. Align the SAS/SATA adapter so that the keys align correctly with the connector
on the SAS/SATA RAID riser-card assembly.
6. Insert the SAS/SATA adapter into the connector on the SAS/SATA RAID riser
card until it is firmly seated.
Attention: Incomplete insertion might cause damage to the server or the
adapter.
7. Install the SAS/SATA RAID riser-card assembly (see “Installing the SAS/SATA
RAID riser-card assembly” on page 204).
8. Route the signal cables from the drive backplane over the blue adapter
retention bracket.
9. Connect the signal cables to the SAS/SATA adapter:
v ServeRAID-BR10il v2 adapter: Take the signal cable that is attached to the
drive backplane for drive bays 0 through 3 and connect it to the SAS/SATA
connector on the ServeRAID adapter.
224 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v ServeRAID-BR10i, ServeRAID-M1015, ServeRAID-M5014, or
ServeRAID-M5015 adpater: Take the signal cable that is attached to the
drive backplane for drive bays 4 through 7 and connect it to the ServeRAID
adapter connector that is closest to the power supply cage. Connect the
other signal cable so that is attached to the drive backplane for drive bays 0
through 3 and connect it to the other connector on the adapter.
10. Reconnect the power cord and any cables that you removed.
11. Replace the cover (see “Installing the cover” on page 195).
12. Turn on the peripheral devices and the server.
Note: When you restart the server, you are prompted to import the existing RAID
configuration to the new SAS/SATA adapter.
226 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
4. Grasp the feature key and lift to remove it from connector on the ServeRAID
adapter.
5. If you are instructed to return the feature key, follow all packaging instructions,
and use any packaging materials for shipping that are supplied to you.
228 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
5. Reconnect the power cord and any cables that you removed.
6. Install the cover (see “Installing the cover” on page 195).
7. Slide the server into the rack.
8. Turn on the peripheral devices and the server.
6. Reconnect the power cord and any cables that you removed.
7. Install the cover (see “Installing the cover” on page 195).
8. Slide the server into the rack.
9. Turn on the peripheral devices and the server.
230 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Removing a hot-swap ac power supply
When you remove or install a hot-swap power supply, observe the following
precautions.
Statement 5:
CAUTION:
The power control button on the device and the power switch on the power
supply do not turn off the electrical current supplied to the device. The device
also might have more than one power cord. To remove all electrical current
from the device, ensure that all power cords are disconnected from the power
source.
2
1
Statement 8:
CAUTION:
Never remove the cover on a power supply or any part that has the following
label attached.
Hazardous voltage, current, and energy levels are present inside any
component that has this label attached. There are no serviceable parts inside
these components. If you suspect a problem with one of these parts, contact
a service technician.
5. If you are instructed to return the power supply, follow all packaging instructions,
and use any packaging materials for shipping that are supplied to you.
232 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Installing a hot-swap ac power supply
The following notes describe the type of power supply that the server supports and
other information that you must consider when you install a power supply:
v Before you install an additional power supply or replace a power supply with one
of a different wattage, you may use the IBM Power Configurator utility to
determine current system power consumption. For more information and to
download the utility, go to [Link]
[Link].
v The server comes with one hot-swap 12-volt output power supply that connects
to power supply bay 1. The input voltage is 110 V ac or 220 V ac auto-sensing.
v You cannot mix 460-watt and 675-watt power supplies, high-efficiency and
non-high-efficiency power supplies, or ac and dc power supplies in the server.
v The following information applies when you install 460-watt power supplies in the
server:
– A warning message is generated if the total power consumption exceeds 400
watts and the server only has one operational 460-watt power supply. In this
case, the server can still operate under normal condition. Before you install
additional components in the server, you must install an additional power
supply
– The server automatically shuts down if the total power consumption exceeds
the total power supply output capacity
– You can enable the power capping feature in the Setup utility to control and
monitor power consumption in the server (see “Setup utility menu choices” on
page 286)
The following table shows the system status when you install 460-watt power
supplies in the server:
Table 18. System status with 460-watt power supplies installed
Total system power
Number of 460-watt power supplies installed
consumption (in
watts) One Two Two with one failure
< 400 Normal Normal, redundant Normal
power
400 ~ 460 Normal, status Normal, redundant Normal, status
warning power warning
> 460 System shutdown Normal System shutdown
v Power supply 1 is the default/primary power supply. If power supply 1 fails, you
must replace the power supply immediately.
v You can order an optional power supply for redundancy.
v These power supplies are designed for parallel operation. In the event of a
power-supply failure, the redundant power supply continues to power the system.
The server supports a maximum of two power supplies.
v For instructions on how to install a hot-swap dc power supply, refer to the
documentation that comes with the dc power supply.
Statement 5:
2
1
Statement 8
CAUTION:
Never remove the cover on a power supply or any part that has the following
label attached.
Hazardous voltage, current, and energy levels are present inside any
component that has this label attached. There are no serviceable parts inside
these components. If you suspect a problem with one of these parts, contact
a service technician.
234 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
4. Grasp the handle on the rear of the power supply and slide the power supply
forward into the power-supply bay until it clicks. Make sure that the power
supply connects firmly into the power-supply connector.
Attention: Do not install 460-watt and 675-watt power supplies, high-efficiency
and non-high-efficiency power supplies, or ac and dc power supplies in the
server.
5. Route the power cord through the handle so that it does not accidentally
become unplugged.
6. Connect the power cord for the new power supply to the power-cord connector
on the power supply.
7. Connect the other end of the power cord to a properly grounded electrical
outlet.
8. Make sure that the ac power LED and the dc power LED on the ac power
supply are lit, indicating that the power supply is operating correctly. The two
green LEDs are to the right of the power-cord connector.
9. If you are replacing a power supply with one of a different wattage, apply the
power information label provided with the new power supply over the existing
power information label on the server.
x.x/x.x
xx/xx HZ
Fan connector on
the system board
Fan 6
Fan 3
Fan 2
Fan 1 Fan connector
Note: The LED near the connector of the failing fan assembly is lit.
3. Grasp the orange fan tabs on both ends of the existing fan and pull it up out of
the server. If you are removing fans 3 or 4, lift up the clear tab on the DIMM air
baffle first.
236 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Installing a hot-swap fan assembly
The server comes standard with five dual-motor hot-swap cooling fans.
Fan connector on
the system board
Fan 6
Fan 3
Fan 2
Fan 1 Fan connector
4. Orient the fan over the fan slot in the fan assembly bracket so that the fan
connector aligns with the connector on the system board.
4. If you are instructed to return the virtual media key, follow all packaging
instructions, and use any packaging materials for shipping that are supplied to
you.
238 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
4. Replace the cover (see “Installing the cover” on page 195).
5. Slide the server into the rack.
6. Reconnect the power cord and any cables that you removed.
7. Turn on the peripheral devices and the server.
Rubber
stopper
Rubber
stopper
Ethernet
adapter connector
6. Remove the adapter filler panel on the rear of the chassis (if it has not been
removed already).
240 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
7. Install the two standoffs on the system board.
8. Insert the bottom tabs of the metal clip into the port openings from outside the
chassis.
9. While you slightly press the top of the metal clip, rotate the metal clip toward
the front of the server until the metal clip clicks into place. Make sure the metal
clip is securely engaged on the chassis.
Attention: Pressing the top of the metal clip with excessive force may cause
damage to the metal clip.
10. Touch the static-protective package that contains the new adapter to any
unpainted metal surface on the server. Then, remove the adapter from the
package.
11. Align the adapter with the adapter connector on the system board; then, tilt the
adapter so that the port connectors on the adapter line up with the port
openings on the chassis.
Attention: Make sure the port connectors on the adapter are aligned
properly with the chassis on the rear of the server. An incorrectly seated
adapter might cause damage to the system board or the adapter.
13. Reinstall the PCI riser-card assembly in PCI riser connector 1 if you have
removed it previously (see “Installing a PCI riser-card assembly” on page 257).
14. Replace the cover (see “Installing the cover” on page 195).
15. Slide the server into the rack.
16. Reconnect the power cords and any cables that you removed.
17. Turn on the peripheral devices and the server.
242 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
server and remove the battery retention clip.
Battery
retention clip
Microprocessor 2
air baffle
Battery
Release
tab
4. Disconnect the remote battery cable from the remote battery cable connector on
the battery carrier and lift the battery and battery carrier from the slot. See the
following illustration for the location of the connectors on the battery carrier.
5. Disconnect the battery cable from the battery cable connector on the battery
carrier.
6. Remove the three screws that attach the battery carrier to the battery and
remove the battery from the battery carrier.
Note: If your battery and battery carrier are attached with a clip, squeeze the
clip on the side of the battery carrier to remove the battery from the battery
carrier.
If you are instructed to return the RAID adapter battery, follow all packaging
instructions, and use any packaging materials for shipping that are supplied to you.
b. Place the battery carrier in the battery mounting slot on the microprocessor
2 air baffle and ensure that battery carrier posts align with the rings on the
battery mounting slot so that the battery carrier is secure in the slot.
Note: The positioning of the remote battery depends on the type of remote
battery that you install.
244 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Battery
retention clip
Battery
Posts
Posts
Microprocessor 2
air baffle
Battery
mounting
slots
Rings
c. Take the battery retention clip and place it underneath the other tab while
pressing release tab toward the front of the server; then, press it down until
it snaps in place to hold the battery carrier firmly in place.
6. Install the cover.
7. Slide the server into the rack.
8. Reconnect the power cords and all external cables, and turn on the server and
peripheral devices.
Note: In the U. S., call 1-800-IBM-4333 for information about battery disposal.
v If you replace the original lithium battery with a heavy-metal battery or a battery
with heavy-metal components, be aware of the following environmental
consideration. Batteries and accumulators that contain heavy metals must not be
disposed of with normal domestic waste. They will be taken back free of charge
by the manufacturer, distributor, or representative, to be recycled or disposed of
in a proper manner.
v To order replacement batteries, call 1-800-IBM-SERV within the United States,
and 1-800-465-7999 or 1-800-465-6666 within Canada. Outside the U.S. and
Canada, call your support center or business partner.
Note: After you replace the battery, you must reconfigure the server and reset
the system date and time.
246 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Statement 2:
CAUTION:
When replacing the lithium battery, use only IBM Part Number 33F8354 or
an equivalent type battery recommended by the manufacturer. If your
system has a module containing a lithium battery, replace it only with the
same module type made by the same manufacturer. The battery contains
lithium and can explode if not properly used, handled, or disposed of.
Do not:
– Throw or immerse into water
– Heat to more than 100°C (212°F)
– Repair or disassemble
1. Read the safety information that begins on page vii and “Installation guidelines”
on page 187.
2. Turn off the server and peripheral devices and disconnect all power cords; then,
remove the cover (see “Removing the cover” on page 195).
3. If necessary, lift the riser-card assembly out of the way (see “Removing an
adapter” on page 200).
c. Use your thumb and index finger to lift the battery from the socket.
5. Dispose of the battery as required by local ordinances or regulations. See the
IBM Environmental Notices and User's Guide on the IBM System x
Documentation CD for more information.
248 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Statement 2:
CAUTION:
When replacing the lithium battery, use only IBM Part Number 33F8354 or an
equivalent type battery recommended by the manufacturer. If your system has
a module containing a lithium battery, replace it only with the same module
type made by the same manufacturer. The battery contains lithium and can
explode if not properly used, handled, or disposed of.
Do not:
v Throw or immerse into water
v Heat to more than 100°C (212°F)
v Repair or disassemble
b. Press the battery down into the socket until it clicks into place. Make sure
that the battery clip holds the battery securely.
c. If you removed a rubber cover from the battery holder, use your fingers to
install the battery cover on top of the battery connector.
3. Install the cover (see “Installing the cover” on page 195).
4. Slide the server into the rack.
5. Reconnect the external cables; then, reconnect the power cords and turn on the
peripheral devices and the server.
6. Start the Setup utility and reset the configuration.
v Set the system date and time.
v Set the power-on password.
v Reconfigure the server.
See “Using the Setup utility” on page 285 for details.
The illustrations in this document might differ slightly from your hardware.
Video connector
screws
Bezel Bezel
250 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Bezel Bezel
Bezel
Bezel
252 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Attention: You must press the connector release tab in order to disconnect
the DVD cable from the system board. Do not disconnect the DVD cable by
using excessive force.
7. From the front of the server, grasp the cable connector latch and slide it up
toward the fan bracket; then, press the spring latch (right of the cable
connector) and slide the cable connector to the right.
Spring latch
on DVD
Cable drive cage
connector
latch
8. Remove the cable from the optical drive cage connector and set it aside.
9. If you are instructed to return the DVD drive, follow all packaging instructions,
and use any packaging materials for shipping that are supplied to you.
5. Reinstall the DVD drive (see “Installing an optional DVD drive” on page 211).
254 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
6. Reinstall the fans (see “Installing a hot-swap fan assembly” on page 237).
7. Replace the cover (see “Installing the cover” on page 195).
8. Slide the server into the rack.
9. Reconnect the power cord and any cables that you removed.
10. Turn on the peripheral devices and the server.
Bezel
The SAS/SATA 4 Pac HDD option kit contains the following components:
v One 2.5-inch hard disk drive backplane (with bracket)
v Four drive bay filler panels
v One SAS signal cable
v One 4 Pac hard disk drive cage and screws
v One bezel (depending on the option you purchase)
Bezel
Note: When you disconnect the power source from the server, you lose the
ability to view the LEDs because the LEDs are not lit when the power source is
removed. Before you disconnect the power source, make a note of which LEDs
are lit, including the LEDs that are lit on the operation information panel, on the
light path diagnostics panel, and LEDs inside the server on the system board;
then, see “Light path diagnostics LEDs” on page 122 for information on how to
solve the problem.
3. Remove the cover (see “Removing the cover” on page 195).
4. If an adapter is installed in the PCI riser-card assembly, disconnect any cables
that are connected to the adapter.
5. Grasp the rear of the PCI riser-card assembly from the rear and lift it out of the
PCI riser-card slot on the system board.
256 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
PCI riser-card assembly
(full-height, half-length adapters)
PCI riser
connector 2
PCI riser
connector 1
6. Remove the adapter, if one is present, from the PCI riser-card assembly.
7. Set the adapter and PCI riser-card assembly aside.
PCI riser
connector 2
PCI riser
connector 1
6. If you are instructed to return the PCI bracket, follow all packaging instructions,
and use any packaging materials for shipping that are supplied to you.
258 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
1. Read the safety information that begins on page vii and the “Installation
guidelines” on page 187.
2. Turn off the server and peripheral devices and disconnect the power cords and
all external cables.
3. Remove the cover (see “Removing the cover” on page 195).
4. Align the holes on the PCI riser card and the PCI bracket and install the screw
that attaches the PCI riser card to the PCI bracket.
Note: You can also choose to disconnect the cables after removing the
backplane from the brackets, if that is easier for you.
6. Lift the hard disk drive backplane up to remove it from the server.
Backplane
bracket slots
7. If you are instructed to return the hard disk drive backplane, follow all packaging
instructions, and use any packaging materials for shipping that are supplied to
you.
Note: You can reconnect the cables to the hard disk drive backplane before
installing the backplane onto the brackets or you can connect the cables after
you install the backplane, if that is easier for you.
260 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Hard disk drive
backplane
Backplane
bracket slots
Note: You can also choose to disconnect the cables after you remove the
backplate from the brackets, if that is easier for you.
6. Lift the hard disk drive backplate up to disengage the backplate assembly from
the backplane bracket and remove it from the server.
262 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
3. Insert the hard disk drive backplate into the slots on the backplate bracket and
push the hard disk drive backplate assembly down until the backplate is seated
firmly.
4. Connect the power and signal cables to the system board. Route the power
cable from the drive backplate through the hole on the right of the blue adapter
retention bracket and connect it to the Simple-swap SATA power connector
(see “System-board internal connectors” on page 16); then, route the signal
cable from the drive backplate over the blue adapter retention bracket and
connect it to the Simple-swap SATA signal connector, as shown in the
following illustration.
Operator information
panel
5. From the front of the server, carefully pull the assembly out of the server while
you move it slightly from side to side.
6. If you are instructed to return the operator information panel assembly, follow all
packaging instructions, and use any packaging materials for shipping that are
supplied to you.
Release tab
Operator information
panel
The following illustration shows the cable routing for the operator information
panel. The following notes describe additional information when you install the
cable:
v You may remove the optional optical drive cable to obtain more room before
you install or remove the operator information panel cable.
264 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v To connect the operator information panel cable on the system board, press
evenly on the cable. Pressing on one side of the cable might cause damage
to the cable or connector.
v The operation information panel cable should go in between the Video/USB
cable (on the bottom) and the CD/DVD cable (on the top) when all three
cables are installed in the server.
The illustrations in this document might differ slightly from the hardware.
Alignment
tabs
5. Disconnect the hard disk drive backplane cables from the connector on the front
of the safety cover.
6. Slide the safety cover forward to disengage it from the system board, then lift it
out of the server.
7. If you are instructed to return the safety cover, follow all packaging instructions,
and use any packaging materials for shipping that are supplied to you.
266 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Screw
Safety cover
Alignment
tabs
3. Slide the safety cover toward the back of the server until it is secure.
4. Connect the hard disk drive backplane cables into the connector in front of the
safety cover.
5. Install the screw to secure the safety cover.
6. Install the SAS/SATA riser-card assembly (see“Installing the SAS/SATA RAID
riser-card assembly” on page 204).
7. Install the cover (see “Installing the cover” on page 195).
8. Slide the server into the rack.
9. Reconnect the power cords and any cables that you removed.
10. Turn on the peripheral devices and the server.
8. Locate the microprocessor installation tool that comes with the new
microprocessor.
9. Twist the handle on the microprocessor tool counterclockwise so that it is in
the open position.
Handle
Installation tool
10. Align the installation tool with the alignment pins on the microprocessor socket
and lower the tool down over the microprocessor.
268 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Installation tool
Alignment pins
Microprocessor
11. Twist the handle on the installation tool clockwise and lift the microprocessor
out of the socket.
Handle
Installation tool
12. Carefully lift the microprocessor straight up and out of the socket, and place it
on a static-protective surface. Remove the microprocessor from the installation
tool by twisting the handle counterclockwise.
13. If you do not intend to install a microprocessor in the socket, install the socket
dust cover that you removed in step 2b on page 270 on the socket.
Attention: The pins on the socket are fragile. Any damage to the pins may
require replacing the system board.
14. If you are instructed to return the microprocessor, follow all packaging
instructions, and use any packaging materials for shipping that are supplied to
you.
Note: Removing the heat sink from the microprocessor destroys the even
distribution of the thermal grease and requires replacing the thermal grease.
v To order an additional optional microprocessor, contact your IBM marketing
representative or authorized reseller.
To install an additional microprocessor and heat sink, complete the following steps:
270 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Attention: When you handle static-sensitive devices, take precautions to
avoid damage from static electricity. For details about handling these
devices, see “Handling static-sensitive devices” on page 189.
c. Touch the static-protective package that contains the new microprocessor to
any unpainted metal surface on the server; then, remove the microprocessor
from the package.
Attention:
v Do not touch the microprocessor contacts; handle the microprocessor by
the edges only. Contaminants on the microprocessor contacts, such as oil
from your skin, can cause connection failures between the contacts and
the socket.
v Handle the microprocessor carefully. Dropping the microprocessor during
installation or removal can damage the contacts.
v Do not use excessive force when you press the microprocessor into the
socket.
v Make sure that the microprocessor is oriented and aligned and positioned
in the socket before you try to close the lever.
d. If there is a plastic protective cover on the bottom of the microprocessor,
carefully remove it.
Protective
cover
Microprocessor
e. Locate the microprocessor installation tool that comes with the new
microprocessor.
f. Twist the handle of the installation tool counterclockwise so that it is in the
open position.
Handle
Installation tool
g. Align the microprocessor alignment slots with the alignment pins on the
microprocessor installation tool and place the microprocessor on the
underside of the tool so that the tool can grasp the microprocessor correctly.
Installation tool
Alignment
pin slots
Alignment
pins
Handle
Installation tool
Installation tool
Microprocessor
Alignment pins
272 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Handle
Installation tool
d. Rotate the heat-sink release lever to the closed position and hook it
underneath the lock tab.
4. If you installed the second microprocessor, install the sixth fan (see “Installing a
hot-swap fan assembly” on page 237).
5. If you removed the microprocessor 2 air baffle, install it (see “Installing the
microprocessor 2 air baffle” on page 197).
Thermal grease
The thermal grease must be replaced whenever the heat sink has been removed
from the top of the microprocessor and is going to be reused or when debris is
found in the grease.
When you are installing the heat sink on the same microprocessor that is was
removed from, make sure that the following requirements are met:
v The thermal grease on the heat sink and microprocessor is not contaminated.
v Additional thermal grease is not added to the existing thermal grease on the heat
sink and microprocessor.
Note:
Microprocessor
Note: If the grease is properly applied, approximately half of the grease will
remain in the syringe.
6. Install the heat sink onto the microprocessor as described in “Installing a
microprocessor and heat sink” on page 269.
274 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Attention: When you remove a microprocessor and heat sink, be sure to
keep each heat sink with its microprocessor for reinstallation.
6. Use a screwdriver and remove the four screws that secure the retention module
to the system board; then, lift the retention module from the system board.
7. If you are instructed to return the heat sink retention module, follow all the
packaging instructions, and use any packaging materials for shipping that are
supplied to you.
4. Reinstall the microprocessor and heat sink (see “Installing a microprocessor and
heat sink” on page 269).
5. Reinstall the air baffle.
6. Install the cover (see “Installing the cover” on page 195).
7. Slide the server into the rack.
8. Reconnect the power cords and any cables that you removed.
9. Turn on the peripheral devices and the server.
Note: When you replace the system board, you must either update the server
with the latest firmware or restore the pre-existing firmware that the customer
provides on a diskette or CD image. Make sure that you have the latest
firmware or a copy of the pre-existing firmware before you proceed.
4. Remove all PCI riser-card assemblies and adapters (see “Removing a PCI
riser-card assembly” on page 256 and “Removing an adapter” on page 200).
5. Remove the SAS/SATA RAID riser-card assembly (see “Removing the
SAS/SATA RAID riser-card assembly” on page 203).
6. Remove the microprocessor 2 and DIMM air baffles from the system board
(see “Removing the microprocessor 2 air baffle” on page 196 and “Removing
the DIMM air baffle” on page 198).
7. (Trained service technician only) Remove all heat sinks and microprocessors,
and set them aside on a static-protective surface for reinstallation (see
“Removing a microprocessor and heat sink” on page 267).
8. Remove the memory modules and set them aside on a static-protective
surface for reinstallation (see “Removing a memory module” on page 215).
Note: Make a note of the location of each DIMM as you remove it, so that you
can later reinstall it in the same connector.
9. Remove the virtual media key from the system board and set it aside. You will
have to install the virtual media key on the new system board.
10. Remove the system battery (see “Removing the system battery” on page 246).
11. Disconnect all cables from the system board. Make a list of each cable as you
disconnect it; you can then use this as a checklist when you install the new
system board.
12. Remove the hot-swap fans (see “Removing a hot-swap fan assembly” on page
235).
13. Press the fan brackets release latches (one is to the left of fan 6 and one is to
the right of fan 1) inward toward the fans and rotate the fan assembly brackets
up toward the front of the server.
14. Grasp the system board handle and lift up the right side of the system board
slightly so that it disengages from the locator pin; then, slide the system board
slightly toward the left of the server.
276 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
15. Lift up the system board and carefully remove it from the server, being careful
not to damage any surrounding components.
16. Remove the socket dust covers from the microprocessor sockets on the new
system board and place them on the microprocessor sockets of the old system
board that you are removing.
17. If you are instructed to return the system board, follow all packaging
instructions, and use any packaging materials for shipping that are supplied to
you.
Important: Some cluster solutions require specific code levels or coordinated code
updates. If the device is part of a cluster solution, verify that the latest level of code
is supported for the cluster solution before you update the code
3. Grasp the fan assembly brackets and rotate them down toward the chassis.
4. Reinstall the hot-swap fans (see “Installing a hot-swap fan assembly” on page
237).
5. (Trained service technician only) Reinstall the microprocessor and heat sink
(see “Installing a microprocessor and heat sink” on page 269).
6. Reinstall the SAS/SATA RAID riser-card assembly base (see “Installing the
SAS/SATA RAID riser-card assembly” on page 204).
7. Reinstall the DIMMs (see “Installing a memory module” on page 216).
8. Reinstall the system battery (see “Installing the system battery” on page 248).
9. Reinstall the microprocessor 2 and DIMM air baffles (see “Installing the
microprocessor 2 air baffle” on page 197 and “Installing the DIMM air baffle” on
page 199).
10. Reinstall the PCI riser-card assemblies and adapters, if any were installed (see
“Installing a PCI riser-card assembly” on page 257 and “Installing an adapter”
on page 201).
11. Reinstall the virtual media key (see “Installing the virtual media key” on page
238).
12. Reconnect to the system board the cables that you disconnected in step 11 of
“Removing the system board” on page 275.
13. Install the cover (see “Installing the cover” on page 195).
14. Slide the server into the rack.
15. Reconnect the power cords and any cables that you removed.
16. Turn on the peripheral devices and the server.
278 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Important: Perform the following updates:
v Start the Setup utility and reset the configuration.
– Set the system date and time.
– Set the power-on password.
– Reconfigure the server.
See “Using the Setup utility” on page 285 for details.
v Either update the server with the latest RAID firmware or restore the pre-existing
firmware from a diskette or CD image.
v Update the UUID (see “Updating the Universal Unique Identifier (UUID)” on page
298).
v Update the DMI/SMBIOS (see “Updating the DMI/SMBIOS data” on page 300).
You can install code updates that are packaged as an UpdateXpress System Pack
or UpdateXpress CD image. An UpdateXpress System Pack contains an
integration-tested bundle of online firmware and device-driver updates for your
server. Use UpdateXpress System Pack Installer to acquire and apply
UpdateXpress System Packs and individual firmware and device-driver updates. For
additional information and to download the UpdateXpress System Pack Installer, go
to the ToolsCenter for System x and BladeCenter at [Link]
infocenter/toolsctr/v1r0/[Link] and click UpdateXpress System Pack Installer.
When you click an update, an information page is displayed, including a list of the
problems that the update fixes. Review this list for your specific problem; however,
even if your problem is not listed, installing the update might solve the problem.
Be sure to separately install any listed critical updates that have release dates that
are later than the release date of the UpdateXpress System Pack or UpdateXpress
image.
The firmware for the server is periodically updated and is available for download
from the IBM website. To check for the latest level of firmware, such as UEFI
firmware, vital product data (VPD) code, device drivers, and integrated management
module firmware, go to [Link] .
Attention: Before you update the firmware, be sure to back up any data that is
stored in the Trusted Platform Module (TPM), in case any of the TPM
characteristics are changed by the new firmware. For instructions, see your
encryption software documentation.
Download the latest firmware for the server; then, install the firmware, using the
instructions that are included with the downloaded files.
When you replace a device in the server, you might have to either update the
firmware that is stored in memory on the device or restore the pre-existing firmware
from a diskette or CD image.
v UEFI firmware is stored in ROM on the system board.
v IMM firmware is stored in ROM on the IMM on the system board.
v Ethernet firmware is stored in ROM on the Ethernet controller.
v ServeRAID firmware is stored in ROM on the ServeRAID adapter.
v SATA firmware is stored in ROM on the integrated SATA controller.
v SAS/SATA firmware is stored in ROM on the SAS/SATA controller on the system
board.
In addition to the ServerGuide Setup and Installation CD, you can use the following
configuration programs to customize the server hardware:
v Setup utility
The Setup utility is part of the basic input/output system firmware. Use it to
change interrupt request (IRQ) settings, change the startup-device sequence, set
the date and time, and set passwords. For information about using this program,
see “Using the Setup utility” on page 285.
v Boot Manager program
The Boot Manager program is part of the server firmware. Use it to override the
startup sequence that is set in the Setup utility and temporarily assign a device to
be first in the startup sequence. For more information about using this program,
see “Using the Boot Manager program” on page 291.
v Integrated Management Module
Use the integrated management module (IMM) for configuration, to update the
firmware and sensor data record/field replaceable unit (SDR/FRU) data, and to
remotely manage a network. For information about using these programs, see
“Using the integrated management module” on page 291.
v VMware ESXi embedded hypervisor
The VMware ESXi embedded hypervisor is available on the server models that
come with an installed USB embedded hypervisor flash device. The USB flash
device is installed in the USB connector on the SAS/SATA RAID riser-card.
Hypervisor is virtualization software that enables multiple operating systems to
run on a host system at the same time. For more information about using the
embedded hypervisor, see “Using the embedded hypervisor” on page 294.
v Remote presence capability and blue-screen capture
The remote presence and blue-screen capture feature are integrated into the
Integrated Management Module (IMM). The virtual media key is required to
enable the remote presence functions. When the optional virtual media key is
installed in the server, it activates the remote presence functions. Without the
virtual media key, you will not be able to access the network remotely to mount
or unmount drives or images on the client system. However, you will still be able
to access the host graphical user interface through the Web interface without the
Virtual Media Key. You can order the optional IBM Virtual Media Key, if one did
not come with your server. For more information about how to enable the remote
presence function, see “Using the remote presence capability and blue-screen
capture” on page 292.
v Ethernet controller configuration
For information about configuring the Ethernet controller, see “Configuring the
Gigabit Ethernet controller” on page 295.
v IBM Advanced Settings Utility (ASU) program
Use this program as an alternative to the Setup utility for modifying UEFI
settings. Use the ASU program online or out of band to modify UEFI settings
282 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
from the command line without the need to restart the server to access the Setup
utility. For more information about using this program, see “IBM Advanced
Settings Utility program” on page 297.
v LSI Configuration Utility program
Use the LSI Configuration Utility program to configure the integrated SAS/SATA
controller with RAID capabilities and the devices that are attached to it. For
information about using this program, see “Using the LSI Configuration Utility
program” on page 295.
The following table lists the different server configurations and the applications
that are available for configuring and managing RAID arrays.
Table 19. Server configuration and applications for configuring and managing RAID arrays
RAID array configuration RAID array management
(before operating system is (after operating system is
Server configuration installed) installed)
ServeRAID-BR10i adapter LSI Utility (Setup utility, press MegaRAID Storage Manager
(LSI 1068E) Ctrl+C), ServerGuide (for monitoring storage only)
ServeRAID-BR10il v2 LSI Utility (Setup utility, press MegaRAID Storage Manager
adapter (LSI 1064E) Ctrl+C), ServerGuide (MSM), Director
ServeRAID-MR10i adapter MegaRAID BIOS MegaRAID Storage Manager
(LSI 1078) Configuration Utility (press (MSM), Director
Ctrl+H to start), ServerGuide
ServeRAID-M5014 adapter MegaCLI (Command Line MegaRAID Storage Manager
(LSI SAS2108) Interface), ServerGuide (MSM), Director
ServeRAID-M5015 adapter MegaCLI (Command Line MegaRAID Storage Manager
(LSI SAS2108) Interface), ServerGuide (MSM), Director
ServeRAID-M1015 adapter MegaCLI (Command Line MegaRAID Storage Manager
(LSI SAS2008) Interface), ServerGuide (MSM), Director
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
Note: Features and functions can vary slightly with different versions of the
ServerGuide program.
When you start the ServerGuide Setup and Installation CD, the program prompts
you to complete the following tasks:
v Select your language.
v Select your keyboard layout and country.
v View the overview to learn about ServerGuide features.
v View the readme file to review installation tips for your operating system and
adapter.
v Start the operating-system installation. You will need your operating-system CD.
284 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Typical operating-system installation
The ServerGuide program can reduce the time it takes to install an operating
system. It provides the device drivers that are required for your hardware and for
the operating system that you are installing. This section describes a typical
ServerGuide operating-system installation.
Note: Features and functions can vary slightly with different versions of the
ServerGuide program.
1. After you have completed the setup process, the operating-system installation
program starts. (You will need your operating-system CD to complete the
installation.)
2. The ServerGuide program stores information about the server model, service
processor, hard disk drive controllers, and network adapters. Then, the program
checks the CD for newer device drivers. This information is stored and then
passed to the operating-system installation program.
3. The ServerGuide program presents operating-system partition options that are
based on your operating-system selection and the installed hard disk drives.
4. The ServerGuide program prompts you to insert your operating-system CD and
restart the server. At this point, the installation program for the operating system
takes control to complete the installation.
286 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Select this choice to enable or disable UEFI to interact with PCI mass
storage devices that are non-UEFI compliant.
– Integrated Management Module
Select this choice to view or change the settings for the integrated
management module.
- POST Watchdog Timer
Select this choice to view or enable the POST watchdog timer.
- POST Watchdog Timer Value
Select this choice to view or set the POST loader watchdog timer value.
- Reboot System on NMI
Enable or disable restarting the system whenever a nonmaskable interrupt
(NMI) occurs. Disabled is the default.
- Commands on USB Interface Preference
Select this choice to enable or disable the Ethernet over USB interface on
IMM.
- Network Configuration
Select this choice to view the system management network interface port,
the IMM MAC address, the current IMM IP address, and host name; define
the static IMM IP address, subnet mask, and gateway address, specify
whether to use the static IP address or have DHCP assign the IMM IP
address, save the network changes, and reset the IMM.
- Reset IMM to Defaults
Select this choice to view or reset IMM to the default settings.
- Reset IMM
Select this choice to reset IMM.
– System Security
Select this choice to view or configure Trusted Platform Module (TPM)
support.
– Adapters and UEFI Drivers
Select this choice to view information about the UEFI 1.10 and UEFI 2.0
compliant adapters and drivers installed in the server.
– Video
Select this choice to view or configure the video device options.
Note: The configuration forms for UEFI 2.1 and greater compliant add-on
video devices might be located here.
v Date and Time
Select this choice to set the date and time in the server, in 24-hour format
(hour:minute:second).
This choice is on the full Setup utility menu only.
v Start Options
Select this choice to view or change the start options, including the startup
sequence, keyboard NumLock state, PXE boot option, and PCI device boot
priority. Changes in the startup options take effect when you start the server.
The startup sequence specifies the order in which the server checks devices to
find a boot record. The server starts from the first boot record that it finds. If the
server has Wake on LAN hardware and software and the operating system
supports Wake on LAN functions, you can specify a startup sequence for the
288 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Restore Settings
Select this choice to cancel the changes that you have made in the settings and
restore the previous settings.
v Load Default Settings
Select this choice to cancel the changes that you have made in the settings and
restore the factory settings.
v Exit Setup
Select this choice to exit from the Setup utility. If you have not saved the
changes that you have made in the settings, you are asked whether you want to
save the changes or exit without saving them.
Passwords
From the User Security menu choice, you can set, change, and delete a power-on
password and an administrator password. The User Security choice is on the full
Setup utility menu only.
If you set only a power-on password, you must type the power-on password to
complete the system startup and to have access to the full Setup utility menu.
If you set a power-on password for a user and an administrator password for a
system administrator, you must type the power-on password to complete the system
startup. A system administrator who types the administrator password has access to
the full Setup utility menu; the system administrator can give the user authority to
set, change, and delete the power-on password. A user who types the power-on
password has access to only the limited Setup utility menu; the user can set,
change, and delete the power-on password, if the system administrator has given
the user that authority.
Power-on password: If a power-on password is set, when you turn on the server,
you must type the power-on password to complete the system startup. You can use
any combination of 6 - 20 printable ASCII characters for the password.
When a power-on password is set, you can enable the Unattended Start mode, in
which the keyboard and mouse remain locked but the operating system can start.
You can unlock the keyboard and mouse by typing the power-on password.
If you forget the power-on password, you can regain access to the server in any of
the following ways:
v If an administrator password is set, type the administrator password at the
password prompt. Start the Setup utility and reset the power-on password.
v Remove the battery from the server and then reinstall it. See “Removing the
system battery” on page 246 for instructions on removing the battery.
v Change the position of the power-on password switch (enable switch 1 of the
system board switch block (SW4) to bypass the power-on password check (see
Table 4 on page 19 for more information).
Attention: If you set an administrator password and then forget it, there is no way
to change, override, or remove it. You must replace the system board.
290 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Using the Boot Manager program
The Boot Manager program is a built-in, menu-driven configuration utility program
that you can use to temporarily redefine the first startup device without changing
settings in the Setup utility.
The next time the server starts, it returns to the startup sequence that is set in the
Setup utility.
To force the server to start from the backup copy of the server firmware, turn off the
server; then, place the J29 jumper in the backup position (pins 2 and 3).
Use the backup copy of the server firmware until the primary copy is restored. After
the primary copy is restored, turn off the server; then, move the J29 jumper back to
the primary position (pins 1 and 2).
The IMM also provides the following remote server management capabilities
through the OSA SMBridge management utility program:
v Command-line interface (IPMI Shell)
The command-line interface provides direct access to server management
functions through the IPMI 2.0 protocol. Use the command-line interface to issue
commands to control the server power, view system information, and identify the
server. You can also save one or more commands as a text file and run the file
as a script.
v Serial over LAN
Establish a Serial over LAN (SOL) connection to manage servers from a remote
location. You can remotely view and change the UEFI settings, restart the server,
identify the server, and perform other management functions. Any standard Telnet
client application can access the SOL connection.
After the virtual media key is installed in the server, it is authenticated to determine
whether it is valid. If the key is not valid, you receive a message from the Web
292 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
interface (when you attempt to start the remote presence feature) indicating that the
hardware key is required to use the remote presence feature.
The virtual media key has an LED. When this LED is lit and green, it indicates that
the key is installed and functioning correctly. When the LED is not lit, it indicates
that the key might not be installed correctly.
The blue-screen capture feature captures the video display contents before the IMM
restarts the server when the IMM detects an operating-system hang condition. A
system administrator can use the blue-screen capture to assist in determining the
cause of the hang condition.
Note: The IMM defaults to DHCP. If a DHCP host is not available, the IMM
assigns a static IP address of [Link].
2. On the Login page, type the user name and password. If you are using the IMM
for the first time, you can obtain the user name and password from your system
administrator. All login attempts are documented in the event log.
Note: The IMM is set initially with a user name of USERID and password of
PASSW0RD (passw0rd with a zero, not the letter O). You have read/write
access. You must change the default password the first time you log on.
3. On the Welcome page, type a timeout value (in minutes) in the field that is
provided. The IMM will log you off of the Web interface if your browser is
inactive for the number of minutes that you entered for the timeout value.
4. Click Continue to start the session. The System Health page provides a quick
view of the system status.
To start using the embedded hypervisor functions, you must add the USB flash
device to the boot order in the Setup utility.
To add the USB flash device to the boot order, complete the following steps:
1. Turn on the server.
294 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
5. Select Change Boot Order and then select Commit Changes; then, press
Enter.
6. Select Save Settings and then select Exit Setup.
If the embedded hypervisor flash device image becomes corrupt, you can use the
VMware Recovery CD to recover the flash device image. To recover the flash
device image, complete the following steps:
1. Turn on the server.
For additional information and instructions, see the ESXi Embedded and vCenter
Server Setup Guide at [Link]
vsp_40_u1_esxi_e_vc_setup_guide.pdf.
You do not have to set any jumpers or configure the controllers. However, you must
install a device driver to enable the operating system to address the controllers.
To find device drivers and information about configuring the Ethernet controllers, go
to [Link]
The integrated SAS/SATA controller with RAID capabilities supports RAID arrays.
You can use the LSI Configuration Utility program to configure RAID 1 (IM), RAID
1E (IME), and RAID 0 (IS) for a single pair of attached devices. If you install the
optional ServeRAID-MR10i SAS/SATA controller, it provides RAID levels 0, 1, 5, 6,
When you are using the LSI Configuration Utility program to configure and manage
arrays, consider the following information:
v The integrated SAS/SATA controller with RAID capabilities supports the following
features:
– Integrated Mirroring (IM) with hot-spare support (also known as RAID 1)
Use this option to create an integrated array of two disks plus up to two
optional hot spares. All data on the primary disk can be migrated.
– Integrated Mirroring Enhanced (IME) with hot-spare support (also known as
RAID 1E)
Use this option to create an integrated mirror enhanced array of three to eight
disks, including up to two optional hot spares. All data on the array disks will
be deleted.
– Integrated Striping (IS) (also known as RAID 0)
Use this option to create an integrated striping array of two to eight disks. All
data on the array disks will be deleted.
v Hard disk drive capacities affect how you create arrays. The drives in an array
can have different capacities, but the RAID controller treats them as if they all
have the capacity of the smallest hard disk drive.
v If you use an integrated SAS/SATA controller with RAID capabilities to configure
a RAID 1 (mirrored) array after you have installed the operating system, you will
lose access to any data or applications that were previously stored on the
secondary drive of the mirrored pair.
v If you install a different type of RAID controller, see the documentation that
comes with the controller for information about viewing and changing settings for
attached devices.
When you have finished changing settings, press Esc to exit from the program;
select Save to save the settings that you have changed.
296 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Formatting a hard disk drive
Low-level formatting removes all data from the hard disk. If there is data on the disk
that you want to save, back up the hard disk before you perform this procedure.
Note: Before you format a hard disk, make sure that the disk is not part of a
mirrored pair.
You can also use the ASU program to configure the optional remote presence
features or other IMM settings. The remote presence features provide enhanced
systems-management capabilities.
In addition, the ASU program provides limited settings for configuring the IPMI
function in the IMM through the command-line interface.
Use the command-line interface to issue setup commands. You can save any of the
settings as a file and run the file as a script. The ASU program supports scripting
environments through a batch-processing mode.
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
If your management server is not connected to the Internet, to locate and install
updates and interim fixes, complete the following steps:
1. Make sure that you have run the Discovery and Inventory collection tasks.
2. On a system that is connected to the Internet, go to [Link]
eserver/support/fixes/fixcentral/.
3. From the Product family list, select IBM Systems Director.
4. From the Product list, select IBM Systems Director.
5. From the Installed version list, select the latest version, and click Continue.
6. Download the available updates.
7. Copy the downloaded files to the management server.
8. On the management server, on the Welcome page of the IBM Systems
Director Web interface, click the Manage tab, and click Update Manager.
9. Click Import updates and specify the location of the downloaded files that you
copied to the management server.
10. Return to the Welcome page of the Web interface, and click View updates.
11. Select the updates that you want to install, and click Install to start the
installation wizard.
298 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Remote access to the target system (LAN based)
v Bootable media containing ASU (LAN or KCS, depending upon the bootable
media)
2. Copy and unpack the ASU package, which also includes other required files, to
the server. Make sure that you unpack the ASU and the required files to the
same directory. In addition to the application executable (asu or asu64), the
following files are required:
v For Windows based operating systems:
– ibm_rndis_server_os.inf
– [Link]
v For Linux based operating systems:
– cdc_interface.sh
3. After you install ASU, use the following command syntax to set the UUID:
asu set SYSTEM_PROD_DATA.SysInfoUUID <uuid_value> [access_method]
Where:
<uuid_value>
Up to 16-byte hexadecimal value assigned by you.
[access_method]
The access method that you selected to use from the following
methods:
v Online authenticated LAN access, type the command:
[host <imm_internal_ip>] [user <imm_user_id>][password
<imm_password>]
Where:
imm_internal_ip
The IMM internal LAN/USB IP address. The default value is
[Link].
imm_user_id
The IMM account (1 of 12 accounts). The default value is USERID.
imm_password
The IMM account password (1 of 12 accounts). The default value is
PASSW0RD (with a zero 0 not an O).
Note: If you do not specify any of these parameters, ASU will use the default
values. When the default values are used and ASU is unable to access the
IMM using the online authenticated LAN access method, ASU will
automatically use the unauthenticated KCS access method.
The following commands are examples of using the userid and password
default values and not using the default values:
Example that does not use the userid and password default values:
asu set SYSTEM_PROD_DATA.SysInfoUUID <uuid_value> --user <user_id>
--password <password>
Example that does use the userid and password default values:
asu set SYSTEM_PROD_DATA.SysInfoUUID <uuid_value>
v Online KCS access (unauthenticated and user restricted):
You do not need to specify a value for access_method when you use this
access method.
Note: When using the remote LAN access method to access IMM using the
LAN from a client, the host and the imm_external_ip address are required
parameters.
host <imm_external_ip> [user <imm_user_id>][password <imm_password>]
Where:
imm_external_ip
The external IMM LAN IP address. There is no default value. This
parameter is required.
imm_user_id
The IMM account (1 of 12 accounts). The default value is USERID.
imm_password
The IMM account password (1 of 12 accounts). The default value is
PASSW0RD (with a zero 0 not an O).
The following commands are examples of using the userid and password
default values and not using the default values:
Example that does not use the userid and password default values:
asu set SYSTEM_PROD_DATA.SysInfoUUID <uuid_value> --host <imm_ip>
--user <user_id> --password <password>
Example that does use the userid and password default values:
asu set SYSTEM_PROD_DATA.SysInfoUUID <uuid_value> --host <imm_ip>
v Bootable media:
You can also build a bootable media using the applications available through
the Tools Center website at [Link]
v1r0/[Link]. From the left pane, click IBM System x and BladeCenter
Tools Center, then click Tool reference for the available tools.
4. Restart the server.
300 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
v Bootable media containing ASU (LAN or KCS, depending upon the bootable
media)
2. Copy and unpack the ASU package, which also includes other required files, to
the server. Make sure that you unpack the ASU and the required files to the
same directory. In addition to the application executable (asu or asu64), the
following files are required:
v For Windows based operating systems:
– ibm_rndis_server_os.inf
– [Link]
v For Linux based operating systems:
– cdc_interface.sh
3. After you install ASU, Type the following commands to set the DMI:
Note: If you do not specify any of these parameters, ASU will use the default
values. When the default values are used and ASU is unable to access the
IMM using the online authenticated LAN access method, ASU will
automatically use the following unauthenticated KCS access method.
Examples that do not use the userid and password default values:
asu set SYSTEM_PROD_DATA.SysInfoProdName <m/t_model>
--user <imm_user_id> --password <imm_password>
asu set SYSTEM_PROD_DATA.SysInfoProdIdentifier <system model>
--user <imm_user_id> --password <imm_password>
asu set SYSTEM_PROD_DATA.SysInfoSerialNum <s/n> --user <imm_user_id>
--password <imm_password>
asu set SYSTEM_PROD_DATA.SysEncloseAssetTag <asset_tag>
--user <imm_user_id> --password <imm_password>
Examples that do not use the userid and password default values:
asu set SYSTEM_PROD_DATA.SysInfoProdName <m/t_model>
asu set SYSTEM_PROD_DATA.SysInfoProdIdentifier <system model>
asu set SYSTEM_PROD_DATA.SysInfoSerialNum <s/n>
asu set SYSTEM_PROD_DATA.SysEncloseAssetTag <asset_tag>
v Remote LAN access, type the command:
Note: When using the remote LAN access method to access IMM using the
LAN from a client, the host and the imm_external_ip address are required
parameters.
host <imm_external_ip> [user <imm_user_id>][password <imm_password>]
Where:
imm_external_ip
The external IMM LAN IP address. There is no default value. This
parameter is required.
imm_user_id
The IMM account (1 of 12 accounts). The default value is USERID.
imm_password
The IMM account password (1 of 12 accounts). The default value is
PASSW0RD (with a zero 0 not an O).
The following commands are examples of using the userid and password
default values and not using the default values:
302 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Examples that do not use the userid and password default values:
asu set SYSTEM_PROD_DATA.SysInfoProdName <m/t_model> --host <imm_ip>
--user <imm_user_id> --password <imm_password>
asu set SYSTEM_PROD_DATA.SysInfoProdIdentifier <system model> --host <imm_ip>
--user <imm_user_id> --password <imm_password>
asu set SYSTEM_PROD_DATA.SysInfoSerialNum <s/n> --host <imm_ip>
--user <imm_user_id> --password <imm_password>
asu set SYSTEM_PROD_DATA.SysEncloseAssetTag <asset_tag> --host <imm_ip>
--user <imm_user_id> --password <imm_password>
You can solve many problems without outside assistance by following the
troubleshooting procedures that IBM provides in the online help or in the
documentation that is provided with your IBM product. The documentation that
comes with IBM systems also describes the diagnostic tests that you can perform.
Most systems, operating systems, and programs come with documentation that
contains troubleshooting procedures and explanations of error messages and error
codes. If you suspect a software problem, see the documentation for the operating
system or program.
For more information about Support Line and other IBM services, see
[Link] or see [Link] for support
telephone numbers. In the U.S. and Canada, call 1-800-IBM-SERV
(1-800-426-7378).
In the U.S. and Canada, hardware service and support is available 24 hours a day,
7 days a week. In the U.K., these services are available Monday through Friday,
from 9 a.m. to 6 p.m.
306 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Appendix B. Notices
This information was developed for products and services offered in the U.S.A.
IBM may not offer the products, services, or features discussed in this document in
other countries. Consult your local IBM representative for information on the
products and services currently available in your area. Any reference to an IBM
product, program, or service is not intended to state or imply that only that IBM
product, program, or service may be used. Any functionally equivalent product,
program, or service that does not infringe any IBM intellectual property right may be
used instead. However, it is the user's responsibility to evaluate and verify the
operation of any non-IBM product, program, or service.
IBM may have patents or pending patent applications covering subject matter
described in this document. The furnishing of this document does not give you any
license to these patents. You can send license inquiries, in writing, to:
IBM Director of Licensing
IBM Corporation
North Castle Drive
Armonk, NY 10504-1785
U.S.A.
IBM may use or distribute any of the information you supply in any way it believes
appropriate without incurring any obligation to you.
Trademarks
IBM, the IBM logo, and [Link] are trademarks of International Business Machines
Corp., registered in many jurisdictions worldwide. Other product and service names
might be trademarks of IBM or other companies. A current list of IBM trademarks is
available on the web at “Copyright and trademark information” at
[Link]
Intel, Intel Xeon, Itanium, and Pentium are trademarks or registered trademarks of
Intel Corporation or its subsidiaries in the United States and other countries.
Java and all Java-based trademarks and logos are trademarks or registered
trademarks of Oracle and/or its affiliates.
UNIX is a registered trademark of The Open Group in the United States and other
countries.
Important notes
Processor speed indicates the internal clock speed of the microprocessor; other
factors also affect application performance.
CD or DVD drive speed is the variable read rate. Actual speeds vary and are often
less than the possible maximum.
When referring to processor storage, real and virtual storage, or channel volume,
KB stands for 1024 bytes, MB stands for 1,048,576 bytes, and GB stands for
1,073,741,824 bytes.
Maximum internal hard disk drive capacities assume the replacement of any
standard hard disk drives and population of all hard disk drive bays with the largest
currently supported drives that are available from IBM.
Some software might differ from its retail version (if available) and might not include
user manuals or all program functionality.
308 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Particulate contamination
Attention: Airborne particulates (including metal flakes or particles) and reactive
gases acting alone or in combination with other environmental factors such as
humidity or temperature might pose a risk to the server that is described in this
document. Risks that are posed by the presence of excessive particulate levels or
concentrations of harmful gases include damage that might cause the server to
malfunction or cease functioning altogether. This specification sets forth limits for
particulates and gases that are intended to avoid such damage. The limits must not
be viewed or used as definitive limits, because numerous other factors, such as
temperature or moisture content of the air, can influence the impact of particulates
or environmental corrosives and gaseous contaminant transfer. In the absence of
specific limits that are set forth in this document, you must implement practices that
maintain particulate and gas levels that are consistent with the protection of human
health and safety. If IBM determines that the levels of particulates or gases in your
environment have caused damage to the server, IBM may condition provision of
repair or replacement of servers or parts on implementation of appropriate remedial
measures to mitigate such environmental contamination. Implementation of such
remedial measures is a customer responsibility.
Table 20. Limits for particulates and gases
Contaminant Limits
Particulate v The room air must be continuously filtered with 40% atmospheric dust
spot efficiency (MERV 9) according to ASHRAE Standard 52.21.
v Air that enters a data center must be filtered to 99.97% efficiency or
greater, using high-efficiency particulate air (HEPA) filters that meet
MIL-STD-282.
v The deliquescent relative humidity of the particulate contamination
must be more than 60%2.
v The room must be free of conductive contamination such as zinc
whiskers.
Gaseous v Copper: Class G1 as per ANSI/ISA 71.04-19853
v Silver: Corrosion rate of less than 300 Å in 30 days
1
ASHRAE 52.2-2008 - Method of Testing General Ventilation Air-Cleaning Devices for
Removal Efficiency by Particle Size. Atlanta: American Society of Heating, Refrigerating and
Air-Conditioning Engineers, Inc.
2
The deliquescent relative humidity of particulate contamination is the relative humidity at
which the dust absorbs enough water to become wet and promote ionic conduction.
3
ANSI/ISA-71.04-1985. Environmental conditions for process measurement and control
systems: Airborne contaminants. Instrument Society of America, Research Triangle Park,
North Carolina, U.S.A.
Documentation format
The publications for this product are in Adobe Portable Document Format (PDF)
and should be compliant with accessibility standards. If you experience difficulties
when you use the PDF files and want to request a web-based format or accessible
PDF document for a publication, direct your mail to the following address:
Information Development
IBM Corporation
205/A015
3039 E. Cornwallis Road
P.O. Box 12195
Research Triangle Park, North Carolina 27709-2195
In the request, be sure to include the publication part number and title.
When you send information to IBM, you grant IBM a nonexclusive right to use or
distribute the information in any way it believes appropriate without incurring any
obligation to you.
Properly shielded and grounded cables and connectors must be used in order to
meet FCC emission limits. IBM is not responsible for any radio or television
interference caused by using other than recommended cables and connectors or by
unauthorized changes or modifications to this equipment. Unauthorized changes or
modifications could void the user's authority to operate the equipment.
This device complies with Part 15 of the FCC Rules. Operation is subject to the
following two conditions: (1) this device may not cause harmful interference, and (2)
this device must accept any interference received, including interference that may
cause undesired operation.
310 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
European Union EMC Directive conformance statement
This product is in conformity with the protection requirements of EU Council
Directive 2004/108/EC on the approximation of the laws of the Member States
relating to electromagnetic compatibility. IBM cannot accept responsibility for any
failure to satisfy the protection requirements resulting from a nonrecommended
modification of the product, including the fitting of non-IBM option cards.
Responsible manufacturer:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
914-499-1900
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
International Business Machines Corp.
New Orchard Road
Armonk, New York 10504
914-499-1900
Generelle Informationen:
This is a Class A product based on the standard of the Voluntary Control Council for
Interference (VCCI). If this equipment is used in a domestic environment, radio
interference may occur, in which case the user may be required to take corrective
actions.
312 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
This is electromagnetic wave compatibility equipment for business (Type A). Sellers
and users need to pay attention to it. This is for any areas other than home.
C D
cabling danger statements 6
internal routing 190 dc good LED 131
caution statements 6 dc power LED 13
checkout procedure 99 dc power supply LED errors 131
316 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
hard disk drive (continued) installing (continued)
SAS 205 battery,system 248
SCSI cover 195
See SAS DVD drive 211
simple-swap SATA 207 hot-swap ac power supply 233
hard disk drive (SATA), installing (simple-swap hypervisor flash device 230
2.5-inch) 209 memory DIMMs 216
hard disk drive activity LED 9 operator-information panel 264
hard disk drive status LED 9 optional hot-swap hard disk drive cage 255
hard disk drive, installing (hot-swap 2.5-inch) 206 optional optical drive bay 251
hard drive activity SAS/SATA 4 Pac HDD option 255
LED 10 ServeRAID adapter advanced feature key 228
hardware service and support 306 the DVD cable 253
hardware, configuring 282 the heat sink 273
heat output 7 the heat sink retention module 275
heat sink the hot-swap SAS/SATA backplane 260
installing 269 the PCI riser card bracket 258
heat sink retention module the safety cover 266
installing 275 the simple-swap SATA backplate assembly 262
removing 274 the system board 277
help, getting 305 the virtual media key 238
hot-swap integrated functions 8
fans, replacing 235 integrated management module
hot-swap ac power supply 231, 233 error messages 40
installing 233 event log 24, 25
removing 231 LED 134
hot-swap disk drive, installing 2.5-inch 206 using 291
hot-swap drive Integrated Management Module programs 282
SAS/SATA IDs 207 intermittent problems 105
hot-swap SAS/SATA backplane internal cable routing 190
installing 260 internal connectors 16
replacing 259 IP address
humidity 7 obtaining for the IMM 293
hypervisor flash device IPMI event log 24, 25
installing 230 IPMItool 25
problems 104
J
I jumper
IBM Advanced Settings Utility program UEFI boot recovery 170
overview 297 jumpers and switches
IBM Support Line 306 on the system board 18
IBM Systems Director
updating 298
IDs for SAS/SATA hot-swap drives 207 K
IDs for SATA simple-swap drives 209 key, virtual media
IMM 282, 291 installing 238
error messages 40 keyboard problems 106
IMM heartbeat
LED 134
important notices 6 L
IN OK LED 133 LED
IN OK power LED 13 ac power 13
installation guidelines 187 dc power 13
installing DVD drive activity 9
2.5-inch hot-swap hard disk drive 206 Ethernet activity 10
2.5-inch simple-swap SATA hard disk drive 209 Ethernet-link status 13
a heat sink 269 for Ethernet activity 13
a hot-swap fan 235 hard disk drive activity 9
a microprocessor 269 hard disk drive status 9
a RAID adapter remote battery 243 hard drive activity 10
Index 317
LED (continued) messages
IN OK power 13 diagnostic 134
integrated management module 134 messages, error
location 9 POST 27
OUT OK power 13 methods, for viewing event logs 25
power supply error microprocessor
rear 14 installing 269
power-on 10 problems 109
rear 14 specifications 7
system information 10 mirroring mode 219
system locator 10 model and serial number
system-error 10 location 174
rear 14 monitor problems 110
system-locator
rear 14
LED errors N
ac power supply 130 NMI button
dc power supply 131 on the light path diagnostics panel 12, 121
LED, NOS installation
enclosure management heartbeat 134 with ServerGuide 285
IMM heartbeat 134 without ServerGuide 285
LEDs, system pulse 134 notes 6
legacy operating system notes, important 308
requirement 284 notices 307
License Agreement for Machine Code 5 electronic emission 310
Licenses and Attributions Documents 5 FCC, Class A 310
light panel diagnostics panel notices and statements 6
location 9
light path diagnostics 1, 119
LEDs 122 O
panel 120 obtaining
light path diagnostics LEDs 122 the IP address for IMM 293
light path diagnostics panel online publications 170
checkpoint code display 121 online service request 3
controls and LEDs 11 online-spare mode 220
NMI button 12, 121 operating-system event log 24, 25
light path diagnostics panel display operator information panel
checkpoint codes 121 controls and LEDs 10
Linux license agreement 5 removing 263, 264
LSI Configuration Utility program operator information panel release latch 9
starting 296 optional device problems 112
using 295 optional hot-swap hard disk drive cage
installing 255
removing 255
M optional optical drive
memory specifications 7
installing 216 optional optical drive bay
removing 215 installing 251
specifications 7 removing 251
two-DIMM-per-channel (2DPC) 217 OUT OK LED 133
memory installation sequence OUT OK power LED 13
for non-mirroring mode 218
memory mirroring
description 219 P
DIMM population sequence 220, 221 particulate contamination 7, 309
memory online-spare parts listing 177, 179
description 220 parts, consumable 182
memory problems 107 password 289
menu choices administrator 289
for the Setup utility 286 power-on 289
318 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
password, power-on
switch on system board 289
R
rack installation instructions 5
PCI
rack release latches 9
riser card bracket 258
RAID
riser-card assembly 256, 257
riser-card assembly 203, 204
slot 1 12
RAID array
slot 2 12
creating 297
PCI expansion slots 7
RAID controller
PCI riser card bracket
removing 222, 223
installing 258
rear view 12
PCI riser slots
recovering the server firmware 170
installation configurations 201
redundant array of independent disks (RAID)
supported configurations 201
adapter 207
POST
remind button 12, 121
error codes 27
remote battery, RAID adapter
error log 24
installing 243
POST error codes and event log 24
removing 242
POST event log 24
remote presence feature
power
using 292
power-control button 10
Remote presence support 282
power-on LED 14
removing
server 14
a hot-swap fan 235
specifications 8
a RAID adapter remote battery 242
supply 7
adapter 200
power cords 184
battery, system 246
power problems 113, 172
connector 238
power supply
DIMM 215
ac 231, 233
DVD drive 210
installing 233
hot-swap ac power supply 231
removing 231
hypervisor 229
power-on
optional hot-swap hard disk drive cage 255
LED
optional optical drive bay 251
rear 14
PCI riser card bracket 258
power-on LED 10
RAID controller 222, 223
power-on password 288
SAS/SATA 4 Pac HDD option 255
presence detection button 10
SAS/SATA controller 222, 223
problem diagnosis 3
ServeRAID adapter advanced feature key 226
problem isolation tables 101
the bezel 250
problems
the DVD cable 252
DVD drive 101
the heat sink retention module 274
Ethernet controller 173
the hot-swap SAS/SATA backplane 259
hard disk drive 102
the safety cover 265
hypervisor flash device 104
the simple-swap SATA backplate assembly 261
IMM 40
removing/replacing
intermittent 105
operator information panel 263, 264
memory 107
the system board 275
microprocessor 109
replacement parts 177
monitor 110
replacing
mouse 106
battery, system 246, 248
optional devices 112
DVD drive 210
pointing device 106
the SAS/SATA backplane 259
power 113, 172
the SATA backplate assembly 261
serial port 116
reset button 12, 121
software 118
RETAIN tips 3
undetermined 173
retention module, heat sink
USB port 119
installing 275
video 110, 119
removing 274
public services network, use in 310
riser card
public telecommunications network, connection to 310
connector location 16
publications 5
riser-card assembly
location 200
Index 319
S starting
the LSI Configuration Utility program 296
safety cover
the server firmware 291
installing 266
the Setup utility 285
removing 265
statements and notices 6
SAS/SATA
support, website 305
IDs for hot-swap drives 207
SW3 switch block description 19
SAS/SATA 4 Pac HDD option, installing 255
SW4 switch block description 19
SAS/SATA 4 Pac HDD option, removing 255
switch block, system board 19
SAS/SATA controller
switches and jumpers
hypervisor 229
on the system board 18
removing 222, 223
system
SATA
error LED front 10
IDs for simple-swap drives 209
locator LED front 10
serial and model number
system-error LED
location 174
rear 14
serial connector 12
System
serial port problems 116
information LED 10
Server
system board
power features 14
installing 277
server components 178
internal connectors 16
server firmware
power-on password switch 289
updating 270
removing 275
server firmware, recovering 170
switches and jumpers 18
server replaceable units 177
system event logs 24
server, backup firmware
system pulse LEDs 134
starting 291
system-event log 24
ServeRAID adapter
system-event log, assertion event 24
removing 222, 223
system-event log, deassertion event 24
ServeRAID adapter advanced feature key
system-locator
hypervisor 226
LED
installing 228
rear 14
ServerGuide
systems-management
features 284
Ethernet connector 13
NOS installation 285
setup 284
Setup and Installation CD 281
using 283
T
service request, online 3 telephone numbers 306
Setup utility 281, 282, 285 temperature 7
menu choices 286 test log, viewing 136
starting 285 thermal grease 273
using 285 three boot failure 172
shutdown 15 TOE 8
simple-swap drive tools, diagnostic 23
SATA IDs 209 ToolsCenter for System x and BladeCenter 5
simple-swap front view trademarks 307
LED location 9 troubleshooting 3
simple-swap SATA backplate assembly turning off the server 15
installing 262 baseboard management controller 15
replacing 261 turning on the server 14
simple-swap SATA disk drive (2.5-inch), installing 209 two-DIMM-per-channel (2DPC)
simple-swap, SATA hard disk drive requirements 217
removing 207
size 8
slots U
PCI expansion 7 UEFI
software problems 118 boot recovery jumper 170
software service and support 306 undetermined problems 173
specifications 7 undocumented problems 3
standby mode 14 United States electronic emission Class A notice 310
United States FCC Class A notice 310
320 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Universal Serial Bus (USB) problems 119
UpdateXpress 2, 281
updating
IBM Systems Director 298
server firmware 270
Systems Director, IBM 298
updating firmware 281
USB
connector 10, 12
using
embedded hypervisor 294
IMM 291
the integrated management module 291
the LSI Configuration Utility program 295
the remote presence feature 292
the Setup utility 285
utility
Setup 285
Utility program
IBM Advanced Settings 297
utility, Setup 282
V
video connector
front 10
rear 12
video controller, integrated
specifications 7
video problems 110
viewing event logs 25
virtual media key
installing 238
VMware Hypervisor support 282
W
Wake on LAN feature 14
warranty 5
Web site
UEFI flash diskette 170
website
publication ordering 305
support 305
support line, telephone numbers 306
weight 8
Index 321
322 IBM System x3550 M3 Types 4254 and 7944: Problem Determination and Service Guide
Printed in USA