AFF C250 Systems
AFF C250 Systems
• Video steps
• Detailed steps
If your system is in a MetroCluster IP configuration, see the Install MetroCluster IP Configuration instructions.
MetroCluster Documentation
1
Step 1: Prepare for installation
To install your AFF C250 system, you need to create an account and register the system. You also need to
inventory the appropriate number and type of cables for your system and collect specific network information.
Steps
1. Unpack the contents of all boxes.
2. Record the system serial number from the controllers.
The following table identifies the types of cables you might receive. If you receive a cable not listed in the
table, see the NetApp Hardware Universe to locate the cable and identify its use.
X66240-2 (112-00573), 2m
X66240A-5 (112-00600), 5m
X66211-5 (112-00576), 5m
2
Type of cable… Part number and length Connector type For…
RJ-45 (order Not applicable Management network (BMC and
dependent) wrench port) and Ethernet data
(e0a and e0b)
6. Review the ONTAP Configuration Guide and collect the required information listed in that guide.
You need to install your system in a 4-post rack or NetApp system cabinet, as applicable.
Steps
1. Install the rail kits, as needed.
2. Install and secure your system using the instructions included with the rail kit.
You need to be aware of the safety concerns associated with the weight of the system.
3. Identify and manage cables because this system does not have a cable management device.
4. Place the bezel on the front of the system.
Cable the controllers to a cluster by using the two-node switchless cluster method or by using the cluster
interconnect network.
3
Option 1: Two-node switchless cluster
The management, Fibre Channel, and data or host network ports on the controller modules are
connected to switches. The cluster interconnect ports are cabled on both controller modules.
As you insert the connector, you should feel it click into place; if you do not feel it click,
remove it, turn it around and try again.
Steps
1. Cable the cluster interconnect ports e0c to e0c and e0d to e0d with the 25GbE cluster interconnect
cables.
2. Cable the wrench ports to the management network switches with the RJ45 cables.
4
DO NOT plug in the power cords at this point.
As you insert the connector, you should feel it click into place; if you do not feel it click,
remove it, turn it around and try again.
Steps
1. Cable the cluster interconnect ports e0c and e0d to the 25 GbE cluster interconnect switches.
5
2. Cable the wrench ports to the management network switches with the RJ45 cables.
You have configuration-dependent optional cabling to the Fibre Channel or iSCSI host networks or direct-
attached storage. This cabling is not exclusive; you can have cabling to a host network and storage.
NetApp Hardware Universe slot priority for host network cards (Fibre Channel or 25GbE) is slot
2. However, if you have both cards, the Fibre Channel card goes in slot 2 and the 25GbE card
goes in slot 1 (as shown in the options below). If you have an external shelf, the storage card
goes in slot 1, the only supported slot for shelves.
6
Option 1: Cable to Fibre Channel host network
Fibre Channel ports on the controllers are connected to Fibre Channel host network switches.
As you insert the connector, you should feel it click into place; if you do not feel it click,
remove it, turn it around and try again.
Steps
1. Cable ports 2a through 2d to the FC host switches.
7
As you insert the connector, you should feel it click into place; if you do not feel it click,
remove it, turn it around and try again.
Steps
1. Cable ports e4a through e4d to the 10GbE host network switches.
As you insert the connector, you should feel it click into place; if you do not feel it click,
remove it, turn it around and try again.
8
Animation - Cable the controllers to a single NS224
Steps
1. Cable controller A to the shelf.
Complete the system setup and configuration using cluster discovery with only a connection to the switch and
laptop, or by connecting directly to a controller in the system and then connecting to the management switch.
9
Option 1: If network discovery is enabled
If you have network discovery enabled on your laptop, you can complete system setup and configuration
using automatic cluster discovery.
Steps
1. Use the following animation to power on and set shelf IDs for one or more drive shelves:
For NS224 drive shelves, shelf IDs are pre-set to 00 and 01. If you want to change the shelf IDs, use
the straightened end of a paperclip, or narrow tipped ball point pen to access the shelf ID button
behind the faceplate.
2. Plug the power cords into the controller power supplies, and then connect them to power sources on
different circuits.
The system begins to boot. Initial booting may take up to eight minutes.
10
c. Right-click and select refresh.
d. Double-click either ONTAP icon and accept any certificates displayed on your screen.
6. Use System Manager guided setup to configure your system using the data you collected in the
ONTAP Configuration Guide.
7. Set up your account and download Active IQ Config Advisor:
a. Log in to your existing account or create an account.
Steps
1. Cable and configure your laptop or console:
a. Set the console port on the laptop or console to 115,200 baud with N-8-1.
See your laptop or console’s online help for how to configure the console port.
c. Assign a TCP/IP address to the laptop or console, using one that is on the management switch.
2. Use the following animation to power on and set shelf IDs for one or more drive shelves:
For NS224 drive shelves, shelf IDs are pre-set to 00 and 01. If you want to change the shelf IDs, use
the straightened end of a paperclip, or narrow tipped ball point pen to access the shelf ID button
behind the faceplate.
11
Animation - Set drive shelf IDs
3. Plug the power cords into the controller power supplies, and then connect them to power sources on
different circuits.
The system begins to boot. Initial booting may take up to eight minutes.
Not configured a. Open a console session using PuTTY, a terminal server, or the
equivalent for your environment.
b. Configure the system using the data you collected in the ONTAP Configuration Guide.
6. Set up your account and download Active IQ Config Advisor:
a. Log in to your existing account or create an account.
b. Register your system.
c. Download Active IQ Config Advisor.
7. Verify the health of your system by running Config Advisor.
8. After you have completed the initial configuration, go to ONTAP 9 documentation for information about
configuring additional features in ONTAP.
Maintain
Maintain AFF C250 hardware
For the AFF C250 storage system, you can perform maintenance procedures on the
following components.
12
Boot media
The boot media stores a primary and secondary set of boot image files that the system uses when it boots.
Chassis
The chassis is the physical enclosure housing all the controller components such as the controller/CPU unit,
power supply, and I/O.
Controller
A controller consists of a board, firmware, and software. It controls the drives and implements the ONTAP
functions.
DIMM
You must replace a DIMM (dual in-line memory module) when a memory mismatch is present, or you have a
failed DIMM.
Drive
A drive is a device that provides the physical storage media for data.
Fan
Mezzanine card
A Mezzanine card is a printed circuit board that plugs directly into another plug-in card.
NVMEM battery
A battery is included with the controller and preserves cached data if the AC power fails.
Power supply
A real time clock battery preserves system date and time information if the power is off.
Boot media
The boot media stores a primary and secondary set of system (boot image) files that the
system uses when it boots.
Before you begin
• You must have a USB flash drive, formatted to MBR/FAT32, with the appropriate amount of storage to hold
the image_xxx.tgz file.
13
• You also must copy the image_xxx.tgz file to the USB flash drive for later use in this procedure.
To ensure data security on your storage system, you need to verify the encryption key
support and status on your boot media. Check if your ONTAP version supports NetApp
Volume Encryption (NVE), and before you shut down the controller check if the key
manager is active.
Check whether your ONTAP version supports NetApp Volume Encryption (NVE). This information is crucial for
downloading the correct ONTAP image.
1. Determine if your ONTAP version supports encryption by running the following command:
version -v
If the output includes 1Ono-DARE, NVE is not supported on your cluster version.
2. Depending on whether NVE is supported on your system, take one of the following actions:
◦ If NVE is supported, download the ONTAP image with NetApp Volume Encryption.
◦ If NVE is not supported, download the ONTAP image without NetApp Volume Encryption.
To safely shut down a controller, first identify whether the External Key Manager (EKM) or the Onboard Key
Manager (OKM) is active. Then, verify the key manager in use, display the appropriate key information, and
take action based on the status of the authentication keys.
14
ONTAP version Run this command
ONTAP 9.14.1 or later security key-manager keystore show
2. Depending on whether a key manger is configured on your system, select one of the following options.
b. Check the output for the value in the Restored column for your key manager.
This column indicates whether the authentication keys for your key manager (either EKM or OKM)
have been successfully restored.
3. Depending on whether your system is using the External Key Manager or Onboard Key Manager, select
one of the following options.
15
External Key Manager
Depending on the output value displayed in the Restored column, follow the appropriate steps.
true You can safely shut down the impaired controller. Go to shutdown
the impaired controller.
Anything other than true a. Restore the external key management authentication keys to
all nodes in the cluster using the following command:
If all the authentication keys are true, you can safely shut
down the impaired controller. Go to shutdown the impaired
controller.
16
Output value in Restored Follow these steps…
column
Anything other than true a. Enter the onboard security key-manager sync command:
After completing the NVE or NSE tasks, you need to complete the shutdown of the
impaired controller.
Steps
a. Take the impaired controller to the LOADER prompt:
17
If the impaired controller Then…
displays…
Waiting for giveback… Press Ctrl-C, and then respond y when prompted.
System prompt or password Take over or halt the impaired controller from the healthy controller:
prompt (enter system password) storage failover takeover -ofnode
impaired_node_name
b. From the LOADER prompt, enter: printenv to capture all boot environmental variables. Save the output
to your log file.
This command may not work if the boot device is corrupted or non-functional.
After completing the NVE or NSE tasks, you need to complete the shutdown of the impaired controller.
To shut down the impaired controller, you must determine the status of the controller and, if necessary, take
over the controller so that the healthy controller continues to serve data from the impaired controller storage.
• If you have a cluster with more than two nodes, it must be in quorum. If the cluster is not in quorum or a
healthy controller shows false for eligibility and health, you must correct the issue before shutting down the
impaired controller; see Synchronize a node with the cluster.
• If you have a MetroCluster configuration, you must have confirmed that the MetroCluster Configuration
State is configured and that the nodes are in an enabled and normal state (metrocluster node show).
Steps
1. If AutoSupport is enabled, suppress automatic case creation by invoking an AutoSupport message:
system node autosupport invoke -node * -type all -message
MAINT=number_of_hours_downh
The following AutoSupport message suppresses automatic case creation for two hours: cluster1:*>
system node autosupport invoke -node * -type all -message MAINT=2h
2. Disable automatic giveback from the console of the healthy controller: storage failover modify
–node local -auto-giveback false
3. Take the impaired controller to the LOADER prompt:
18
If the impaired controller is Then…
displaying…
Waiting for giveback… Press Ctrl-C, and then respond y when prompted.
System prompt or password Take over or halt the impaired controller from the healthy controller:
prompt (enter system password) storage failover takeover -ofnode
impaired_node_name
To replace the boot media, you must remove the impaired controller module, install the
replacement boot media, and transfer the boot image to a USB flash drive.
To access components inside the controller module, you must first remove the controller module from the
system, and then remove the cover on the controller module.
Steps
1. If you are not already grounded, properly ground yourself.
2. Unplug the controller module power supplies from the source.
3. Release the power cable retainers, and then unplug the cables from the power supplies.
4. Insert your forefinger into the latching mechanism on either side of the controller module, press the lever
with your thumb, and gently pull the controller a few inches out of the chassis.
If you have difficulty removing the controller module, place your index fingers through the
finger holes from the inside (by crossing your arms).
19
Lever
Latching mechanism
5. Using both hands, grasp the controller module sides and gently pull it out of the chassis and set it on a flat,
stable surface.
6. Turn the thumbscrew on the front of the controller module anti-clockwise and open the controller module
cover.
Thumbscrew
20
Step 2: Replace the boot media
You locate the failed boot media in the controller module by removing the air duct on the controller module
before you can replace the boot media.
You need a #1 magnetic Phillips head screwdriver to remove the screw that holds the boot media in place. Due
to the space constraints within the controller module, you should also have a magnet to transfer the screw on
to so that you do not lose it.
You can use the following video or the tabulated steps to replace the boot media:
1. Locate and replace the impaired boot media from the controller module.
21
Remove the screw securing the boot media to the motherboard in the
controller module.
2. Using the #1 magnetic screwdriver, remove the screw from the impaired boot media, and set it aside safely
on the magnet.
3. Gently lift the impaired boot media directly out of the socket and set it aside.
4. Remove the replacement boot media from the antistatic shipping bag and align it into place on the
controller module.
5. Using the #1 magnetic screwdriver, insert and tighten the screw on the boot media.
Do not apply force when tightening the screw on the boot media; you might crack it.
The replacement boot media that you installed is without a boot image so you need to transfer a boot image
using a USB flash drive.
• You must have a USB flash drive, formatted to MBR/FAT32, with at least 4GB capacity
• A copy of the same image version of ONTAP as what the impaired controller was running. You can
download the appropriate image from the Downloads section on the NetApp Support Site
◦ If NVE is enabled, download the image with NetApp Volume Encryption, as indicated in the download
22
button.
◦ If NVE is not enabled, download the image without NetApp Volume Encryption, as indicated in the
download button.
• If your system is an HA pair, you must have a network connection.
• If your system is a stand-alone system you do not need a network connection, but you must perform an
additional reboot when restoring the var file system.
1. Download and copy the appropriate service image from the NetApp Support Site to the USB flash
drive.
2. Download the service image to your work space on your laptop.
3. Unzip the service image.
If you are extracting the contents using Windows, do not use winzip to extract the
netboot image. Use another extraction tool, such as 7-Zip or WinRAR.
▪ boot
▪ efi
4. Copy the efi folder to the top directory on the USB flash drive.
If the service image has no efi folder, see EFI folder missing from Service Image download
file used for boot device recovery for FAS and AFF models^ .
+ The USB flash drive should have the efi folder and the same Service Image (BIOS) version of what the
impaired controller is running.
23
3. Close the controller module cover and tighten the thumbscrew.
24
Controller module cover
Thumbscrew
4. Align the end of the controller module with the opening in the chassis, and then gently push the
controller module halfway into the system.
5. Insert the USB flash drive into the USB slot on the controller module.
Make sure that you install the USB flash drive in the slot labeled for USB devices, and not in the USB
console port.
6. Push the controller module all the way into the chassis:
7. Place your index fingers through the finger holes from the inside of the latching mechanism.
8. Press your thumbs down on the orange tabs on top of the latching mechanism and gently push the
controller module over the stop.
9. Release your thumbs from the top of the latching mechanisms and continue pushing until the latching
mechanisms snap into place.
The controller module should be fully inserted and flush with the edges of the chassis.
10. Plug the power cords into the power supplies, reinstall the power cable locking collar, and then connect
the power supplies to the power source.
The controller module begins to boot as soon as power is restored. Be prepared to interrupt the boot
process.
11. Interrupt the boot process to stop at the LOADER prompt by pressing Ctrl-C when you see Starting
AUTOBOOT press Ctrl-C to abort….
If you miss this message, press Ctrl-C, select the option to boot to Maintenance mode, and then halt
the controller to boot to LOADER.
12. For systems with one controller in the chassis, reconnect the power and turn on the power supplies.
After installing the new boot media device in your system, you can boot the recovery
image from a USB drive and restore the configuration from the partner node.
Steps
1. From the LOADER prompt, boot the recovery image from the USB flash drive: boot_recovery
2. When prompted, either enter the name of the image or accept the default image displayed inside the
brackets on your screen.
25
3. Restore the var file system:
NOTE: If you see any message other than a successful restore, contact NetApp Support.
e. On the healthy partner controller, return the impaired controller to admin level: set -privilege
admin.
f. On the impaired controller, press Y when you see Was the restore backup procedure
successful?.
g. On the impaired controller, press Y when you see …would you like to use this
restored copy now?.
h. On the impaired controller, press Y when prompted to reboot the impaired controller and press
ctrl-c for the Boot Menu.
i. If the system does not use encryption, select Option 1 Normal Boot., otherwise go to Restore
encryption.
After restore procedure is successful, this message will be seen on the console -
syncflash_partner: Restore from partner complete.
b. On the impaired controller, press Y when prompted to confirm if the restore backup was
successful.
c. On the impaired controller, press Y when prompted to use the restored configuration.
d. On the impaired controller, press Y when prompted to reboot the node.
e. On the impaired controller, press Y when prompted to reboot the impaired controller and press
ctrl-c for the Boot Menu.
f. If the system does not use encryption, select Option 1 Normal Boot., otherwise go to Restore
encryption.
26
7. If AutoSupport is enabled, restore/unsuppress automatic case creation by using the system node
autosupport invoke -node * -type all -message MAINT=END command.
Depending on which a key manger is configured on your system, select one of the following options to restore
it from the boot menu.
Restore the Onboard Key Manager (OKM) configuration from the ONTAP boot menu.
Steps
1. Connect the console cable to the target controller.
2. From the ONTAP boot menu select the appropriate option from the boot menu.
27
ONTAP version Select this option
ONTAP 9.8 or later Select option 10.
28
ONTAP version Select this option
ONTAP 9.7 and earlier Select the hidden option recover_onboard_keymanager
This option must be used only in disaster recovery procedures. Are you
sure? (y or n):
While entering the passphrase the console will not show any input.
a. Paste the entire content from the BEGIN BACKUP line through the END BACKUP line.
29
Show example prompt
--------------------------BEGIN BACKUP--------------------------
0123456789012345678901234567890123456789012345678901234567890123
1234567890123456789012345678901234567890123456789012345678901234
2345678901234567890123456789012345678901234567890123456789012345
3456789012345678901234567890123456789012345678901234567890123456
4567890123456789012345678901234567890123456789012345678901234567
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
0123456789012345678901234567890123456789012345678901234567890123
1234567890123456789012345678901234567890123456789012345678901234
2345678901234567890123456789012345678901234567890123456789012345
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
---------------------------END BACKUP---------------------------
30
Show example prompt
*****************************************************************
******************
* Select option "(1) Normal Boot." to complete recovery process.
*
* Run the "security key-manager onboard sync" command to
synchronize the key database after the node reboots.
*****************************************************************
******************
Do not proceed if the displayed output is anything other than Successfully recovered
keymanager secrets. Perform troubleshooting to correct the error.
6. Select option 1 from the boot menu to continue booting into ONTAP.
31
Show example prompt
********************************************************************
***************
* Select option "(1) Normal Boot." to complete the recovery process.
*
********************************************************************
***************
8. From the partner node, giveback the partner controller by entering the following command.
9. After booting with only the CFO aggregate, run the following command.
10. Enter the cluster-wide passphrase for the Onboard Key Manager.
32
Show example prompt
If the sync is successful the cluster prompt is returned with no additional messages. If the
sync fails an error message appears before returning to the cluster prompt. Do not continue
until the the error is corrected and the sync runs successfully.
11. Ensure that all keys are synced by entering the following command.
No results should appear when filtering for false in the restored parameter.
12. Giveback the node from the partner by entering the following command.
13. Restore automatic giveback, if you disabled it, by entering the following command.
14. If AutoSupport is enabled, restore automatic case creation by entering the following command.
Restore the External Key Manager configuration from the ONTAP boot menu.
• A copy of the /cfcard/kmip/servers.cfg file from another cluster node or the following information:
◦ The KMIP server address.
◦ The KMIP port.
• A copy of the /cfcard/kmip/certs/client.crt file from another cluster node or the client certificate.
33
• A copy of the /cfcard/kmip/certs/client.key file from another cluster node or the client key.
• A copy of the /cfcard/kmip/certs/CA.pem file from another cluster node or the KMIP server CA(s).
Steps
1. Connect the console cable to the target controller.
2. Select option 11 from the ONTAP boot menu.
Show prompt
34
Show example
After you enter the client and server information, the recovery process completes.
Show example
5. Select option 1 from the boot menu to continue booting into ONTAP.
35
Show example prompt
********************************************************************
*******
* Select option "(1) Normal Boot." to complete the recovery process.
*
********************************************************************
*******
7. If AutoSupport is enabled, restore automatic case creation by entering the following command.
Return the failed part to NetApp, as described in the RMA instructions shipped with the
kit. See the Part Return and Replacements page for further information.
Chassis
To replace the chassis, you must move the bezel, controller modules, and NVMe drives
from the impaired chassis to the replacement chassis, and then remove the impaired
chassis from the equipment rack or system cabinet and install the replacement chassis in
its place.
About this task
36
• All other components in the system must be functioning properly; if not, you must contact technical support.
• You can use this procedure with all versions of ONTAP supported by your system.
• This procedure is written with the assumption that you are moving the bezel, NVMe drives, and controller
modules to the new chassis, and that the replacement chassis is a new component from NetApp.
• This procedure is disruptive. For a two-node cluster, you will have a complete service outage and a partial
outage in a multi-node cluster.
This procedure is for systems with two node configurations. For more information about
graceful shutdown when servicing a cluster, see Gracefully shutdown and power up your
storage system Resolution Guide - NetApp Knowledge Base.
Before you begin
• Make sure you have the necessary permissions and credentials:
◦ Local administrator credentials for ONTAP.
◦ BMC accessability for each controller.
• Make sure you have the necessary tools and equipment for the replacement.
• As a best practice before shutdown, you should:
◦ Perform additional system health checks.
◦ Upgrade ONTAP to a recommended release for the system.
◦ Resolve any Active IQ Wellness Alerts and Risks. Make note of any faults presently on the system,
such as LEDs on the system components.
Steps
1. Log into the cluster through SSH or log in from any node in the cluster using a local console cable and a
laptop/console.
2. Stop all clients/host from accessing data on the NetApp system.
3. Suspend external backup jobs.
4. If AutoSupport is enabled, suppress case creation and indicate how long you expect the system to be
offline:
system node autosupport invoke -node * -type all -message "MAINT=2h Replace
chassis"
exit
7. Log into SP/BMC over SSH using the IP address of any of the nodes listed in the output from the previous
step to monitor progress.
If you are using a console/laptop, log into the controller using the same cluster administrator credentials.
37
8. Halt the two nodes located in the impaired chassis:
For clusters using SnapMirror synchronous operating in StrictSync mode: system node
halt -node <node1>,<node2> -skip-lif-migration-before-shutdown true
-ignore-quorum-warnings true -inhibit-takeover true -ignore-strict
-sync-warnings true
Warning: Are you sure you want to halt node <node_name>? {y|n}:
10. Wait for each controller to halt and display the LOADER prompt.
To replace the chassis, you move the power supplies, hard drives, and controller module
from the impaired chassis to the new chassis, and swap out the impaired chassis from
with the new chassis of the same model as the impaired chassis.
To replace the chassis, you must remove the controller modules from the old chassis.
Use the following video or the tabulated steps to replace the chassis; it assumes the removal and replacement
of the bezel:
If you have difficulty removing the controller module, place your index fingers through the
finger holes from the inside (by crossing your arms).
38
Lever
Latching mechanism
5. Using both hands, grasp the controller module sides and gently pull it out of the chassis and set it on a flat,
stable surface.
6. Set the controller module aside in a safe place, and repeat these steps for the other controller module in
the chassis.
You need to move the drives from each bay opening in the old chassis to the same bay opening in the new
chassis.
The drive should disengage from the chassis, allowing it to slide free of the chassis.
When removing a drive, always use two hands to support its weight.
Drives are fragile. Handle them as little as possible to prevent damage to them.
3. Align the drive from the old chassis with the same bay opening in the new chassis.
4. Gently push the drive into the chassis as far as it will go.
5. Firmly push the drive the rest of the way into the chassis, and then lock the cam handle by pushing it up
and against the drive holder.
39
Be sure to close the cam handle slowly so that it aligns correctly with the front of the drive carrier. It clicks
when it is secure.
Step 3: Replace a chassis from within the equipment rack or system cabinet
You must remove the existing chassis from the equipment rack or system cabinet before you can install the
replacement chassis.
After you install the controller modules into the new chassis, you need to boot the system.
For HA pairs with two controller modules in the same chassis, the sequence in which you install the controller
module is especially important because it attempts to reboot as soon as you completely seat it in the chassis.
1. Align the end of the controller module with the opening in the chassis, and then gently push the controller
module halfway into the system.
Do not completely insert the controller module in the chassis until instructed to do so.
2. Recable the console to the controller module, and then reconnect the management port.
3. Insert the controller module into the chassis:
a. Ensure the latching mechanism arms are locked in the fully extended position.
b. Using both hands, align and gently slide the controller module into the latching mechanism arms until it
stops.
c. Place your index fingers through the finger holes from the inside of the latching mechanism.
d. Press your thumbs down on the orange tabs on top of the latching mechanism and gently push the
controller module over the stop.
e. Release your thumbs from the top of the latching mechanisms and continue pushing until the latching
mechanisms snap into place.
The controller module should be fully inserted and flush with the edges of the chassis.
a. Plug the power cords into the power supplies, reinstall the power cable locking collar, and then connect
the power supplies to the power source.
40
The controller module begins to boot as soon as power is restored. Be prepared to interrupt the boot
process.
4. Repeat the preceding steps to install the second controller into the new chassis.
You must verify the HA state of the chassis, and return the failed part to NetApp, as
described in the RMA instructions shipped with the kit.
You must verify the HA state of the chassis, and, if necessary, update the state to match your system
configuration.
1. In Maintenance mode, from either controller module, display the HA state of the local controller module and
chassis: ha-config show
2. If the displayed system state for the chassis does not match your system configuration:
a. Set the HA state for the chassis: ha-config modify chassis HA-state
▪ ha
▪ mcc
▪ mccip
▪ non-ha
b. Confirm that the setting has changed: ha-config show
3. If you have not already done so, recable the rest of your system.
4. Reinstall the bezel on the front of the system.
Return the failed part to NetApp, as described in the RMA instructions shipped with the kit. See the Part Return
and Replacements page for further information.
Controller
You must review the prerequisites for the replacement procedure and select the correct
one for your version of the ONTAP operating system.
• All drive shelves must be working properly.
• If your system is in a MetroCluster configuration, you must review the section Choosing the correct
recovery procedure to determine whether you should use this procedure.
41
• You must replace the failed component with a replacement FRU component you received from your
provider.
• You must be replacing a controller module with a controller module of the same model type. You cannot
upgrade your system by just replacing the controller module.
• You cannot change any drives or drive shelves as part of this procedure.
• In this procedure, the boot device is moved from the impaired controller to the replacement controller so
that the replacement controller will boot up in the same version of ONTAP as the old controller module.
• It is important that you apply the commands in these steps on the correct systems:
◦ The impaired controller is the controller that is being replaced.
◦ The replacement controller is the new controller that is replacing the impaired controller.
◦ The healthy controller is the surviving controller.
• You must always capture the controller’s console output to a text file.
This provides you a record of the procedure so that you can troubleshoot any issues that you might
encounter during the replacement process.
To shut down the impaired controller, you must determine the status of the controller and,
if necessary, take over the controller so that the healthy controller continues to serve data
from the impaired controller storage.
About this task
• If you have a SAN system, you must have checked event messages (cluster kernel-service show)
for the impaired controller SCSI blade. The cluster kernel-service show command (from priv
advanced mode) displays the node name, quorum status of that node, availability status of that node, and
operational status of that node.
Each SCSI-blade process should be in quorum with the other nodes in the cluster. Any issues must be
resolved before you proceed with the replacement.
• If you have a cluster with more than two nodes, it must be in quorum. If the cluster is not in quorum or a
healthy controller shows false for eligibility and health, you must correct the issue before shutting down the
impaired controller; see Synchronize a node with the cluster.
Steps
1. If AutoSupport is enabled, suppress automatic case creation by invoking an AutoSupport message:
system node autosupport invoke -node * -type all -message MAINT=<# of hours>h
The following AutoSupport message suppresses automatic case creation for two hours: cluster1:>
system node autosupport invoke -node * -type all -message MAINT=2h
2. Disable automatic giveback from the console of the healthy controller: storage failover modify
-node local -auto-giveback false
42
If the impaired controller is Then…
displaying…
The LOADER prompt Go to the next step.
Waiting for giveback… Press Ctrl-C, and then respond y when prompted.
System prompt or password Take over or halt the impaired controller from the healthy controller:
prompt storage failover takeover -ofnode
impaired_node_name
To replace the controller module hardware, you must remove the impaired controller,
move FRU components to the replacement controller module, install the replacement
controller module in the chassis, and then boot the system to Maintenance mode.
You must remove the controller module from the chassis when you replace a component inside the controller
module.
Make sure that you label the cables so that you know where they came from.
Use the following video or the tabulated steps to replace a controller module:
If you have difficulty removing the controller module, place your index fingers through the
finger holes from the inside (by crossing your arms).
43
Lever
Latching mechanism
5. Using both hands, grasp the controller module sides and gently pull it out of the chassis and set it on a flat,
stable surface.
6. Turn the thumbscrew on the front of the controller module anti-clockwise and open the controller module
cover.
44
Thumbscrew
You must move the power supply from the impaired controller module to the replacement controller module
when you replace a controller module.
Do not use excessive force when sliding the controller module into the chassis to avoid
damaging the connectors.
The power supply is short. Always use two hands to support it when removing it from the controller module
so that it does not suddenly swing free from the controller module and injure you.
45
Blue power supply locking tab
Power supply
5. Move the power supply to the new controller module, and then install it.
6. Using both hands, support and align the edges of the power supply with the opening in the controller
module, and then gently push the power supply into the controller module until the locking tab clicks into
place.
The power supplies will only properly engage with the internal connector and lock in place one way.
To avoid damaging the internal connector, do not use excessive force when sliding the
power supply into the system.
You must move the fans from the impaired controller module to the replacement module when replacing a
failed controller module.
1. Remove the fan module by pinching the side of the fan module, and then lifting the fan module straight out
of the controller module.
46
Fan module
2. Move the fan module to the replacement controller module, and align the edges of the fan module with the
opening in the controller module, and then slide the fan module in.
3. Repeat these steps for the remaining fan modules.
You must move the boot media device from the impaired controller module to the replacement controller
module.
You need a #1 magnetic Phillips head screwdriver to remove the screw that holds the boot media in place. Due
to the space constraints within the controller module, you should also have a magnet to transfer the screw on
to so that you do not lose it.
1. Locate and move the boot media from the impaired controller module to the replacement controller module.
The boot media is located under the air duct cover you removed earlier in this procedure.
47
Remove the screw securing the boot media to the motherboard in the impaired
controller module.
2. Using the #1 magnetic screwdriver, remove the screw from the boot media, and set it aside safely on the
magnet.
3. Gently lift the boot media directly out of the socket and align it into place in the replacement controller
module.
4. Using the #1 magnetic screwdriver, insert and tighten the screw on the boot media.
Do not apply force when tightening the screw on the boot media; you might crack it.
To move the DIMMs, locate and move them from the impaired controller into the replacement controller and
follow the specific sequence of steps.
48
Install each DIMM into the same slot it occupied in the impaired controller module.
1. Slowly push apart the DIMM ejector tabs on either side of the DIMM, and slide the DIMM out of the slot.
Hold the DIMM by the edges to avoid pressure on the components on the DIMM circuit
board.
The DIMMs fit tightly in the socket. If not, reinsert the DIMM to realign it with the socket.
4. Visually inspect the DIMM to verify that it is evenly aligned and fully inserted into the socket.
5. Repeat these steps for the remaining DIMM.
To move a mezzanine card, you must remove the cabling and any QSFPs and SFPs from the ports, move the
mezzanine card to the replacement controller, reinstall any QSFPs and SFPs onto the ports, and cable the
ports.
1. Locate and move the mezzanine cards from your impaired controller module.
49
Remove screws on the face of the controller module.
Make sure that you label the cables so that you know where they came from.
a. Remove any SFP or QSFP modules that might be in the mezzanine card and set it aside.
b. Using the #1 magnetic screwdriver, remove the screws from the face of the impaired controller module
and from the mezzanine card, and set them aside safely on the magnet.
c. Gently lift the mezzanine card out of the socket and move it to the same position in the replacement
controller.
d. Gently align the mezzanine card into place in the replacement controller.
e. Using the #1 magnetic screwdriver, insert and tighten the screws on the face of the replacement
controller module and on the mezzanine card.
Do not apply force when tightening the screw on the mezzanine card; you might crack it.
3. Repeat these steps if there is another mezzanine card in the impaired controller module.
50
4. Insert the SFP or QSFP modules that were removed onto the mezzanine card.
When replacing the controller module, you must move the NV battery from the impaired controller module to
the replacement controller module.
1. Locate and move the NVMEM battery from your impaired controller module to the replacement controller
module.
Grasp the battery and press the blue locking tab marked PUSH.
2. Locate the battery plug and squeeze the clip on the face of the battery plug to release the plug from the
socket.
3. Grasp the battery and press the blue locking tab marked PUSH, and then lift the battery out of the holder
and controller module.
4. Locate the corresponding NV battery holder on the replacement controller module and align the NV battery
51
to the battery holder.
5. Insert the NV battery plug into the socket.
6. Slide the battery pack down along the sheet metal side wall until the support tabs on the side wall hook into
the slots on the battery pack, and the battery pack latch engages and clicks into the opening on the side
wall.
7. Press firmly down on the battery pack to make sure that it is locked into place.
After all of the components have been moved from the impaired controller module to the replacement controller
module, you must install the replacement controller module into the chassis, and then boot it to Maintenance
mode.
You can use the following illustrations or the written steps to install the replacement controller module in the
chassis.
1. If you have not already done so, install the air duct.
52
Controller module cover
Thumbscrew
3. Align the end of the controller module with the opening in the chassis, and then gently push the controller
module halfway into the system.
Do not completely insert the controller module in the chassis until instructed to do so.
4. Cable the management and console ports only, so that you can access the system to perform the tasks in
the following sections.
You will connect the rest of the cables to the controller module later in this procedure.
53
mechanisms snap into place.
The controller module should be fully inserted and flush with the edges of the chassis.
f. Plug the power cords into the power supplies, reinstall the power cable locking collar, and then connect
the power supplies to the power source.
The controller module begins to boot as soon as power is restored. Be prepared to interrupt the boot
process.
After completing the hardware replacement and booting to Maintenance mode, you verify
the low-level system configuration of the replacement controller and reconfigure system
settings as necessary.
Step 1: Set and verify system time after replacing the controller
You should check the time and date on the replacement controller module against the healthy controller
module in an HA pair, or against a reliable time server in a stand-alone configuration. If the time and date do
not match, you must reset them on the replacement controller module to prevent possible outages on clients
due to time differences.
• The replacement node is the new node that replaced the impaired node as part of this procedure.
• The healthy node is the HA partner of the replacement node.
Steps
1. If the replacement node is not at the LOADER prompt, halt the system to the LOADER prompt.
2. On the healthy node, check the system time: cluster date show
3. At the LOADER prompt, check the date and time on the replacement node: show date
4. If necessary, set the date in GMT on the replacement node: set date mm/dd/yyyy
5. If necessary, set the time in GMT on the replacement node: set time hh:mm:ss
6. At the LOADER prompt, confirm the date and time on the replacement node: show date
You must verify the HA state of the controller module and, if necessary, update the state to match your system
configuration.
54
1. In Maintenance mode from the new controller module, verify that all components display the same HA
state: ha-config show
2. If the displayed system state of the controller module does not match your system configuration, set the HA
state for the controller module: ha-config modify controller ha-state
◦ ha
◦ mcc
◦ mccip
◦ non-ha
3. If the displayed system state of the controller module does not match your system configuration, set the HA
state for the controller module: ha-config modify controller ha-state
4. Confirm that the setting has changed: ha-config show
Continue the replacement procedure by recabling the storage and confirming disk
reassignment.
Steps
1. Verify that the cabling is correct by using Active IQ Config Advisor.
a. Download and install Config Advisor.
b. Enter the information for the target system, and then click Collect Data.
c. Click the Cabling tab, and then examine the output. Make sure that all disk shelves are displayed and
all disks appear in the output, correcting any cabling issues you find.
d. Check other cabling by clicking the appropriate tab, and then examining the output from Config Advisor.
If the storage system is in an HA pair, the system ID of the new controller module is automatically assigned to
the disks when the giveback occurs at the end of the procedure. You must confirm the system ID change when
you boot the replacement controller and then verify that the change was implemented.
1. If the replacement controller is in Maintenance mode (showing the *> prompt, exit Maintenance mode and
go to the LOADER prompt: halt
2. From the LOADER prompt on the replacement controller, boot the controller, entering y if you are prompted
to override the system ID due to a system ID mismatch:
3. Wait until the Waiting for giveback… message is displayed on the replacement controller console and
55
then, from the healthy controller, verify that the new partner system ID has been automatically assigned:
storage failover show
In the command output, you should see a message that the system ID has changed on the impaired
controller, showing the correct old and new IDs. In the following example, node2 has undergone
replacement and has a new system ID of 151759706.
4. From the healthy controller, verify that any coredumps are saved:
a. Change to the advanced privilege level: set -privilege advanced
You can respond Y when prompted to continue into advanced mode. The advanced mode prompt
appears (*>).
b. Save any coredumps: system node run -node local-node-name partner savecore
c. Wait for the `savecore`command to complete before issuing the giveback.
You can enter the following command to monitor the progress of the savecore command: system
node run -node local-node-name partner savecore -s
The replacement controller takes back its storage and completes booting.
If you are prompted to override the system ID due to a system ID mismatch, you should enter y.
56
Find the High-Availability Configuration content for your version of ONTAP 9
b. After the giveback has been completed, confirm that the HA pair is healthy and that takeover is
possible: storage failover show
The output from the storage failover show command should not include the System ID changed
on partner message.
7. Verify that the disks were assigned correctly: storage disk show -ownership
The disks belonging to the replacement controller should show the new system ID. In the following
example, the disks owned by node1 now show the new system ID, 1873775277:
8. If the system is in a MetroCluster configuration, monitor the status of the controller: metrocluster node
show
The MetroCluster configuration takes a few minutes after the replacement to return to a normal state, at
which time each controller will show a configured state, with DR Mirroring enabled and a mode of normal.
The metrocluster node show -fields node-systemid command output displays the old system
ID until the MetroCluster configuration returns to a normal state.
9. If the controller is in a MetroCluster configuration, depending on the MetroCluster state, verify that the DR
home ID field shows the original owner of the disk if the original owner is a controller on the disaster site.
Disk ownership changes during HA takeover and MetroCluster switchover in a four-node MetroCluster
configuration
10. If your system is in a MetroCluster configuration, verify that each controller is configured: metrocluster
node show - fields configuration-state
57
node1_siteA::> metrocluster node show -fields configuration-state
11. Verify that the expected volumes are present for each controller: vol show -node node-name
12. If you disabled automatic takeover on reboot, enable it from the healthy controller: storage failover
modify -node replacement-node-name -onreboot true
To restore your system to full operation, you must restore the NetApp Storage Encryption
configuration (if necessary), and install licenses for the new controller, and return the
failed part to NetApp, as described in the RMA instructions shipped with the kit.
You must install new licenses for the replacement node if the impaired node was using ONTAP features that
require a standard (node-locked) license. For features with standard licenses, each node in the cluster should
have its own key for the feature.
Also, using unlicensed features on the node might put you out of compliance with your license agreement, so
you should install the replacement license key or keys on the replacement node as soon as possible.
You have a 90-day grace period in which to install the license keys. After the grace period, all old licenses are
invalidated. After a valid license key is installed, you have 24 hours to install all of the keys before the grace
period ends.
If your system was initially running ONTAP 9.10.1 or later, use the procedure documented in
Post Motherboard Replacement Process to update Licensing on a AFF/FAS system. If you are
unsure of the initial ONTAP release for your system, see NetApp Hardware Universe for more
information.
Steps
58
1. If you need new license keys, obtain replacement license keys on the NetApp Support Site in the My
Support section under Software licenses.
The new license keys that you require are automatically generated and sent to the email
address on file. If you fail to receive the email with the license keys within 30 days, you
should contact technical support.
2. Install each license key: system license add -license-code license-key, license-key...
3. Remove the old licenses, if desired:
a. Check for unused licenses: license clean-up -unused -simulate
b. If the list looks correct, remove the unused licenses: license clean-up -unused
Before returning the replacement node to service, you should verify that the LIFs are on their home ports, and
register the serial number of the replacement node if AutoSupport is enabled, and reset automatic giveback.
Steps
1. Verify that the logical interfaces are reporting to their home server and ports: network interface show
-is-home false
If any LIFs are listed as false, revert them to their home ports: network interface revert -vserver
* -lif *
Return the failed part to NetApp, as described in the RMA instructions shipped with the kit. See the Part Return
and Replacements page for further information.
59
You must replace the failed component with a replacement FRU component you received from your provider.
To shut down the impaired controller, you must determine the status of the controller and, if necessary, take
over the controller so that the healthy controller continues to serve data from the impaired controller storage.
Each SCSI-blade process should be in quorum with the other nodes in the cluster. Any issues must be
resolved before you proceed with the replacement.
• If you have a cluster with more than two nodes, it must be in quorum. If the cluster is not in quorum or a
healthy controller shows false for eligibility and health, you must correct the issue before shutting down the
impaired controller; see Synchronize a node with the cluster.
Steps
1. If AutoSupport is enabled, suppress automatic case creation by invoking an AutoSupport message:
system node autosupport invoke -node * -type all -message MAINT=<# of hours>h
The following AutoSupport message suppresses automatic case creation for two hours: cluster1:>
system node autosupport invoke -node * -type all -message MAINT=2h
2. Disable automatic giveback from the console of the healthy controller: storage failover modify
-node local -auto-giveback false
Waiting for giveback… Press Ctrl-C, and then respond y when prompted.
System prompt or password Take over or halt the impaired controller from the healthy controller:
prompt storage failover takeover -ofnode
impaired_node_name
60
Step 2: Remove the controller module
You must remove the controller module from the chassis when you replace a component inside the controller
module.
Make sure that you label the cables so that you know where they came from.
If you have difficulty removing the controller module, place your index fingers through the
finger holes from the inside (by crossing your arms).
Lever
Latching mechanism
5. Using both hands, grasp the controller module sides and gently pull it out of the chassis and set it on a flat,
stable surface.
6. Turn the thumbscrew on the front of the controller module anti-clockwise and open the controller module
cover.
61
Thumbscrew
62
Step 3: Replace a DIMM
To replace a DIMM, you must locate it in the controller module using the DIMM map label on top of the air duct
and then replace it following the specific sequence of steps.
The DIMMs are in slot 3 or 1 on the motherboard. Slot 2 and 4 are left empty. Do not attempt to install
DIMMs into these slots.
63
2. Note the orientation of the DIMM in the socket so that you can insert the replacement DIMM in the proper
orientation.
3. Slowly push apart the DIMM ejector tabs on either side of the DIMM, and slide the DIMM out of the slot.
4. Leave DIMM ejector tabs on the connector in the open position.
5. Remove the replacement DIMM from the antistatic shipping bag, hold the DIMM by the corners, and align it
to the slot.
Hold the DIMM by the edges to avoid pressure on the components on the DIMM circuit
board.
The DIMMs fit tightly in the socket. If not, reinsert the DIMM to realign it with the socket.
7. Visually inspect the DIMM to verify that it is evenly aligned and fully inserted into the socket.
After you have replaced the component in the controller module, you must reinstall the controller module into
the chassis, and then boot it.
You can use the following illustrations or the written steps to install the replacement controller module in the
chassis.
1. If you have not already done so, install the air duct.
64
2. Close the controller module cover and tighten the thumbscrew.
65
Controller module cover
Thumbscrew
The controller module should be fully inserted and flush with the edges of the chassis.
4. Plug the power cords into the power supplies, reinstall the power cable locking collar, and then connect the
power supplies to the power source.
The controller module begins to boot as soon as power is restored. Be prepared to interrupt the boot
process.
Return the failed part to NetApp, as described in the RMA instructions shipped with the kit. See the Part Return
and Replacements page for further information.
66
The failed drive appears in the list of failed drives. If it does not, you should wait, and then run the
command again.
It can take up to several hours for the drive to appear in the list of failed drives.
How you replace the disk depends on how the disk drive is being used. If SED authentication is enabled,
you must use the SED replacement instructions in the ONTAP 9 NetApp Encryption Power Guide. These
Instructions describe additional steps you must perform before and after replacing an SED.
• Make sure the replacement drive is supported by your platform. See the NetApp Hardware Universe.
• Make sure all other components in the system are functioning properly; if not, you must contact technical
support.
When replacing several disk drives, you must wait 70 seconds between the removal of each failed disk drive
and the insertion of the replacement disk drive to allow the storage system to recognize the existence of each
new disk.
Steps
1. If you want to manually assign drive ownership for the replacement drive, you need to disable automatic
drive assignment replacement drive, if it is enabled
You manually assign drive ownership and then reenable automatic drive assignment later in
this procedure.
a. Verify whether automatic drive assignment is enabled: storage disk option show
If automatic drive assignment is enabled, the output shows on in the “Auto Assign” column (for each
controller module).
b. If automatic drive assignment is enabled, disable it: storage disk option modify -node
node_name -autoassign off
When a drive fails, the system logs a warning message to the system console indicating which drive failed.
Additionally, the attention (amber) LED on the drive shelf operator display panel and the failed drive
illuminate.
The activity (green) LED on a failed drive can be illuminated (solid), which indicates that the
drive has power, but should not be blinking, which indicates I/O activity. A failed drive has no
I/O activity.
67
4. Remove the failed drive:
a. Press the release button on the drive face to open the cam handle.
b. Slide the drive out of the shelf using the cam handle and supporting the drive with your other hand.
5. Wait a minimum of 70 seconds before inserting the replacement drive.
Be sure to close the cam handle slowly so that it aligns correctly with the face of the drive.
When the drive’s activity LED is solid, it means that the drive has power. When the drive’s activity LED is
blinking, it means that the drive has power and I/O is in progress. If the drive firmware is automatically
updating, the LED blinks.
b. Assign each drive: storage disk assign -disk disk_name -owner node_name
You can use the wildcard character to assign more than one drive at once.
c. Reenable automatic drive assignment if needed: storage disk option modify -node
node_name -autoassign on
10. Return the failed part to NetApp, as described in the RMA instructions shipped with the kit.
Contact technical support at NetApp Support, 888-463-8277 (North America), 00-800-44-638277 (Europe),
or +800-800-80-800 (Asia/Pacific) if you need the RMA number or additional help with the replacement
procedure.
68
Step 1: Shut down the impaired controller
To shut down the impaired controller, you must determine the status of the controller and, if necessary, take
over the controller so that the healthy controller continues to serve data from the impaired controller storage.
Each SCSI-blade process should be in quorum with the other nodes in the cluster. Any issues must be
resolved before you proceed with the replacement.
• If you have a cluster with more than two nodes, it must be in quorum. If the cluster is not in quorum or a
healthy controller shows false for eligibility and health, you must correct the issue before shutting down the
impaired controller; see Synchronize a node with the cluster.
Steps
1. If AutoSupport is enabled, suppress automatic case creation by invoking an AutoSupport message:
system node autosupport invoke -node * -type all -message MAINT=<# of hours>h
The following AutoSupport message suppresses automatic case creation for two hours: cluster1:>
system node autosupport invoke -node * -type all -message MAINT=2h
2. Disable automatic giveback from the console of the healthy controller: storage failover modify
-node local -auto-giveback false
Waiting for giveback… Press Ctrl-C, and then respond y when prompted.
System prompt or password Take over or halt the impaired controller from the healthy controller:
prompt storage failover takeover -ofnode
impaired_node_name
You must remove the controller module from the chassis when you replace a component inside the controller
module.
69
Make sure that you label the cables so that you know where they came from.
If you have difficulty removing the controller module, place your index fingers through the
finger holes from the inside (by crossing your arms).
Lever
Latching mechanism
5. Using both hands, grasp the controller module sides and gently pull it out of the chassis and set it on a flat,
stable surface.
6. Turn the thumbscrew on the front of the controller module anti-clockwise and open the controller module
cover.
70
Thumbscrew
To replace a fan, remove the failed fan module and replace it with a new fan module.
1. Identify the fan module that you must replace by checking the console error messages or by locating the lit
LED for the fan module on the motherboard.
2. Remove the fan module by pinching the side of the fan module, and then lifting the fan module straight out
of the controller module.
71
Fan module
3. Align the edges of the replacement fan module with the opening in the controller module, and then slide the
replacement fan module into the controller module.
After you replace a component within the controller module, you must reinstall the controller module in the
system chassis and boot it.
72
Controller module cover
Thumbscrew
The controller module should be fully inserted and flush with the edges of the chassis.
f. Plug the power cords into the power supplies, reinstall the power cable locking collar, and then connect
the power supplies to the power source.
The controller module begins to boot as soon as power is restored. Be prepared to interrupt the boot
process.
73
4. Return the controller to normal operation by giving back its storage: storage failover giveback
-ofnode impaired_node_name
5. If automatic giveback was disabled, reenable it: storage failover modify -node local -auto
-giveback true
Return the failed part to NetApp, as described in the RMA instructions shipped with the kit. See the Part Return
and Replacements page for further information.
To shut down the impaired controller, you must determine the status of the controller and, if necessary, take
over the controller so that the healthy controller continues to serve data from the impaired controller storage.
Each SCSI-blade process should be in quorum with the other nodes in the cluster. Any issues must be
resolved before you proceed with the replacement.
• If you have a cluster with more than two nodes, it must be in quorum. If the cluster is not in quorum or a
healthy controller shows false for eligibility and health, you must correct the issue before shutting down the
impaired controller; see Synchronize a node with the cluster.
Steps
1. If AutoSupport is enabled, suppress automatic case creation by invoking an AutoSupport message:
system node autosupport invoke -node * -type all -message MAINT=<# of hours>h
The following AutoSupport message suppresses automatic case creation for two hours: cluster1:>
system node autosupport invoke -node * -type all -message MAINT=2h
2. Disable automatic giveback from the console of the healthy controller: storage failover modify
-node local -auto-giveback false
74
3. Take the impaired controller to the LOADER prompt:
Waiting for giveback… Press Ctrl-C, and then respond y when prompted.
System prompt or password Take over or halt the impaired controller from the healthy controller:
prompt storage failover takeover -ofnode
impaired_node_name
Remove the controller module from the chassis when you replace a component inside the controller module.
Make sure that you label the cables so that you know where they came from.
If you have difficulty removing the controller module, place your index fingers through the
finger holes from the inside (by crossing your arms).
Lever
75
Latching mechanism
5. Using both hands, grasp the controller module sides and gently pull it out of the chassis and set it on a flat,
stable surface.
6. Turn the thumbscrew on the front of the controller module anti-clockwise and open the controller module
cover.
Thumbscrew
To replace a mezzanine card, you must remove the impaired card and install the replacement card; to install a
mezzanine card, you must remove the faceplate and install the new card.
Use the following video or the tabulated steps to replace a mezzanine card:
76
Remove screws on the face of the controller
module.
Make sure that you label the cables so that you know where they came from.
b. Remove any SFP or QSFP modules that might be in the impaired mezzanine card and set it aside.
c. Using the #1 magnetic screwdriver, remove the screws from the face of the controller module and set
them aside safely on the magnet.
d. Using the #1 magnetic screwdriver, loosen the screw on the impaired mezzanine card.
e. Using the #1 magnetic screwdriver, gently lift the impaired mezzanine card directly out of the socket
and set it aside.
f. Remove the replacement mezzanine card from the antistatic shipping bag and align it to the inside face
of the controller module.
g. Gently align the replacement mezzanine card into place.
h. Using the #1 magnetic screwdriver, insert and tighten the screws on the face of the controller module
and on the mezzanine card.
77
Do not apply force when tightening the screw on the mezzanine card; you might crack it.
i. Insert any SFP or QSFP modules that were removed from the impaired mezzanine card to the
replacement mezzanine card.
3. To install a mezzanine card:
4. You install a new mezzanine card if your system does not have one.
a. Using the #1 magnetic screwdriver, remove the screws from the face of the controller module and the
faceplate covering the mezzanine card slot, and set them aside safely on the magnet.
b. Remove the mezzanine card from the antistatic shipping bag and align it to the inside face of the
controller module.
c. Gently align the mezzanine card into place.
d. Using the #1 magnetic screwdriver, insert and tighten the screws on the face of the controller module
and on the mezzanine card.
Do not apply force when tightening the screw on the mezzanine card; you might crack it.
After you replace a component within the controller module, you must reinstall the controller module in the
system chassis and boot it.
78
Controller module cover
Thumbscrew
The controller module should be fully inserted and flush with the edges of the chassis.
f. Plug the power cords into the power supplies, reinstall the power cable locking collar, and then connect
the power supplies to the power source.
The controller module begins to boot as soon as power is restored. Be prepared to interrupt the boot
process.
Return the failed part to NetApp, as described in the RMA instructions shipped with the kit. See the Part Return
and Replacements page for further information.
To shut down the impaired controller, you must determine the status of the controller and, if necessary, take
over the controller so that the healthy controller continues to serve data from the impaired controller storage.
79
for the impaired controller SCSI blade. The cluster kernel-service show command (from priv
advanced mode) displays the node name, quorum status of that node, availability status of that node, and
operational status of that node.
Each SCSI-blade process should be in quorum with the other nodes in the cluster. Any issues must be
resolved before you proceed with the replacement.
• If you have a cluster with more than two nodes, it must be in quorum. If the cluster is not in quorum or a
healthy controller shows false for eligibility and health, you must correct the issue before shutting down the
impaired controller; see Synchronize a node with the cluster.
Steps
1. If AutoSupport is enabled, suppress automatic case creation by invoking an AutoSupport message:
system node autosupport invoke -node * -type all -message MAINT=<# of hours>h
The following AutoSupport message suppresses automatic case creation for two hours: cluster1:>
system node autosupport invoke -node * -type all -message MAINT=2h
2. Disable automatic giveback from the console of the healthy controller: storage failover modify
-node local -auto-giveback false
Waiting for giveback… Press Ctrl-C, and then respond y when prompted.
System prompt or password Take over or halt the impaired controller from the healthy controller:
prompt storage failover takeover -ofnode
impaired_node_name
You must remove the controller module from the chassis when you replace a component inside the controller
module.
Make sure that you label the cables so that you know where they came from.
80
4. Insert your forefinger into the latching mechanism on either side of the controller module, press the lever
with your thumb, and gently pull the controller a few inches out of the chassis.
If you have difficulty removing the controller module, place your index fingers through the
finger holes from the inside (by crossing your arms).
Lever
Latching mechanism
5. Using both hands, grasp the controller module sides and gently pull it out of the chassis and set it on a flat,
stable surface.
6. Turn the thumbscrew on the front of the controller module anti-clockwise and open the controller module
cover.
81
Thumbscrew
To replace the NVMEM battery, you must remove the failed battery from the controller module and install the
replacement battery into the controller module.
Use the following video or the tabulated steps to replace the NVMEM battery:
1. Locate and replace the impaired NVMEM battery on your controller module.
It is recommended that you follow the illustrated instructions in the order listed.
82
Squeeze the clip on the face of the battery plug.
2. Locate the battery plug and squeeze the clip on the face of the battery plug to release the plug from the
socket.
3. Grasp the battery and press the blue locking tab marked PUSH, and then lift the battery out of the holder
and controller module and set it aside.
4. Remove the replacement NV battery from the antistatic shipping bag and align it to the battery holder.
5. Insert the replacement NV battery plug into the socket.
6. Slide the battery pack down along the sheet metal side wall until the support tabs on the side wall hook into
the slots on the battery pack, and the battery pack latch engages and clicks into the opening on the side
wall.
7. Press firmly down on the battery pack to make sure that it is locked into place.
83
Step 4: Install the controller module
After you have replaced the component in the controller module, you must reinstall the controller module into
the chassis, and then boot it.
You can use the following illustration or the written steps to install the replacement controller module in the
chassis.
Thumbscrew
84
The controller module should be fully inserted and flush with the edges of the chassis.
f. Plug the power cords into the power supplies, reinstall the power cable locking collar, and then connect
the power supplies to the power source.
The controller module begins to boot as soon as power is restored. Be prepared to interrupt the boot
process.
Return the failed part to NetApp, as described in the RMA instructions shipped with the kit. See the Part Return
and Replacements page for further information.
It is a best practice to replace the PSU within two minutes of removing it from the chassis.
The system continues to function, but ONTAP sends messages to the console about the
degraded PSU until the PSU is replaced.
Do not mix PSUs with different efficiency ratings. Always replace like for like.
85
Option 1: Replace an AC PSU
Use the following video or the tabulated steps to replace the PSU:
The PSU is short. Always use two hands to support it when removing it from the
controller module so that it does not suddenly swing free from the controller module
and injure you.
Power supply
86
b. Gently push the PSU into the controller module until the locking tab clicks into place.
The power supplies will only properly engage with the internal connector and lock in place one
way.
To avoid damaging the internal connector, do not use excessive force when sliding
the PSU into the system.
7. Return the failed part to NetApp, as described in the RMA instructions shipped with the kit. See the
Part Return and Replacements page for further information.
The PSU is short. Always use two hands to support it when removing it from the
controller module so that it does not suddenly swing free from the controller module
and injure you.
Thumb screws
87
Power supply handle
The power supplies will only properly engage with the internal connector and lock in place one
way.
To avoid damaging the internal connector, do not use excessive force when sliding
the PSU into the system.
Once power is restored to the PSU, the status LED should be green.
7. Return the failed part to NetApp, as described in the RMA instructions shipped with the kit. See the
Part Return and Replacements page for further information.
To shut down the impaired controller, you must determine the status of the controller and, if necessary, take
over the controller so that the healthy controller continues to serve data from the impaired controller storage.
Each SCSI-blade process should be in quorum with the other nodes in the cluster. Any issues must be
resolved before you proceed with the replacement.
88
• If you have a cluster with more than two nodes, it must be in quorum. If the cluster is not in quorum or a
healthy controller shows false for eligibility and health, you must correct the issue before shutting down the
impaired controller; see Synchronize a node with the cluster.
Steps
1. If AutoSupport is enabled, suppress automatic case creation by invoking an AutoSupport message:
system node autosupport invoke -node * -type all -message MAINT=<# of hours>h
The following AutoSupport message suppresses automatic case creation for two hours: cluster1:>
system node autosupport invoke -node * -type all -message MAINT=2h
2. Disable automatic giveback from the console of the healthy controller: storage failover modify
-node local -auto-giveback false
Waiting for giveback… Press Ctrl-C, and then respond y when prompted.
System prompt or password Take over or halt the impaired controller from the healthy controller:
prompt storage failover takeover -ofnode
impaired_node_name
You must remove the controller module from the chassis when you replace a component inside the controller
module.
Make sure that you label the cables so that you know where they came from.
If you have difficulty removing the controller module, place your index fingers through the
finger holes from the inside (by crossing your arms).
89
Lever
Latching mechanism
5. Using both hands, grasp the controller module sides and gently pull it out of the chassis and set it on a flat,
stable surface.
6. Turn the thumbscrew on the front of the controller module anti-clockwise and open the controller module
cover.
90
Thumbscrew
To replace the RTC battery, locate it inside the controller and follow the specific sequence of steps.
Use the following video or the tabulated steps to replace the RTC battery:
1. Locate the RTC battery between the heatsink and the midplane and remove it exactly as shown in the
graphic.
91
Gently pull tab away from the battery housing. Attention: Pulling it away
aggressively might displace the tab.
Lift the battery up. Note: Make a note of the polarity of the battery.
92
With positive polarity face up, slide the battery under
the tab of the battery housing.
4. Visually inspect the battery to make sure that it is completely installed into the holder and that the polarity is
correct.
Step 4: Reinstall the controller module and setting time/date after RTC battery replacement
After you replace a component within the controller module, you must reinstall the controller module in the
system chassis, reset the time and date on the controller, and then boot it.
1. If you have not already done so, close the air duct or controller module cover.
2. Align the end of the controller module with the opening in the chassis, and then gently push the controller
module halfway into the system.
Do not completely insert the controller module in the chassis until instructed to do so.
If you removed the media converters (QSFPs or SFPs), remember to reinstall them if you are using fiber
93
optic cables.
The controller module begins to boot as soon as power is restored. Be prepared to interrupt the boot
process.
Return the failed part to NetApp, as described in the RMA instructions shipped with the kit. See the Part Return
and Replacements page for further information.
94
Copyright information
Copyright © 2025 NetApp, Inc. All Rights Reserved. Printed in the U.S. No part of this document covered by
copyright may be reproduced in any form or by any means—graphic, electronic, or mechanical, including
photocopying, recording, taping, or storage in an electronic retrieval system—without prior written permission
of the copyright owner.
Software derived from copyrighted NetApp material is subject to the following license and disclaimer:
THIS SOFTWARE IS PROVIDED BY NETAPP “AS IS” AND WITHOUT ANY EXPRESS OR IMPLIED
WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY
AND FITNESS FOR A PARTICULAR PURPOSE, WHICH ARE HEREBY DISCLAIMED. IN NO EVENT SHALL
NETAPP BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR
CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE
GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
(INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
NetApp reserves the right to change any products described herein at any time, and without notice. NetApp
assumes no responsibility or liability arising from the use of products described herein, except as expressly
agreed to in writing by NetApp. The use or purchase of this product does not convey a license under any
patent rights, trademark rights, or any other intellectual property rights of NetApp.
The product described in this manual may be protected by one or more U.S. patents, foreign patents, or
pending applications.
LIMITED RIGHTS LEGEND: Use, duplication, or disclosure by the government is subject to restrictions as set
forth in subparagraph (b)(3) of the Rights in Technical Data -Noncommercial Items at DFARS 252.227-7013
(FEB 2014) and FAR 52.227-19 (DEC 2007).
Data contained herein pertains to a commercial product and/or commercial service (as defined in FAR 2.101)
and is proprietary to NetApp, Inc. All NetApp technical data and computer software provided under this
Agreement is commercial in nature and developed solely at private expense. The U.S. Government has a non-
exclusive, non-transferrable, nonsublicensable, worldwide, limited irrevocable license to use the Data only in
connection with and in support of the U.S. Government contract under which the Data was delivered. Except
as provided herein, the Data may not be used, disclosed, reproduced, modified, performed, or displayed
without the prior written approval of NetApp, Inc. United States Government license rights for the Department
of Defense are limited to those rights identified in DFARS clause 252.227-7015(b) (FEB 2014).
Trademark information
NETAPP, the NETAPP logo, and the marks listed at http://www.netapp.com/TM are trademarks of NetApp, Inc.
Other company and product names may be trademarks of their respective owners.
95