You are on page 1of 27

Power Supply Error Diagnosis

Power Supply RMA Reduction Team, NetApp


January, 2011
1 POWER SUPPLY ERROR DIAGNOSIS .................................................... 4
1.1 ............................................................................................ NETAPP POWER SUPPLY MESSAGES ..... 4

1.2 ..............................................................................................................................BEST PRACTICES ..... 4

1.3 ...................................................................................................... CHECK ENVIRONMENT STATUS ..... 4

1.3.1 ............................................................................................... SHELF ENVIRONMENTAL STATUS ..... 5

1.3.2 ........................................................................................... CHASSIS ENVIRONMENTAL STATUS ..... 5

1.4 .......................................................................................................................SHELF VS PLATFORM ..... 5

1.5 .................................................................................................................... PART NUMBER MATRIX ..... 6

2 SHELF SPECIFIC DIAGNOSIS INFORMATION ....................................... 8


2.1 ............................................................ DS14MK2 AND DS14MK1 SHELF PS LED CONFIRMATION ..... 8

2.2 .....................................................................................................................SHLFLT: SHELF FAULT ..... 8

2.3 ...............................................................................................SHPWRN: SHELF POWER WARNING ..... 9

2.4 ........................................................................................ SHCUFL: SHELF COOLING UNIT FAILED ..... 9

2.5 ............................................................................................ SHPINT: SHELF POWER INTERUPTED ... 10

3 PLATFORM SPECIFIC DIAGNOSIS INFORMATION ............................. 11


3.1 ......................................................................................................................... FAS6000 PLATFORM ... 11

3.1.1 ............................................. FAS6000 POWER SUPPLIES CAN LATCH FAILED: BURT 249588. ... 11

3.1.2BURT 222687: RTC BATTERY VOLTAGE WARNING ON FAS60XX INCLUDING CHASSIS POWER
DEGRADED ......................................................................................................................................... 12

3.2 ............................................................................................ FAS3040/3070 PLATFORM ... 12

3.3 ................................................................................................... FAS3100 PLATFORM ... 12

3.3.1 .......................... BURT 416892: FALSE ERROR MESSAGES FOR PSU TEMP AND FAN SPEED ... 12

3.3.2 ................................................. FAS3100 POWER SUPPLY DOES NOT CONTAIN SYSTEM FAN ... 13

3.3.3 ..................................................... AC INPUT POWER GENERATING PS DEGRADED MESSAGE ... 14

3.4 ................................................................................................... FAS2000 PLATFORM ... 14

3.4.1FAS2000 CANNOT DISTINGUISH BETWEEN AC INPUT FAILURE AND DEAD POWER SUPPLY.. 14

3.4.2 ...................................................FAS2000 BOOT UP BOGUS ERROR BURT 338125 O R 228125 ... 16

3.5 ................................................................................................... FAS3000 PLATFORM ... 16

3.5.1 .............................................................. POWER SUPPLY LED CONTROLLED BY DATA ONTAP ... 16

3.5.2 ....................................................................................SENSOR PSU 1 TEMP NOT RESPONDING ... 17

3.5.3FAS30X0 AND V30X0 PSU ERRONEOUS MESSAGES AND/OR PANICS: DATA ONTAP SOLUTIONS
............................................................................................................................................................. 17

3.5.4 ................ FAS3020, FAS3050, V3020, AND V3050 “POWER SUPPLY MISSING” AND “DEVICE
COMMUNICATIONS TIMEOUT” ERRONEOUS ERROR MESSAGES ............................................... 18

4 EXISTING POWER SUPPLY KB ARTICLES:.......................................... 20


4.1KB ARTICLE 2013354 FORMER KB: 54444: AUTOSUPPORT MESSAGE: CHASSIS POWER SUPPLY
DEGRADED ......................................................................................................................................... 20

4.2 ...KB ARTICLE 2013210 FORMER KB: 2262: INTERMITTENT SHELF POWER SUPPLY ALARM ... 20

2 Power Supply Error Diagnosis


4.3 ...................................................................................... KB 2013003 FORMER KB: 25584 ... 20

5 OTHER RESOURCES ............................................................................. 20


5.1 ...................................................HARDWARE TROUBLESHOOTING GUIDE ON NOW SITE ... 20

6 APPENDIX A – LEDS .............................................................................. 21


6.1 ..................................................................................................... FAS6000 PSU LEDS ... 21

6.2 ................................................................................... FAS2000 POWER SUPPLY LEDS ... 22

6.3 ................................................................... FAS3040 & FAS3070 POWER SUPPLY LEDS ... 22

6.4 ..................................................................... FAS3140/3160/3170 POWER SUPPLY LEDS ... 24

6.5 ............................................................................................................FAS3000 LEDS ... 26

3 Power Supply Error Diagnosis


1 POWER SUPPLY ERROR DIAGNOSIS

1.1 NETAPP POWER SUPPLY MESSAGES


The power supply error messages are not easily interpreted and issues need to
be carefully diagnosed to ensure proper resolution. This document is intended to
provide guidance in diagnosing power supply issues experienced on our systems
to determine appropriate resolution actions.
A special note for NetApp Power Supply Messages: Power Supply Degraded
does not necessarily mean a power supply has failed, although it can mean a
power supply has failed. Our systems and software do not always have the ability
to distinguish power supply failures from input power and other issues. Proper
Diagnosis requires investigation to distinguish a power supply failure from
another source of the error message or condition. Power supply status should
always be confirmed in diagnosis power supply issues.

1.2 BEST PRACTICES


Best Practices - Confirmation of power supply Failure
It is NetApp best practices to confirm power supply failures prior to delivering
replacement parts. There are many reasons the system power supplies do not
require replacement even though an error message is generated and a case
created.
 Contact the customer to inquire regarding input power issues.

 Confirm the power supply status using the power supply LEDs

 Ensure the AC input to the Power Supply is good.

 Reviewing recent ASUP message.

 Run the Diag application: NOTE: Requires system downtime - customer


environment must be considered – not required for many power supply
error messages diagnosis – only when appropriate.

1.3 CHECK ENVIRONMENT STATUS


It is NetApp best practices to confirm power supply failures prior to delivering
replacement parts. Checking the environment Status will help to confirm any
errors detected and reported by the system. System access is required but down
time is not needed as these are DOT commands.

4 Power Supply Error Diagnosis


1.3.1 Shelf Environmental Status
Check the environmental status to confirm error detection by the system. NOTE:
This command applies to filers such as FAS2000 which use a shelf enclosure for
the base.

Enter the command:

environment status

Results include Power Supply error detection:

Power Supply installed element list: 1, 2; with error: none

KB 2013351 Former kb54480 has an example output for the environment status
https://kb.netapp.com/support/index?page=content&id=2013351

1.3.2 Chassis Environmental Status


Check the environmental status to confirm error detection by the system. Enter
the command:

environment status chassis

Results include Power Supply error detection

Additional detail information is provided by the following command:

environment chassis list-sensors

Refer to KB article 2013353 former KB 54450:


https://kb.netapp.com/support/index?page=content&id=2013353

1.4 SHELF VS PLATFORM


Distinguishing Shelf vs. Chassis Power Supply messages
There can be confusion between the shelf and the platform power
supplies in diagnosing power supply error messages.

There are only 5 shelf power supply messages: All other power supply
messages are related to platform power supplies

SHCUFL SHELF COOLING UNIT FAILED


SHPINT SHELF POWER INTERRUPTED
SHPWRN SHELF POWER SUPPLY

5 Power Supply Error Diagnosis


WARNING
SHLFLT SHELF_FAULT
SHLFOT SHELF OVER TEMPERATURE

1.5 PART NUMBER MATRIX


Power Supply Part Numbers for the affected platforms
 The table indicates the part numbers for our current platforms.

 NetApp support can confirm current part number details and any updated
information.

 * There are older versions of X730 which are non-ROHS compliant which
can be used in Non-ROHS compliant regions, including USA.

 *** The X731-R5 is the universal input 110-240VAC with a 20 Amp fuse.
This model is used in all FAS6000 applications except in specific Europe
installations which require a high line 200-240VAC with a 10 amp fuse, the
X732- R5. System configurations should be consulted for Europe systems
to determine if there is a high line requirement for the particular customer.

 Further details regarding the high line requirements can be provided by


NetApp support.

Platform Power Supply Alternate Version


FAS2020 X513A-R5
FAS2050 X515A-R5
FAS2040 X513A-R5
FAS3020 X730-R5 X730*
FAS3050 X730-R5 X730*
FAS3040 X730-R5 X730*
FAS3070 X730-R5 X730*
FAS3140 X758-R5
FAS3160 X758-R5
FAS3170 X758-R5
FAS6030 X731-R5*** X732-R5***
FAS6070 X731-R5*** X732-R5***
FAS6040 X731-R5*** X732-R5***
FAS6080 X731-R5*** X732-R5***

6 Power Supply Error Diagnosis


FAS3210 X758-R5
FAS3240 X758-R5
FAS3270 X758-R5
FAS6210 X763-R6
FAS6240 X763-R6
FAS6280 X763-R6

7 Power Supply Error Diagnosis


2

2 SHELF SPECIFIC DIAGNOSIS INFORMATION

2.1 DS14MK2 AND DS14MK1 SHELF PS LED CONFIRMATION


Synopsis
For Shelf Faults the best source for diagnosis information are the LEDs. Confirm
the status of the shelf components using the LEDs.

Error Messages
Various Faults reported on disk storage shelf.

Field Response
Refer to KB article 1012073 (Former KB 50783) for LED identification and
interpretation.
https://kb.netapp.com/support/index?page=content&actp=LIST&id=1012073

2.2 SHLFLT: SHELF FAULT


Synopsis
This AutoSupport message is triggered whenever there is a fault condition
reported on a storage shelf. Common causes are both hardware problems on the
shelf such as a controller, fan, cable connection, power supply, or temperature
sensor condition, or Loop and controller issues. The affected shelf is called out in
the error message.

Error Messages

Field Response
For more information on SHELF_FAULT errors, please refer to the
KnowledgeBase article
https://kb.netapp.com/support/index?page=content&id=2012955

For temperature issues, investigate the ambient temperature status and ensure
proper system installation and airflow. Please refer to the Site Requirements
Guide for specifics on your storage controller.

8 Power Supply Error Diagnosis


http://now.netapp.com/NOW/public/knowledge/docs/hardware/NetApp/site/pdf/sit
e.pdf
Refer to page 74 for DS14 series disk shelves or page 83 for DS4243 series disk
shelves.

2.3 SHPWRN: SHELF POWER WARNING


Synopsis
This AutoSupport email indicates that a shelf power supply has reported a loss of
DC voltage output. This is a “DC under voltage” condition. It is important to note
that this event is transient in nature. If the outage lasts more than 20 seconds a
SHPINT message is generated.

Error Messages
SHPWRN

Field Response
This SHPWRN scenario can occur for a number of reasons:

The power supply has been turned off or unplugged.

The power to the server room either failed or fluctuated.

A brown out situation has occurred. In Japan, running voltage at 90 volts can
more easily lead to DC under voltage due to no margin for fluctuations.

Only in rare occasions the shelf power supply has malfunctioned and requires
replacement.

For more detailed information on this error, including instructions on how to


determine and verify the condition of the power supply please refer to the
KnowledgeBase article

https://kb.netapp.com/support/index?page=content&id=2012128

2.4 SHCUFL: SHELF COOLING UNIT FAILED


Synopsis
This AutoSupport email indicates that a shelf cooling unit (power supply fan) has
failed; this could be a hardware problem. This message can be triggered by
faulty hardware on the shelf such as a power supply fan. This can also be the
result of a signal issue. Diagnosis of the shelf should be conducted including
confirming component status.

9 Power Supply Error Diagnosis


Error Messages
SHCUFL

Field Response
For more information on Shelf Cooling Unit Fail Warnings, please refer to the
KnowledgeBase article
https://kb.netapp.com/support/index?page=content&id=2011427

2.5 SHPINT: SHELF POWER INTERUPTED


Synopsis
This AutoSupport email indicates that a shelf power supply has reported either a
sustained loss of DC voltage output for a period of about 20 seconds or a failure
condition.

The system will report a PWRFLT power fault warning specifically: SHPINT -
shelf power interrupted. This is a clear indication the power to the shelf has been
lost at least temporarily.

Error Messages
PWRFLT:Cluster Notification from afspf120 (SHELF POWER INTERRUPTED) WARNING

Field Response
The power supply has been turned off or unplugged.

The input power to the system either failed or fluctuated. Check the AC input in
the server room.

Monitor the LED status

If the shelf power supply has malfunctioned there are accompanying messages
indicating a problem.

For more information on shelf power interrupted warnings, please refer to the
KnowledgeBase article

https://kb.netapp.com/support/index?page=content&id=2013351

10 Power Supply Error Diagnosis


3 PLATFORM SPECIFIC DIAGNOSIS INFORMATION

3.1 FAS6000 PLATFORM

3.1.1 FAS6000 Power Supplies can latch failed: BURT 249588.


Synopsis
When there is a very short glitch in the external power supplied to the PSUs, the
PSU sensors are left at power-on-default state and any reading from those
sensors result in 0 values (power on default values).

This results in Data ONTAP issuing false warning messages about these
sensors. A power cycle will reset the unit

Error Messages
Every hour message appears regarding degraded power supply, although filer is functioning
properly and both power supplies are receiving power.

Wed May 30 14:00:18 GMT [st-fs06a: monitor.chassisPowerSupply.degraded:notice]: Chassis


power supply 1 is degraded: PSU 1 12V warning low: 0mV PSU 1 5V warning low: 0mV
Wed May 30 14:00:18 GMT [st-fs06a: monitor.chassisPowerSupply.degraded:notice]: Chassis
power supply 2 is degraded: PSU 2 12V warning low: 0mV PSU 2 5V warning low: 0mV

===== ENVIRONMENT =====


Voltage ok
PSU 1: Sensor PSU 1 12V warning low: current voltage is 0 mV, normal low is 10800 mV
PSU 1: Sensor PSU 1 5V warning low: current voltage is 0 mV, normal low is 4500 mV
PSU 2: Sensor PSU 2 12V warning low: current voltage is 0 mV, normal low is 10800 mV
PSU 2: Sensor PSU 2 5V warning low: current voltage is 0 mV, normal low is 4500 mV
Temperature ok
System_Fan ok
CF_status ok

Field Response

Power cycle the PSUs by either using the rocker switch on the PSU (one by one)
or by using the Remote LAN Module (RLM) NOTE: Using the RLM is disruptive
as both PSUs are powered off both PSUs simultaneously

This issue has been fixed in DOT version 7.2.4 and subsequent releases,
including the 7.3 series DOT.

11 Power Supply Error Diagnosis


3.1.2 BURT 222687: RTC Battery Voltage Warning on FAS60xx
including Chassis Power Degraded

Synopsis
Since the real time clock (RTC) lithium manganese battery has a normal service
Life of eight (8) years, a low battery warning does not require a battery
Replacement, there is an issue with the motherboard.

Error Messages
Tue Oct 10 18:00:39 GMT [svtl-6070-1: monitor.chassisPower.degraded:notice]: Chassis power
is degraded: RTC Battery is in WARN LOW state current voltage is 2445 mV, normal low is 2800
mV

Field Response
If you observe a low RTC console message this is an indication that you need to
have your motherboard replaced. The power supply does not need to be
replaced.

BUG 222687

3.2 FAS3040/3070 PLATFORM


This product uses the FAS3020/FAS3050 power supplies. Please refer to the
FAS3020/FAS3050 section for issues associated with the FAS3040/3070 platform.

3.3 FAS3100 PLATFORM

3.3.1 BURT 416892: False Error messages for PSU Temp and Fan
speed
Synopsis
On FAS3100 an error message about PSU Temperature and Fan speed is seen
as shown in the example below. In this case the PSU may not be faulty.
Error Messages
monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded:
PSU1_Temperature failed: 0C PSU1_FAN1 failed: 0RPM PSU1_FAN2 failed: 0RPM
[mgl-psnetapp01: callhome.chassis.ps.degraded:error]: Call home for CHASSIS POWER
SUPPLY DEGRADED: PS 1
[mgl-psnetapp01: callhome.chassis.ps.fault:error]: Call home for CHASSIS POWER SUPPLY
FAIL: PS 1

12 Power Supply Error Diagnosis


[mgl-psnetapp01: monitor.chassisPowerSupply.failed:CRITICAL]: Chassis power supply 1 failed
[mgl-psnetapp01: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is
degraded: PSU2_Temperature failed: 0C PSU2_FAN1 failed: 0RPM PSU2_FAN2 failed: 0RPM
[mgl-psnetapp01: callhome.chassis.ps.degraded:error]: Call home for CHASSIS POWER
SUPPLY DEGRADED: PS 2
[mgl-psnetapp01: callhome.chassis.ps.fault:error]: Call home for CHASSIS POWER SUPPLY
FAIL: PS 2
[mgl-psnetapp01: monitor.chassisPowerSupply.failed:CRITICAL]: Chassis power supply 2 failed
[mgl-psnetapp01: monitor.chassisPower.degraded:notice]: Chassis power is degraded: sensor
PSU1_Temperature
[mgl-psnetapp01: callhome.chassis.power:error]: Call home for CHASSIS POWER DEGRADED:
sensor PSU1_Temperature
[mgl-psnetapp01: monitor.globalStatus.critical:CRITICAL]: sensor PSU1_Temperature

Field Response
Engineering is working on root cause analysis for an issue which affects the
reporting of PSU sensors on one controller in a FAS31X0 C-C configuration.

In the meantime, if only one controller in the C-C configuration is reporting


sensor failures on the PSUs, and the other controller is reporting normal
readings for both PSUs sensors, then the PSUs are both healthy and operating
normally.

If both controllers in a C-C configuration are reporting sensor failures for one of
the PSUs, then that PSU does need to be replaced. It is suggested re-seating the
PSU may clear the fault.
Do not replace the power supply, there is no defect. If reseating does not clear
the situation, ignore the failure message.
BURT 416892 No Public Report at this time

3.3.2 FAS3100 Power Supply does not contain system fan


Synopsis
FAS3100 PSU Fans are internal to the PSU and are meant to cool only the
power supply and not the system. The system fans are a separate FRU and
replacing the power supply will not resolve a fan issue.
Error Messages
CET [netapp1: monitor.chassisFanFail.xMinShutdown:EMERGENCY]: Multiple Chassis Fan
failure: System will shut down in 2 minutes.
Field Response
The FAS3100 product has a separate Fan module for the system cooling and
PSUs should not be replaced for fan related errors.

13 Power Supply Error Diagnosis


3.3.3 AC Input power generating PS degraded message
Synopsis
A loss of AC input to the power supply due to no AC cord (power cord)
connected to the PSU input or AC input switch being in OFF position will
generate an error message as shown below. PSU Degraded Message here
doesn’t mean that PSU is BAD.
Error Messages
EST [THLFS01: asup.smtp.sent:notice]: Cluster Notification mail sent: Cluster
Notification from THLFS01 (CHASSIS POWER DEGRADED: sensor
PSU2_Power_Input) WARNING
Field Response
Check AC cord and make sure it is connected to Power Supply firmly. Also make
sure that the voltage at AC cord is within PSU input range. Make sure that the
PSU is seated and PSU input Switch is ON.

3.4 FAS2000 PLATFORM

3.4.1 FAS2000 cannot distinguish between AC input failure and dead


power supply
Synopsis
FAS2000 Power Supplies do not detect AC INPUT power. Manual determination
of adequate input power is required prior to shipping replacement parts.
The FAS2020 & FAS2050 systems will automatically generate an ASUP and
subsequently a case will be created if one of the system’s PSU AC-input is not
present. This will happen if one of the AC input switch is turned off, or the AC is
somehow removed at the facility. If power is not available to both filers,
well….there won’t be anything coming out of the filer.
Error Messages
Thu Aug 13 17:37:02 GMT [ses.status.psWarning: warning]: DS12-ESAS shelf 0 on channel 0c
power warning for Power supply 1: non-critical status; DC undervoltage fault. This module is on
the rear side of the shelf, at the left.

Thu Aug 13 17:37:02 GMT [ses.status.volError:CRITICAL]: DS12-ESAS shelf 0 on channel 0c


voltage error for Voltage sensor 1: critical status. This module is on the rear side of the shelf, at
the left, on the power supply.

Thu Aug 13 17:37:02 GMT [ses.status.volError:CRITICAL]: DS12-ESAS shelf 0 on channel 0c


voltage error for Voltage sensor 2: critical status. This module is on the rear side of the shelf, at
the left, on the power supply.

14 Power Supply Error Diagnosis


Thu Aug 13 17:37:02 GMT [ses.status.volError:CRITICAL]: DS12-ESAS shelf 0 on channel 0c
voltage error for Voltage sensor 3: critical status. This module is on the rear side of the shelf, at
the left, on the power supply.

Thu Aug 13 17:37:02 GMT [ses.status.currentError:CRITICAL]: DS12-ESAS shelf 0 on channel


0c current error for Current sensor 1: critical status. This module is on the rear side of the shelf, at
the left, on the power supply.

Thu Aug 13 17:37:02 GMT [ses.status.currentError:CRITICAL]: DS12-ESAS shelf 0 on channel


0c current error for Current sensor 2: critical status. This module is on the rear side of the shelf, at
the left, on the power supply.

Thu Aug 13 17:37:02 GMT [monitor.shelf.fault:CRITICAL]: Fault reported on disk storage shelf
attached to channel 0c. Please check fans, power supplies, disks, and temperature sensors.

Thu Aug 13 17:37:06 GMT [monitor.chassisPower.degraded:notice]: Chassis power is degraded:


PSU1 12V is failed. PSU1 5V is failed. PSU1 3V is failed. PSU1 12 Amp is failed. PSU1 5 Amp
is failed.

Thu Aug 13 17:37:25 GMT [ses.status.psError:CRITICAL]: DS12-ESAS shelf 0 on channel 0c


power error for Power supply 1: critical status; overtemperature fault. This module is on the rear
side of the shelf, at the left.

Thu Aug 13 17:38:00 GMT [monitor.globalStatus.critical: CRITICAL]: PSU1 is failed. PSU1 12V is
failed. PSU1 5V is failed. PSU1 3V is failed. PSU1 12 Amp is failed. PSU1 5 Amp is failed. Disk
shelf fault.

The environment command shows the status of the PSU1(2) as

===== ENVIRONMENT =====


Voltage ok
Temperature ok
Power Supplies Out Voltage ok
Power Supply 1: Sensor PSU1 12V failed
Power Supply 1: Sensor PSU1 5V failed
Power Supply 1: Sensor PSU1 3V failed
Power Supply 1: Sensor PSU1 12 Amp failed
Power Supply 1: Sensor PSU1 5 Amp failed
Power Supply 2 ok

There may be variations to the message, possibly including “over-temp” failure.


An “over-temp” condition does not exist although this additional failure mode
statement may also be included.

Field Response
Contact NetApp support for

15 Power Supply Error Diagnosis


3.4.2 FAS2000 BOOT UP bogus Error BURT 338125 o r 228125
Synopsis
On FAS2020 and FAS2050 configurations, a shelf fault critical error message as
shown in the example below is seen during boot up: A short time after, the error
is corrected and a message as shown in the example below is displayed:
Error Messages
"Mon Jun 25 01:07:15 GMT [monitor.shelf.fault:CRITICAL]: Fault reported on disk storage shelf
attached to channel 0c. Please check fans, power supplies, disks, and temperature sensors"
"Mon Jun 25 01:07:25 GMT [monitor.shelf.fault.ok:info]: Fault previously reported on disk
storage shelf attached to channel 0c has been corrected."
Field Response
The fault message does not indicate a real fault when it occurs during boot up.
This message should be ignored. There are no defective components.
The problem is fixed on IC.2 and recommend to upgrade DOT to greater than
7.3.2 as soon as possible.
BUG 338125

3.5 FAS3000 PLATFORM

3.5.1 Power Supply LED controlled by Data ONTAP


Synopsis
On a FAS3000 series, an amber LED on the Power Supply does NOT
necessarily mean that the PSU is bad. Data ONTAP controls one of the LEDs on
the power supply for the FAS3000 products. This can cause confusion as the
status of the system can change the LEDs on the power supplies.
Error Messages

Hot swapping the power supply can generate the following message:

MFANFL:System Notification from (MULTIPLE FAN FAILURE) CRITICAL


[monitor.chassisPowerSupply.notPresent:notice]: Chassis power supply # not present
[monitor.chassisPowerSupplies.failed:CRITICAL]: Multiple chassis power supplies failed: 2 PSU
Fans are not functional. System will be shutdown in approximately 2 minutes unless this is
corrected.

Note:
This message will NOT appear in the messages log. It will ONLY be displayed
on the console.

16 Power Supply Error Diagnosis


Field Response
Follow best practices and confirm the power supply status. To confirm the Power
Supply status, look at the environment data and the messages log on the filer
and use the LED guide in appendix “A” prior to replacing the Chassis Power
Supply.
FAS3000 power supply degraded messages: "(MULTIPLE FAN FAILURE)
CRITICAL
https://kb.netapp.com/support/index?page=content&id=2010101

3.5.2 Sensor PSU 1 Temp not responding


Synopsis
You may observe a repeating hourly "Sensor PSU 1 Temp not responding"
console message on your FAS3020/3050 System Controller.

Error Messages
Thu Oct 11 11:00:03 CEST [de-fraapp972: monitor.chassisSensor.NR:warning]: Sensor PSU 1
Temp not responding.
Thu Oct 11 11:03:57 CEST [de-fraapp972: monitor.chassisPowerSupply.degraded:notice]:
Chassis power supply 1 is degraded: PSU 1 AC Failed
Field Response
Fixed in DOT 7.2.5 and subsequent releases. Encourage customer to upgrade
Data ONTAP to at least 7.2.5.

If there are no other messages related to the affected PSU, the PSU is working
properly and no hardware replacement is required.

To clear this periodic console message, physically turn off the affected PSU
using the PSU rocker switch. Physically remove the PSU from the chassis and
reseat within 2 minutes and turn on the PSU using the rocker switch. This action
clears the problem, though the problem may reoccur.

BUG 262075

3.5.3 FAS30x0 and V30x0 PSU Erroneous Messages and/or Panics:


Data ONTAP Solutions
Synopsis
FAS3020, FAS3050, V3020, and V3050 systems running 7.1 series and 7.2 series
prior to 7.2.2 could experience erroneous error messages. The conditions that lead

17 Power Supply Error Diagnosis


to the polling protocol problem are complicated and are not associated with a
specific application or environment.

Error Messages
There are 2 different error message produced by the system depending on the version of Data
OnTap:

Mon Mar 12 22:13:20 GMT [sts-svl01: driver.ich3smb.kill:CRITICAL]: A device communication


timeout occurred on the appliance motherboard with the PS 2 temperature sensor device at address
0x39C

Tue Jan 16 22:23:11 CST [monitor.chassisPowerSupply.notPresent:notice]: Chassis power supply 1


not present*

*Note: Power supply 2 often reports the same message: sometimes in addition to the power supply 1
message, and sometimes independently.

Field Response
The system will continue to operate without issue as these are warning messages.

Upgrade to DOT greater than 7.2.2.

3.5.4 FAS3020, FAS3050, V3020, and V3050 “Power Supply Missing”


and “Device Communications Timeout” erroneous error
messages
Synopsis
FAS3020, FAS3050, V3020, and V3050 systems running 7.1 series and 7.2 series
prior to 7.2.2 could experience erroneous error messages. The conditions that lead
to the polling protocol problem are complicated and are not associated with a
specific application or environment.

Error Messages
There are 2 different error message produced by the system depending on the version of Data
OnTap:

Mon Mar 12 22:13:20 GMT [sts-svl01: driver.ich3smb.kill:CRITICAL]: A device communication


timeout occurred on the appliance motherboard with the PS 2 temperature sensor device at address
0x39C

Tue Jan 16 22:23:11 CST [monitor.chassisPowerSupply.notPresent:notice]: Chassis power supply 1


not present*

*Note: Power supply 2 often reports the same message: sometimes in addition to the power supply 1
message, and sometimes independently.

Field Response

18 Power Supply Error Diagnosis


Encourage customer to upgrade to 7.2.2 or greater Data ONTAP
Ignore the message, the filer and the power supplies are operating correctly, it is
in the reporting of the parameters that the problem resides. These messages are
erroneous as the power supplies are operating properly and the timeout is
cleared by the filer. Do not replace the power supply or the motherboard.

19 Power Supply Error Diagnosis


4 EXISTING POWER SUPPLY KB ARTICLES:

4.1 KB ARTICLE 2013354 FORMER KB: 54444: AUTOSUPPORT MESSAGE: CHASSIS


POWER SUPPLY DEGRADED
This article was written a long time ago and addressed the same issue identified
in this document for the FAS3000 product BURT 262705
https://kb.netapp.com/support/index?page=content&id=2013354

4.2 KB ARTICLE 2013210 FORMER KB: 2262: INTERMITTENT SHELF POWER SUPPLY
ALARM
This article was written in support of a very specific issue at one customer and
has not been identified in any other instance. Also reference BURT 164900.
https://kb.netapp.com/support/index?page=content&id=2013210

4.3 KB 2013003 FORMER KB: 25584


https://kb.netapp.com/support/index?page=content&id=2013003
Combines RTC FAS6000 with check environment status

5 OTHER RESOURCES

5.1 HARDWARE TROUBLESHOOTING GUIDE ON NOW SITE


General information source, the Power Supply information is not as detailed and
accurate as this document.
http://now.netapp.com/NOW/knowledge/docs/hardware/NetApp/errormsg.pdf

20 Power Supply Error Diagnosis


6 APPENDIX A – LEDS

6.1 FAS6000 PSU LEDS


Location of the PSU LEDs
The following illustrations show the location of the PSU LEDs on the system
backplane:

Meaning of the PSU LEDs


The following table describes the behavior of the PSU LEDs.

Amber (indicates Green (indicates


AC input) DC output) Description

On On The AC power source is good and is


powering the system.

On Off There is AC power present, but the


PSU is not operational.

On Blinking There is AC power present but the


PSU is not enabled.

Off Off There is insufficient power to the


system.

21 Power Supply Error Diagnosis


6.2 FAS2000 POWER SUPPLY LEDS
Location of the LEDs
The following illustration shows the location of the power supply LEDs, which are
visible from the back of the system.
Note
The following illustration shows a FAS2050 or SA200 power supply. The LEDs
on FAS2020 and FAS2040 systems are in a different position but are functionally
identical.

What the LEDs mean


The following table explains what the LEDs on the power supply mean.

LED LED
name color Description

Ready Green The power supply is functioning correctly.

Fault Amber The power supply is not functioning properly and needs
service. See the system console for any applicable error
messages.

6.3 FAS3040 & FAS3070 POWER SUPPLY LEDS


Location of the LEDs
The following illustration shows the location of the power supply LEDs on your
system backplane.

22 Power Supply Error Diagnosis


What the LEDs mean
The following table explains what the power-supply LEDs mean.

LED Status
label indicator Description

AC Amber No fault is indicated.

OK Green

AC Off There is no external power; check the connections and


the power source.
OK Off

AC Amber The system displays the LOADER> prompt because it has


not booted Data ONTAP.
OK Off

AC Flashing There is a power supply fault; replace the power supply.


amber

OK Amber

23 Power Supply Error Diagnosis


6.4 FAS3140/3160/3170 POWER SUPPLY LEDS
Location of the LEDs
The following illustration shows the location of the AC PSU or DC PSU power
supply LEDs on the rear of the system.

What the LEDs mean


The following table describes the behavior of the AC PSU and DC PSU LEDs.

PSU Power LED Fault LED


type PSU condition status status

AC PSU is present and switched on. Normal Green Off


mode.
-48VDC

AC PSU is missing or switched off. The other Off Off


PSU is off or functioning normally.
-48VDC

AC PSU Fault: AC in or -48VDC is out of Off Blinking


range, or there is a DC fault or fan fault. amber
-48VDC

DS14MK2 Power Supply Monitoring


LEDs on the power supply

24 Power Supply Error Diagnosis


The power supply has four LEDs. The LEDs indicate whether the power supply or the integrated
fan module is functioning normally or there are problems with the hardware. You can also identify
any hardware failure associated with the power supplies from the error messages displayed on
your storage system console.
Location of LEDs
Each power supply, which contains two LEDs, is encased in a device carrier and housed at the
rear of the disk shelf. The following illustration shows the location of the power supply LEDs.

Note
The PSU status LED is green. The other three LEDs are amber. See Interpreting disk drive LEDs
for an illustrated explanation of how the LEDs function.
Power supply console error messages
The following error messages appear on your storage system console if an SES element on the
power supply fails. For information about replacing the power supply, see Replacing a power
supply in a disk shelf.

Error
message Action required

Power The power supply unit on the left at the back of the disk shelf failed. Contact
supply technical support to replace the power supply.
Element 1:
failed

Power The power supply unit on the right at the back of the disk shelf failed. Contact
supply technical support to replace the power supply.
Element 2:

25 Power Supply Error Diagnosis


failed

Cooling The integrated fan module in the power supply unit on the left at the back of the
element disk shelf failed. Contact technical support to replace the power supply.
Element 1:
failed

Cooling The integrated fan module in the power supply unit on the right at the back of the
element disk shelf failed. Contact technical support to replace the power supply.
Element 2:
failed

6.5 FAS3000 LEDS


Location and meaning of 30xx, SA300, and C2300 and C3300 NetCache
PSU LEDs
You can check the LEDs on the PSUs to learn whether they are functioning normally.
The following illustration shows the location of the PSU LEDs on the back of the system.

26 Power Supply Error Diagnosis


PSU 1

PSU 2

PSU LEDs

The following table explains what the PSU LEDs mean.

LED label Status Description


indicator
AC Amber No fault is indicated.
OK or Status Green
AC Off There is no external power; check the connections
and the power source.
OK or Status Off
AC Amber (3020 and 3050 systems) CFE prompt.
(3040, 3070, and SA300 systems) The
OK or Status Off
system displays the LOADER> prompt
because it has not booted Data ONTAP.

AC Flashing amber There is a power supply fault; replace the power


supply.
OK or Status Amber

27 Power Supply Error Diagnosis