
Quick Maintenance Guide

CloudEngine 12800 Series Switches

Issue: 02 (2017-09-12)
Contents

Before You Start
How to Quickly Maintain the CE12800
Fault Information Collection and Feedback
Solution to Device Login Failure
Risky Operations
References

Before You Start

Before you take over the maintenance of the switch, you are advised to:

1 Obtain the network's topology diagram and data plans (including ports, VLANs, and IP addresses),
print them, and post them in your equipment room for quick reference.

2 Print the following contact information and post it near your workplace.
• Enterprise network customer service telephone number (Global Service Hotline) or carrier network
customer service telephone number (Global TAC Information).
• Contact details of the agent responsible for constructing your network and providing service.

3 Prepare the tools and cables that you may use during device maintenance.

No.  Item                    Description

1    Cables                  • An RS232 serial cable: used to log in to the device through the console port.
                             • Serial-to-USB converter: used to connect the USB port of the maintenance
                               terminal to the console port of the switch.
                             • Two straight-through cables: used to commission the management port or
                               other services.
                             • Multiple fibers and SFP/SFP+/QSFP optical modules: used to connect the
                               switch to other network devices.

2    Maintenance terminal    A maintenance terminal can be a portable computer with serial communication
                             software installed. You can log in to the switch through the maintenance
                             terminal.

3    Instruments and meters  Optical power meter: used to test optical parameters of optical ports (such
                             as optical power and receive sensitivity).

4 Visit the Huawei enterprise technical support website (http://support.huawei.com/enterprise) or the
carrier network technical support website (http://support.huawei.com/carrier) to register an account.
You can then browse or download product documents, cases, and announcements, and subscribe to
push messages from the website.

How to Quickly Maintain the CE12800
The overall CE12800 maintenance process is as follows:

1. Start.
2. Check indicators and rectify the fault.
3. Check alarms on the switch and rectify the fault.
4. Check the health status of the switch and rectify the fault.
5. Check card status and rectify the fault.
6. Is the fault rectified?
   Yes: End.
   No: Collect and report the fault information, then end.

To check alarms, health status, and card status, and to record fault information, you
must log in to the switch through the console port, Telnet, or STelnet. (For how to
log in to the switch, see Configuration Guide-Basic Configuration.) If you fail to log
in to the switch, see Solution to Device Login Failure.

Check Indicator Status
Check whether the status of each indicator is normal. If an indicator is in an abnormal state, record the fault
information and find out the fault handling methods according to the indicator status and meanings in
Hardware Description or the troubleshooting procedures in Troubleshooting. If the fault cannot be
rectified, contact technical support personnel.

The following table lists the normal status of each indicator on the switch.

Category            Indicator  Normal State

Chassis header      PWR        Steady green
                    FAN        Steady green
                    SFU        Steady green
                    MPU        Steady green

CMU                 RUN/ALM    Slow blinking green
                    ACT        Steady green: active CMU; off: standby CMU

MPU                 RUN/ALM    Slow blinking green
                    ACT        Steady green: active MPU; off: standby MPU
                    STACK      Steady on: active MPU of the stack system; blinking: non-active MPU of
                               the stack system; off: stacking is disabled.

LPU                 RUN/ALM    Slow blinking green

Power module        Input      Steady green
                    Output     Steady green
                    Alarm      Off

Fan module          STATUS     Slow blinking green

Switch fabric unit  RUN/ALM    Slow blinking green
                    OFL        Off

Note: For the meanings, status, and status description of each indicator, see the Hardware
Description.

Check for Critical or Major Alarms on the Switch
Log in to the switch and run the display alarm active root command to view the alarm status on the
switch. Check whether any critical or major alarms exist.

<HUAWEI> display alarm active root


--------------------------------------------------------------------------------
Sequence   AlarmId    Severity Date Time  Description
--------------------------------------------------------------------------------
7831       0x8520003  Major    2014-03-26 The interface status changes. (ifName=
                               08:43:05   10GE4/0/23, AdminStatus=UP, OperStatus
                                          =DOWN, Reason=Interface physical link
                                          is down, mainName=10GE4/0/23)
7805       0x95E0012  Warning  2014-03-26 MAC flapping detected, VlanId = 1, Ori
                               08:39:02   ginal-Port = 10GE4/0/12, Flapping port
                                          = Eth-Trunk1,10GE4/0/0.Check the netw
                                          ork connected to the interface learnin
                                          g a flapping MAC address : ac94-8400-d
                                          f01.

The alarms on the switch are classified into critical, major, minor, and warning alarms. The critical and
major alarms must be handled immediately. Handle these alarms according to the Alarm Reference. If
the alarms cannot be cleared, contact technical support personnel.
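
As a rough illustration of the triage rule above, the following Python sketch picks out critical and major alarms from saved display alarm active root output. It assumes the output has been captured as text and that the first line of each alarm starts with the sequence number, alarm ID, severity, and date, as in the sample above. The function name and the regular expression are illustrative and are not part of any Huawei tool.

```python
import re

# Matches the first line of an alarm entry: Sequence, AlarmId, Severity, Date.
# Assumption: the layout follows the sample output shown in this guide.
ALARM_LINE = re.compile(
    r"^\s*(?P<seq>\d+)\s+(?P<alarm_id>0x[0-9A-Fa-f]+)\s+"
    r"(?P<severity>[A-Za-z]+)\s+(?P<date>\d{4}-\d{2}-\d{2})\b"
)

# Severities that this guide says must be handled immediately.
URGENT = {"critical", "major"}


def urgent_alarms(output: str) -> list[dict]:
    """Return the alarms whose severity is Critical or Major."""
    found = []
    for line in output.splitlines():
        match = ALARM_LINE.match(line)
        if match and match["severity"].lower() in URGENT:
            found.append(match.groupdict())
    return found


sample = """\
7831  0x8520003  Major    2014-03-26  The interface status changes. (ifName=
                          08:43:05    10GE4/0/23, AdminStatus=UP, OperStatus
7805  0x95E0012  Warning  2014-03-26  MAC flapping detected, VlanId = 1, Ori
"""

for alarm in urgent_alarms(sample):
    print(alarm["seq"], alarm["alarm_id"], alarm["severity"])
```

Continuation lines and rule lines do not match the pattern, so only the first line of each alarm is inspected. The Alarm Reference remains the authority on how to handle each alarm ID.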

If you have a network management system (NMS), check the alarms on the NMS.
For details, see the NMS product documents.

Check the Health Status of the Switch
Log in to the switch and run the display health command to check the health status of the switch.

1 View the voltage information and check whether the voltage status of each present card is normal.

Voltage:
--------------------------------------------------------------------------------
Slot Card SensorID SensorName Status Upper Lower Current
(Volt) (Volt) (Volt)
--------------------------------------------------------------------------------
3 -- 16 12V NORMAL 14.35 9.54 11.62
-- 17 1.0V_AVS_A NORMAL 1.19 0.80 1.00

If the voltage status of a card is abnormal, record the fault information and handle the fault according
to the Troubleshooting. If the fault cannot be rectified, contact technical support personnel.
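
The Status column is what the switch itself reports. As a cross-check, the sketch below compares each Current value with the Upper and Lower columns. It assumes the readings have been copied by hand from the output above and treats the limits as inclusive, which is an assumption rather than documented device behavior. The type and function names are illustrative.

```python
from typing import NamedTuple


class VoltageReading(NamedTuple):
    sensor: str
    upper: float    # Upper threshold (Volt)
    lower: float    # Lower threshold (Volt)
    current: float  # Measured value (Volt)


def out_of_range(readings):
    """Return the readings whose current value falls outside [lower, upper]."""
    return [r for r in readings if not (r.lower <= r.current <= r.upper)]


# Values taken from the sample output for slot 3.
readings = [
    VoltageReading("12V", 14.35, 9.54, 11.62),
    VoltageReading("1.0V_AVS_A", 1.19, 0.80, 1.00),
]
print(out_of_range(readings))  # empty list when every sensor is within limits
```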

2 View the power information and check whether the status of each present power module is Supply.

Power:
-----------------------------------------------------------------------------
PowerNo Present Mode State Current Voltage ActualPower RatedPower
(Ampere) (Volt) (Watts) (Watts)
-----------------------------------------------------------------------------
PWR1 YES AC Supply 9.9 53.5 531.2 2700
PWR2 NO -- -- -- -- -- --
PWR3 NO -- -- -- -- -- --
PWR4 NO -- -- -- -- -- --
-----------------------------------------------------------------------------
N/A:Power not available

If the power status is abnormal, check whether the power supply is switched on and whether the
power cable is loose, and replace the problematic power module. If the fault cannot be rectified,
record the fault information and contact technical support personnel.

3 View the fan information and check whether the status of each present fan is Normal.

Fan:
------------------------------------------------------------------------------
FanId  FanNum  Present  Register  Status  Speed      Mode  Airflow Direction
------------------------------------------------------------------------------
FAN1   [1-2]   YES      YES       Normal  42%(4500)  Auto  Front-to-Back
FAN2   [1-2]   YES      YES       Normal  42%(4500)  Auto  Front-to-Back

If the fan status is abnormal, check whether the fan module is properly connected, whether the fan
blades are blocked or covered with heavy dust. If the preceding situations occur, remove (hot swap)
and reinstall the fan modules or clean the fan blades. If other situations occur, replace the fan module.
If the fault cannot be rectified, record the fault information and contact technical support personnel.

4 View the temperature information and check whether the temperature status of each present card is
normal.
Temperature:
---------------------------------------------------------------------
Slot Card SensorID Status Major Fatal Current
(Celsius) (Celsius) (Celsius)
---------------------------------------------------------------------
3 - 1 NORMAL 72 101 55
- 2 NORMAL 72 100 60

If the temperature is abnormal, check whether the ambient temperature in the equipment room is
normal, whether the heat dissipation channel in the chassis is blocked, and whether all fan modules
are working properly. Take appropriate measures accordingly. If the fault cannot be rectified, record
the fault information and contact technical support personnel.
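
To make the Major and Fatal columns easier to read, the sketch below classifies a reading against both thresholds and reports the remaining headroom to the Major threshold. It assumes that a reading equal to a threshold already counts as exceeding it; the guide does not state this, so the Status column reported by the switch should be trusted over this calculation. The function name is illustrative.

```python
def temperature_level(current: int, major: int, fatal: int) -> str:
    """Classify a reading against the Major and Fatal thresholds (Celsius)."""
    if current >= fatal:
        return "fatal"
    if current >= major:
        return "major"
    return "normal"


# Values from the sample output for slot 3: (current, major, fatal).
for current, major, fatal in [(55, 72, 101), (60, 72, 100)]:
    level = temperature_level(current, major, fatal)
    print(level, "headroom to Major:", major - current)
```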

5 View the memory usage information. The memory usage of each present card should be lower than 60%.

System Memory Usage Information:


System Memory Usage at 2013-10-12 15:44:35
------------------------------------------------------------------------------
Slot Total Memory(MB) Used Memory(MB) Used Percentage Upper Limit
------------------------------------------------------------------------------
3 1869 797 42% 95%
4 1869 487 26% 95%
5 3793 812 21% 95%
8 128 90 71% 95%
13 228 53 23% 95%
------------------------------------------------------------------------------

If the memory usage is too high, observe the memory usage for 5-10 minutes. If the memory usage is
still high, contact technical support personnel.

6 View the CPU usage information. The CPU usage of each present card should be lower than 80%.

System CPU Usage Information:


System CPU Usage at 2013-10-12 15:44:35
----------------------------------------
Slot CPU Usage Upper Limit
----------------------------------------
3 24% 95%
4 9% 95%
5 5% 95%
8 5% 95%
13 6% 95%
----------------------------------------

If the CPU usage is too high, observe the CPU usage for 5-10 minutes. If the CPU usage is still high,
contact technical support personnel.

7 View the storage media usage information. The storage media usage should be lower than 80%.

System Disk Usage Information:


System Disk Usage at 2013-10-12 15:44:35
--------------------------------------------------------------------------
Slot Device Total Memory(MB) Used Memory(MB) Used Percentage
--------------------------------------------------------------------------
5 flash: 3821 3642 95%
--------------------------------------------------------------------------

If the storage media usage exceeds 80%, delete redundant files. For details, see the Configuration
Guide-Basic Configuration.
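
The usage ceilings given in steps 5 to 7 (memory 60%, CPU 80%, storage media 80%) can be checked together. The sketch below assumes the percentages have been copied from the display health output into dictionaries keyed by slot; the names LIMITS and over_limit are illustrative.

```python
# Recommended ceilings from this guide, in percent.
LIMITS = {"memory": 60, "cpu": 80, "disk": 80}


def over_limit(kind: str, usage_by_slot: dict[int, int]) -> dict[int, int]:
    """Return {slot: usage} for slots at or above the guide's limit for `kind`."""
    limit = LIMITS[kind]
    return {slot: pct for slot, pct in usage_by_slot.items() if pct >= limit}


# Used Percentage values from the sample outputs above.
memory = {3: 42, 4: 26, 5: 21, 8: 71, 13: 23}
cpu = {3: 24, 4: 9, 5: 5, 8: 5, 13: 6}
disk = {5: 95}

print(over_limit("memory", memory))  # {8: 71}
print(over_limit("cpu", cpu))        # {}
print(over_limit("disk", disk))      # {5: 95}
```

In the sample outputs, slot 8 exceeds the memory ceiling and the flash on slot 5 exceeds the storage ceiling, even though both are still below the Upper Limit of 95% that the switch reports.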

Check the Card Status

Log in to the switch and run the display device all command to view card status.

<HUAWEI> display device all


CE12804's Device status:
----------------------------------------------------------------------------------
Slot Card Type Online Power Register Alarm Primary
----------------------------------------------------------------------------------
4 - CE-L24XS-EA Present On Registered Normal NA
5 - CE-MPUA Present On Registered Normal Master
7 - CE-CMUA Present On Registered Normal Master
13 - CE-SFU04C Present On Registered Normal NA
PWR1 - PAC-2700WA Present On Registered Normal NA
FAN1 - FAN-12C Present On Registered Normal NA
FAN2 - FAN-12C Present On Registered Normal NA
FAN3 - FAN-12C Present On Registered Normal NA
FAN4 - FAN-12C Present On Registered Normal NA
FAN5 - FAN-12C Present On Registered Normal NA
FAN6 - FAN-12C Present On Registered Normal NA
FAN7 - FAN-12C Present On Registered Normal NA
FAN8 - FAN-12C Present On Registered Normal NA
FAN9 - FAN-12C Present On Registered Normal NA
-----------------------------------------------------------------------------------

Check the following items:

• Whether the Online value is Present.
• Whether the Power value is On.
• Whether the Register value is Registered.
• Whether the Alarm value is Normal.
If the card status is abnormal, record the fault information and handle the fault according to the
Troubleshooting. If the fault cannot be rectified, contact technical support personnel.
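
The four checks above can be applied to every row of saved display device all output. The sketch below assumes each card row has exactly eight whitespace-separated fields in the column order of the sample output; rows in any other shape are skipped rather than guessed at. The names are illustrative.

```python
# Expected values for a healthy card, as listed in this guide.
EXPECTED = {"online": "Present", "power": "On",
            "register": "Registered", "alarm": "Normal"}
COLUMNS = ["slot", "card", "type", "online", "power", "register", "alarm", "primary"]


def abnormal_cards(output: str) -> list[tuple[str, str, str]]:
    """Return (slot, column, value) for every field that differs from EXPECTED."""
    problems = []
    for line in output.splitlines():
        fields = line.split()
        if len(fields) != len(COLUMNS) or fields[0] == "Slot":
            continue  # skip the header, rule lines, and unrecognized rows
        row = dict(zip(COLUMNS, fields))
        for column, expected in EXPECTED.items():
            if row[column] != expected:
                problems.append((row["slot"], column, row[column]))
    return problems


# Rows copied from the sample output above.
sample = """\
Slot  Card  Type         Online   Power  Register    Alarm   Primary
4     -     CE-L24XS-EA  Present  On     Registered  Normal  NA
5     -     CE-MPUA      Present  On     Registered  Normal  Master
PWR1  -     PAC-2700WA   Present  On     Registered  Normal  NA
"""
print(abnormal_cards(sample))  # empty list when every card is healthy
```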

Fault Information Collection and Feedback
When you find a fault on your switch, collect the fault information promptly and take the corresponding
measures.

Fault information includes:

• Basic fault information: fault occurrence time, symptom, severity, impact, network topology,
measures that have been taken, and effect
• Running status information: device name, version, current configuration, and interface information
• Log information: logs recorded when faults occur

Provide the collected information to technical support personnel.

Collect Basic Fault Information


When a fault occurs, collect the following basic fault information.

No.  Item                           Collection Method

1    Fault occurrence time          Record the time when the fault occurs, accurate to the minute.
2    Symptom                        Record the fault symptom and detailed information.
3    Impact                         Record the severity of the fault and impacted services.
4    Networking                     Draw a networking diagram, including the upstream and downstream
                                    devices and connected ports.
5    Measures that have been taken  Record the measures that have been taken and the effect of the
                                    measures (including command execution procedure and output).
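
If you keep fault records electronically, the five items in the table above map naturally onto a small record type. The following is a minimal sketch, not a format required by technical support; the class and field names are hypothetical, and the example values are illustrative text based on the sample alarm shown earlier in this guide.

```python
from dataclasses import dataclass, field
from datetime import datetime


@dataclass
class FaultRecord:
    occurred_at: datetime  # 1. Fault occurrence time
    symptom: str           # 2. Symptom and detailed information
    impact: str            # 3. Severity and impacted services
    networking: str        # 4. Upstream/downstream devices and connected ports
    measures: list[str] = field(default_factory=list)  # 5. Measures taken and effect

    def __post_init__(self):
        # Keep the time to minute precision by dropping seconds.
        self.occurred_at = self.occurred_at.replace(second=0, microsecond=0)


# Illustrative values only.
record = FaultRecord(
    occurred_at=datetime(2014, 3, 26, 8, 43, 5),
    symptom="Interface 10GE4/0/23 physical link is down",
    impact="Major alarm; services on the affected port",
    networking="See attached networking diagram",
)
record.measures.append("Ran display alarm active root; alarm still active")
print(record.occurred_at)  # 2014-03-26 08:43:00
```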

Collect Switch Running Information


Log in to the switch and run the display diagnostic-information command to collect switch running
information, including startup configuration, current configuration, port information, time, and system
version.
<HUAWEI> display diagnostic-information dia-info.txt
Now saving the diagnostic information to the device.............................
................................................................................
..............
Info: The diagnostic information was saved to the device successfully.

The generated diagnostic information file is saved in the flash:/ directory by default. You can run the dir
command in the user view to check whether the file has been generated.

You can transfer the file to your computer through TFTP, FTP, or SFTP to facilitate
information query and feedback. For details, see the Configuration Guide-Basic Configuration.

Collect Logs
Device logs involve user operations, system faults, and system security issues. Logs are classified into
user logs and diagnostic logs. After logging in to the switch, obtain the user logs and diagnostic logs as
follows:
<HUAWEI> save logfile //Collect common user logs.
<HUAWEI> system-view
[~HUAWEI] diagnose
[~HUAWEI-diagnose] save logfile diagnose-log //Collect device diagnostic logs.
[~HUAWEI-diagnose] collect diagnostic information //Collect OS diagnostic logs.

You can transfer the files from flash:/logfile to your computer through TFTP, FTP, or SFTP to facilitate
information query and feedback. For details, see the Configuration Guide-Basic Configuration.

Solution to Device Login Failure
If you fail to log in to the switch through Telnet or STelnet, log in to the switch through the console (also
called serial) port and check the Telnet or STelnet configuration.

If you still fail to log in to the switch through the console port, you cannot perform any operations related
to CLI. In this situation, you need to perform the following operations:

Perform the following operations only if user services are already interrupted. If user
services are still running, these operations will affect them; in that case, collect the
fault information and contact technical support personnel.

1 Check and recover the power supply system.


If the indicators of all cards are off and the fans are not running (listen for fan noise), the power supply
system has failed.

1. Check the power supply switches. If your switch has multiple power modules installed, at least
one power module must be switched on.
2. Check the Input indicator of the power module. If the indicator is off, the power supply input is
abnormal. Request the electrician to recover the power lines in the equipment room, rack, or
cabinet.
3. Check the Output indicator of the power supply. If the indicator is off, the power supply is
abnormal. Replace the power module.
4. Check the Alarm indicator of the power supply. If the indicator is on, the power supply is abnormal.
Replace the power module.
If the cards cannot be powered on and no error is found in the preceding checks, contact technical
support personnel.

2 Check and modify the communication parameters of the COM port on your computer.
Check whether the communication parameters of the COM port are the same as those of the switch's
console port. If not, modify the communication parameters.
The default settings of the switch's console port parameters include 9600 bps, 8 data bits, 1 stop bit,
no parity check, and no flow control (the actual settings may be different).
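
The comparison described above can be sketched as a simple check of the terminal settings against the defaults listed in this guide. The setting names below are hypothetical labels, not the option names of any particular terminal program, and the actual console port settings of your switch may differ from the defaults.

```python
# Default console port settings listed in this guide.
CONSOLE_DEFAULTS = {
    "baud_rate": 9600,
    "data_bits": 8,
    "stop_bits": 1,
    "parity": "none",
    "flow_control": "none",
}


def mismatched_settings(terminal: dict) -> dict:
    """Return {name: (terminal_value, expected_value)} for each differing setting."""
    return {
        name: (terminal.get(name), expected)
        for name, expected in CONSOLE_DEFAULTS.items()
        if terminal.get(name) != expected
    }


# Hypothetical terminal configuration with a wrong baud rate.
terminal = dict(CONSOLE_DEFAULTS, baud_rate=115200)
print(mismatched_settings(terminal))  # {'baud_rate': (115200, 9600)}
```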

3 Remove/reinstall or replace the MPU.


If the power supply system and console port work properly, the MPU may be faulty. If your switch has
two MPUs installed, remove/reinstall the problematic MPU. If your switch has only one MPU installed,
replace it with a new one.

4 Restart the switch.
If the fault cannot be rectified after you remove/reinstall or replace the MPU, you can restart the switch.
Power off the switch, and then power it on after it is completely shut down (indicators are off and fans
are completely stopped).

5 Seek technical support.


If the preceding methods are ineffective, contact technical support personnel.

Risky Operations

Hardware-Related Risky Operations

• Remove or install cables inside a cabinet.
• Remove or install cards without an ESD wrist strap.
• Remove the active MPU.
• Press the RST button of the MPU.
• Press the OFL button on the SFU.

Software-Related Risky Operations

• Run the reboot command to restart the switch.
• Run the reset slot command to reset cards.
• Run the power off slot command to power off cards.
• Run the shutdown command to shut down physical ports.
• Run the format command to format the storage device.
• Run the delete command to delete files from the storage device.
• Run the reset command to reset protocols.
• Change the authentication method or user login password of the console port or VTY user interfaces.

References
The following table lists links that provide more information about device maintenance.

Information                                         Link

Browse or download the CE12800 product documents.   http://support.huawei.com/enterprise/en//cloudengine-12800-pid-7542409
Search for the CE12800 cases in the case library.   http://support.huawei.com/enterprise/servicecenter?lang=en&idAbsPath=7919710|21782165|21782236&pid=21782236
Post your questions in the technical forum.         http://support.huawei.com/huaweiconnect/enterprise/forum-897.html
