Professional Documents
Culture Documents
Troubleshooting
Issue 20
Date 2020-09-25
and other Huawei trademarks are trademarks of Huawei Technologies Co., Ltd.
All other trademarks and trade names mentioned in this document are the property of their respective
holders.
Notice
The purchased products, services and features are stipulated by the contract made between Huawei and
the customer. All or part of the products, services and features described in this document may not be
within the purchase scope or the usage scope. Unless otherwise specified in the contract, all statements,
information, and recommendations in this document are provided "AS IS" without warranties, guarantees
or representations of any kind, either express or implied.
The information in this document is subject to change without notice. Every effort has been made in the
preparation of this document to ensure accuracy of the contents, but all statements, information, and
recommendations in this document do not constitute a warranty of any kind, express or implied.
Website: https://e.huawei.com
Overview
This document describes how to collect logs, diagnose faults, upgrade software,
perform preventive maintenance and common operations, and collect the
information required to for troubleshoot Huawei E9000, E6000, X6000, X8000,
X6800, rack, heterogeneous, Atlas 800 AI inference (model 3010), and Atlas 800 AI
training (model 9010) servers.
It guides you through the server troubleshooting process.
Intended Audience
This document is intended for:
● Technical support engineers
● Maintenance engineers
Symbol Conventions
The symbols that may be found in this document are defined as follows.
Symbol Description
Symbol Description
Change History
Issue Date Description
Contents
1 Safety Instructions
General Instructions
● Comply with all local laws and regulations when installing the hardware.
These Safety Instructions are only a supplement.
● Observe the instructions that accompany all "DANGER", "WARNING",
"CAUTION", and "NOTE" symbols in this document. Follow them in
conjunction with these Safety Instructions.
● Observe all safety instructions provided on the device labels when installing
hardware. Follow them in conjunction with these Safety Instructions.
● Operations involving high voltages or moving equipment must be performed
by authorized, qualified personnel.
● Take protective measures against radio interference before operating the
device in residential areas.
Personal Safety
● Only personnel certified or authorized by Huawei are allowed to install
equipment or its components.
● Discontinue any dangerous operations and take protective measures. Report
anything that could cause personal injury or equipment damage to a project
supervisor.
● Do not move devices or install cabinets and power cables in hazardous
weather conditions.
● The average weight carried by a person cannot exceed the maximum
acceptable weight of lift (MAWL) allowed by local safety regulations. Before
moving a device, check the maximum device weight and arrange required
personnel.
● Wear clean protective gloves, ESD clothing, a protective hat, and protective
shoes, as shown in Figure 1-1.
● Before contacting devices, wear antistatic clothing and ESD gloves, and take
off electricity-conductive materials such as watches and jewelries, as shown in
Figure 1-2.
● Exercise caution when using tools that could cause personal injury.
● Use a stacker when lifting hardware above shoulder height.
● Avoid any contact with high-voltage cables.
● Ensure that the device is properly grounded before powering it on.
● Do not use a ladder alone.
● Do not look into optical ports without eye protection.
Equipment Safety
● Use dedicated power cables to ensure equipment and personal safety.
● Use power cables only for dedicated devices.
● When moving a device, hold the handles or bottom of the device. Do not hold
the handle of the installed module, such as a power module, fan module,
drive, or mainboard.
● Connect the power cables to separate power distribution units (PDUs) for
active/standby operation.
Transportation Precautions
● The logistics company engaged to transport the equipment must be reliable
and comply with international standards for transporting electronics. Ensure
that the equipment being transported is always kept upright. Take necessary
precautions to prevent collisions, corrosion, package damage, damp
conditions and pollution.
● Transport the equipment in its original packaging.
● If original packages are not used, package heavy, bulky items (such as chassis
and compute nodes) and fragile components (such as PCIe cards and optical
modules) separately.
NOTE
CAUTION
To reduce the risk of personal injury, comply with local regulations with regard to
the maximum weight one person is permitted to carry.
Table 1-1 lists the maximum weight each person is permitted to carry by
standards organization.
2 Troubleshooting Process
3 Preparing for Prepare the manuals and tools required for fault diagnosis
Troubleshooting and rectification.
Step Description
Scenarios
This section describes how to prepare for troubleshooting.
Essential Materials
Table 3-1 lists the materials that you must read before routine maintenance for
Huawei servers.
Software Tools
Table 3-2 lists the software tools required for routine maintenance of Huawei
servers.
FusionServer For details, see Used for new site deployment and
Tools 2.0 the FusionServer delivery, troubleshooting, and firmware
SmartKit Tools 2.0 upgrade.
SmartKit User Download link: FusionServer Tools
Guide.
Smart See the Smart Used to install OSs without a physical
Provisioning Provisioning DVD-ROM drive, configure RAID, upgrade
User Guide. firmware, and perform troubleshooting.
Download link: Smart Provisioning
WinSCP All Huawei Third-party tool used for file transfer for
servers of all iMana 200/iBMC or the management
versions module. You can obtain the tool from the
Internet.
WFTPD All Huawei Third-party tool used for file transfer for
servers of all the Ethernet switching plane of a switch
versions module. You can obtain the tool from the
Internet.
CoreFTPServer/ All Huawei Third-party tools used for file transfer for
mini-sftp-server servers of all the FC switching plane of a switch
versions module. You can obtain the tool from the
Internet.
Hardware Tools
Table 3-3 lists the hardware tools required for routine maintenance of Huawei
servers.
Floating nut hook Used to guide floating nuts to the holes in the
mounting bars of a rack.
Tool Description
ESD wrist strap Used to prevent ESD damage when you touch or
operate devices or components.
Serial cable Used to connect the serial port on the server. The serial
port is usually a DB9 or RJ45 port.
4 Collecting Information
Collect logs immediately upon fault occurrence to obtain the original data.
4.1 Collecting Basic Information
4.2 Collecting OS Logs
4.3 Collecting Hardware Logs
4.4 Collecting Switch Module Logs (for E9000+MM910)
4.5 Collecting Switch Module Logs (for E9000+MM910/MM921)
4.6 Collecting Qlogic HBA Logs
4.7 Collecting Other Logs
OS and Service Example: SLES 11 SP1 64-bit or Oracle 10.2. (Consider the
Software Version fault symptom to determine whether to collect the OS and
service software versions.)
NOTICE
Table 4-2 describes the methods for collecting logs of different OSs.
Window Use SmartKit to collect Windows and Linux (RHEL, SLES, CentOS,
s Ubuntu) logs. For details, see the FusionServer Tools 2.0 SmartKit
User Guide.
Linux
VMware ● If the purple screen of death (PSOD) does not occur, perform the
following steps:
1. Log in to the ESX server console as the root user.
2. Run the vm-support command to collect all VMware logs.
3. After logs are collected, check that a log file in the esxsupport-
YYYY-MM-DD@HH-MM-SS.tgz format is generated in
the /var/tmp directory.
● If the PSOD occurs and the customer retains the site environment,
perform the following steps:
1. Capture a screenshot of the PSOD or take a photo to save the
displayed information.
2. Press Alt+F12 to switch to forcible memory information output
mode, and press Alt+PageUp/Alt+PageDown to capture
screenshots and photos. Ensure that screenshots and photos of
the last several screens are captured after the PSOD occurs.
3. Hot-restart the system, and run the vm-support command to
collect all VMware logs.
4. After logs are collected, check that a log file in the esxsupport-
YYYY-MM-DD@HH-MM-SS.tgz format is generated in
the /var/tmp directory.
● If the PSOD occurs and the customer hot-restarts the system, run
vm-support to collect all of the VMware logs and check that a log
file in the esxsupport-YYYY-MM-DD@HH-MM-SS.tgz format is
generated in the /var/tmp directory.
FreeBSD Log in to the OS CLI over SSH and copy all files in /var/log/.
Copy the messages file and all files prefixed with messages (for
example, messages.0) in /var/log/ before copying other files.
Solaris Log in to the OS CLI over SSH and copy all files in the /var/log/
directory and /var/adm/ directory.
Copy the syslog file and all files prefixed with syslog (for example,
syslog.0) in /var/log/, and copy the messages file and files prefixed
with messages (for example, messages.0) in /var/adm/ before
copying other files.
NOTICE
You can use one of the following methods to collect hardware logs:
● Use SmartKit to collect server hardware information in batches. For details
about the supported servers and operations, see section "Using SmartKit >
Collecting Server Logs" in the FusionServer Tools 2.0 SmartKit User Guide.
● Use iBMC to collect hardware logs of a single server. For details, see 8.3 Using
iBMC to Collect Information in Batches.
● Use iMana 200/iBMC to collect hardware logs. For details, see the 8.2 Using
iMana 200 to Collect Information in Batches or 8.3 Using iBMC to Collect
Information in Batches.
● Use SmartKit to collect hardware logs and Windows/Linux logs. For details,
see the FusionServer Tools 2.0 SmartKit User Guide.
Procedure
Step 1 Connect the Ethernet port of the PC to the management network ports of the
active and standby MM910 modules over the LAN. Figure 4-1 shows the network
connection.
NOTICE
● The MGMT port on the MM910 panel is the management network port.
● If the active MM910 MGMT port has been connected to the network by using a
network cable and the client needs to be directly connected to the MM910, do
not directly disconnect the network cable from the active MM910 MGMT port.
Otherwise, an active/standby MM910 switchover will be triggered, which may
cause network interruption. You are advised to connect the client to the active
MM910 STACK port in the chassis by using a network cable. If the active
MM910 STACK port has been connected to the MGMT port in another chassis,
use an idle active MM910 STACK port in another chassis.
NOTE
Step 2 Use an SSH tool and the MM910 floating IP address to connect to the MM910 CLI.
For details about how to use PuTTY for SSH login, see 8.15 Logging In to a
Server Over a Network Port by Using PuTTY.
NOTE
Step 3 to Step 5 configure the IP address and route for the management network port of
the Ethernet switching plane. If the IP address and routing information of the management
network port have been configured, skip Step 3 to Step 5.
Step 3 (Optional) Run the following command to query the IP address of the
management network port of the Ethernet switching plane:
● N indicates the slot number of the switch module. The value range is 1 to 4,
mapping to logical slot numbers 1E, 2X, 3X, and 4E from left to right on the
panel respectively.
● M: indicates the ID of the switching plane. The value for the Ethernet
switching plane is 2.
● If yes, go to Step 4.
● If no, go to Step 5.
Step 4 (Optional) Run the following command to set an IP address for the management
network port of the Ethernet switching plane:
● N indicates the slot number of the switch module. The value range is 1 to 4,
mapping to logical slot numbers 1E, 2X, 3X, and 4E from left to right on the
panel respectively.
● M: indicates the ID of the switching plane. The value for the Ethernet
switching plane is 2.
● ipaddress: indicates the IP address of the management network port.
● maskaddress: indicates the subnet mask of the management network port.
Step 5 (Optional) Configure the gateway for the switching plane by running the
following command so that the switching plane can communicate with the PC:
NOTE
For stacked switching planes, configure the gateway only for the master switching plane.
● N indicates the slot number of the switch module. The value range is 1 to 4,
mapping to logical slot numbers 1E, 2X, 3X, and 4E from left to right on the
panel respectively.
● M: indicates the ID of the switching plane. The value for the Ethernet
switching plane is 2.
● targetvalue: indicates the target network segment IP address of the switching
plane.
● maskvalue: indicates the subnet mask of the switching plane.
● gatewayvalue: indicates the gateway IP address of the switching plane.
----End
Prerequisites
● The switch modules have been powered on.
● For logging in to the Ethernet switching plane over SSH, the default username
is root and the default password is Huawei12#$.
● By default, the MM910 username is root and the password is Huawei12#$.
● You are familiar with the parameters required for this operation.
Procedure
Step 1 Connect the PC to the Ethernet switching plane.
For details, see 4.4.1.1 Connecting a PC to the Ethernet Switching Plane.
Step 2 Log in to the CLI of the Ethernet switching plane by using the SOL function of the
MM910.
For details about SOL login, see 8.17 Logging In to a Compute Node,
Passthrough Module, or Switch Module by Using the SOL Function of the
MM910.
Step 3 Run the following command to query the version of the Ethernet switching plane:
display version
● Information similar to the following is displayed:
BoardName : CX910
CPLD Version : 003
PCB Version : VER.A
Bootrom Version : 008
Creation Time : Sep 17 2012, 09:53:25
Backup Bootrom Version : 008
----End
4.4.2.3 Using the V5 Switch Module CLI to Collect Ethernet Switching Plane
Information
Operation Scenario
Use the E9000 server switch module CLI of the V5 platform to collect Ethernet
switching plane information, including:
● Logs
● Debugging information
● Trap information
For details about how to query the Ethernet switching plane version, see 4.4.1.2
Querying the Software Version of the Ethernet Switching Plane.
Prerequisites
Conditions
● WFTPD 4.2.4.610 or later has been installed on the PC.
● You have logged in to the Ethernet switching plane CLI. For details, see 8.15
Logging In to a Server Over a Network Port by Using PuTTY or 8.17
Logging In to a Compute Node, Passthrough Module, or Switch Module
by Using the SOL Function of the MM910.
Data
Table 4-5 describes the required parameters.
The default username of the switching plane is root, and the default password is
Huawei12#$.
NOTE
You can query and set IP addresses of all modules. For details, see 8.11 Logging In to the
MM910 WebUI.
● For the MM910 versions earlier than (U54) 2.20, choose System Management >
Network Management > xx > IP addresses.
● For the MM910 (U54) 2.20 or later, choose Chassis Settings > Network Settings > xx.
Software Tools
wftpd32.exe: used to transfer files between different platforms, for example, from
a PC to a switch module. This tool is third-party software. You need to prepare it
by yourself.
Procedure
Step 1 Configure the FTP server.
For detailed about the configuration operations, see 8.20 Configuring an FTP
Server.
Step 2 Configure the IP address of the management network port.
1. After logging in to the switch module by using a serial port or the SOL
function, run the following commands on the switching plane CLI to query
and set the IP address of the management network port so that the switch
module can properly communicate with the FTP server:
NOTE
Skip this step if you log in to the switch module by using a network port.
<Fabric>system-view
[Fabric]interface MEth 0/0/1
[Fabric-MEth0/0/1]ip address 192.168.100.123 24
[Fabric-MEth0/0/1]display this
#
interface MEth0/0/1
[Fabric-MEth0/0/1]quit
[Fabric]quit
2. If the configured IP address and the FTP server address are not on the same
network segment, run the following command on the HMM CLI to configure
a gateway for the switching plane:
smmset -l swiN:fruM -d route -v targetvalue maskvalue gatewayvalue
The parameters are described as follows:
– N indicates the slot number of the switch module. The value range is 1 to
4, mapping to logical slot numbers 1E, 2X, 3X, and 4E from left to right
on the panel respectively.
– M: indicates the ID of the switching plane. The value for the Ethernet
switching plane is 2.
– targetvalue: indicates the target network segment IP address of the
switching plane.
– maskvalue: indicates the subnet mask of the switching plane.
– gatewayvalue: indicates the gateway IP address of the switching plane.
For example, if the IP address is 192.168.112.1, run the following
command:
smmset -l swi3:fru2 -d route -v 0.0.0.0 0.0.0.0 192.168.112.1
Step 3 Obtain the log information.
1. Run the following command to collect logs.
<Fabric>display diagnostic-information diag-info.txt
Now saving the diagnostic information to the device
<Fabric>save logfile
Save log file successfully.
<Fabric>dir flashvx:/logfile/
Directory of flashvx:/logfile/
Idx Attr Size(Byte) Date Time(LMT) FileName
0 -rw- 2,939,200 Apr 01 2000 23:55:02 log.dblg
1 -rw- 95,988 Jan 07 2014 19:16:00 2014-01-07.19-13-54.log.zip
2 -rw- 172,081 Jan 07 2014 21:35:14 2014-01-07.21-31-56.log.zip
3 -rw- 2,716,484 Jan 23 2014 01:35:24 log.log
4 -rw- 4,589,648 Jan 17 2014 12:30:48 2000-04-01.23-55-08.dblg
3. Enter the IP address, username, and password to log in to the FTP server. In
the following example, the FTP server address is 200.1.1.126 and the
username is root.
<Fabric>ftp 200.1.1.126
Trying 200.1.1.126 ...
Press CTRL+K to abort
Connected to 200.1.1.126.
220 WFTPD 2.0 service (by Texas Imperial Software) ready for new user
User(200.1.1.126 none):root
331 Give me your password, please
Enter password:
230 Logged in successfull
[ftp]
NOTE
The IP address of the FTP server is configured by the user and is on the same network
segment as the management IP address of the switch module.
4. Convert the log file into a binary file for transfer.
[ftp]binary
5. Obtain the log file.
[ftp]put flash:/diag-info.txt
200 PORT command okay
150 "F:\diag-info.txt" file ready to receive in IMAGE / Binary mode
226 Transfer finished successfully.
FTP: 148848 byte(s) sent in 0.280 second(s) 531.60Kbyte(s)/sec.
[ftp]lcd flashVX:/logfile
The current local directory is flashVX:/logfile.
[ftp]mput *
Error: The file name . is invalid.
Error: The file name .. is invalid.
200 PORT command okay
150 "F:\log.dblg" file ready to receive in IMAGE / Binary mode
226 Transfer finished successfully.
FTP: 1513938 byte(s) sent in 1.160 second(s) 1305.11Kbyte(s)/sec.
200 PORT command okay
150 "F:\log.log" file ready to receive in IMAGE / Binary mode
226 Transfer finished successfully.
FTP: 2689148 byte(s) sent in 1.940 second(s) 1386.15Kbyte(s)/sec.
[ftp]quit
----End
4.4.2.4 Using the V8 Switch Module CLI to Collect Ethernet Switching Plane
Information
Operation Scenario
Use the CLI of an E9000 switch module to collect the following information about
the V8 platform:
● Logs
● Debugging information
● Trap information
For details about how to query the Ethernet switching plane version, see 4.4.1.2
Querying the Software Version of the Ethernet Switching Plane.
Prerequisites
Conditions
● WFTPD 4.2.4.610 or later has been installed on the PC.
● You have logged in to the Ethernet switching plane CLI. For details, see 8.15
Logging In to a Server Over a Network Port by Using PuTTY or 8.17
Logging In to a Compute Node, Passthrough Module, or Switch Module
by Using the SOL Function of the MM910.
Data
Table 4-6 describes the required parameters.
The default username of the switching plane is root, and the default password is
Huawei12#$.
NOTE
You can query and set IP addresses of all modules. For details, see 8.11 Logging In to the
MM910 WebUI.
● For the MM910 versions earlier than (U54) 2.20, choose System Management >
Network Management > xx > IP addresses.
● For the MM910 (U54) 2.20 or later, choose Chassis Settings > Network Settings > xx.
Software Tools
wftpd32.exe: used to transfer files between different platforms, for example, from
a PC to a switch module. This tool is third-party software. You need to prepare it
by yourself.
Procedure
Step 1 Configure the FTP server.
For details, see 8.20 Configuring an FTP Server.
Step 2 After logging in through the serial port or SOL function, run the following
commands on the Ethernet switching plane CLI to check whether the
management network port IP address has been configured.
NOTE
Skip this step if you log in to the switch module by using a network port.
<HUAWEI>system-view
[~HUAWEI]interface MEth 0/0/0
[~HUAWEI-MEth0/0/0]display this
● If the command output is as follows with no IP address displayed, go to Step
3
#
interface MEth0/0/0
#
return
● If the command output contains an IP address and gateway address, go to
Step 4.
#
interface MEth0/0/0
ip address 192.168.100.123 255.255.255.0
#
return
Step 3 (Optional) After logging in to the switch module by using a serial port or the SOL
function, run the following commands on the Ethernet switching plane CLI to
query and set the IP address of the management network port so that the switch
module can properly communicate with the FTP server:
NOTE
Skip this step if you log in to the switch module by using a network port.
<HUAWEI>system-view
[~HUAWEI]interface MEth 0/0/0
[~HUAWEI-MEth0/0/0]ip address 192.168.100.123 24
[~HUAWEI-MEth0/0/0]commit
[~HUAWEI-MEth0/0/0]display this
#
interface MEth0/0/0
ip address 192.168.100.123 255.255.255.0
#
return
[~HUAWEI-MEth0/0/0]quit
[~HUAWEI]quit
Step 4 Obtain the log information.
1. View the log file system.
<HUAWEI>system-view
Enter system view, return user view with return command.
[~HUAWEI]diagnose
Warning: Enter diagnose view, return user view by pressing Ctrl+Z.
Info: The diagnose view is used to debug system hardware and software. Misuse of some commands
in this view will affect system performance. Therefore, use these commands with the guidance of
Huawei engineers.
[~HUAWEI-diagnose]collect diagnostic information
Info: Succeeded in collecting diagnostic information in slot 3.
[~HUAWEI-diagnose]display diagnostic-information diag-info.txt
--------------------------------------------------------------------------------
2 Standby dcd2-fcf8-5600 100 CX910 2X/300
--------------------------------------------------------------------------------
Role specifies the switch module role. The value can be Master, Standby, or
Slave, indicating the primary switch module, standby switch module, and
slave switch module respectively. Bay in Bay/Chassis indicates the switch
module slot number.
3. Obtain the log file.
<HUAWEI>ftp 192.168.100.122
Trying 192.168.100.122 ...
Press CTRL+K to abort
Connected to 192.168.100.122.
220 WFTPD 2.0 service (by Texas Imperial Software) ready for new user
User(192.168.100.122:(none)):huawei
331 Give me your password, please
Enter password:
230 Logged in successfully
[ftp]binary
200 Type is Image (Binary)
# On the FTP server, create a log receiving directory for the master switch
module in the stack. In this example, the number 3 in swi3 indicates the stack
ID (same as the slot number) of the master switch module. (If the switch
modules are not stacked, create a log receiving directory for the current
switch module. The number 3 in swi3 indicates the slot number of the current
switch module.)
[ftp]mkdir swi3
[ftp]cd swi3
[ftp]put flash:/diag-info.txt
200 Port command successful.
150 Opening data connection for diag-info.txt.
/ 100% [***********]
226 File received ok
[ftp]mput flash:/logfile/*
200 Port command successful.
150 Opening data connection for diag.log.
/ 100% [***********]
226 File received ok
[ftp]cd ..
# On the FTP server, create a log receiving directory for the standby or slave
switch module in the stack. In this example, the number 2 in swi2 indicates
the stack ID (same as the slot number) of the master switch module. (If the
switch modules are not stacked, log in to each switch module and repeat the
preceding log collection procedure.)
[ftp]mkdir swi2
[ftp]cd swi2
[ftp]mput 2#flash:/logfile/*
[ftp]cd ..
[ftp]quit
221 Windows FTP Server (WFTPD, by Texas Imperial Software) says goodbye
<HUAWEI>
NOTE
– When you use the mput command in the FTP CLI, 2#flash:/ indicates the flash
root directory of the switch module with the stack ID 2. You can obtain the stack
ID and role information by using the display stack command.
– The flash root directory of the master switch module in a stack is flash:/.
– If multiple switch modules are displayed after running the display stack
command, obtain the log file of each switch module in the logfile directory.
4. View the log file in the FTP directory on the PC.
----End
Operation Scenario
Use Web Tools page of a switch module (MX510) to collect information about the
FC switching plane.
This section applies to the CX311, CX911, and CX915.
Prerequisites
Conditions
● The connection between the management IP address of the FC switch module
and the server IP address is normal.
● You have logged in to the Ethernet switching plane Web Tools page. For
details, see 8.10 Logging In to the Web Tools of the MX510.
Data
IP address 192.168.1.100
For exporting the dump_support log file, the username is images, and the default
password is Huawei12#$.
Procedure
Step 1 On Web Tools, choose Switch > Download Support File, as shown in Figure 4-2.
Step 2 Select the directory for storing the log file, and click Start.
The log file download starts. If "Support file saved" is displayed in the Status area,
the log file has been successfully exported, See Figure 4-3.
----End
Operation Scenario
Use the CLI of a switch module (MX510) to collect FC switching plane information.
This section applies to the CX311, CX911, and CX915.
Prerequisites
Conditions
● The PC has been connected to the management network port of the server by
using a network cable.
● The mini-sftp-server.exe software has been obtained.
NOTE
If the MX510 firmware version is earlier than 9.8.2.6.0, you can use the FTP tool WFTPD to
collect information. For details, see 8.20 Configuring an FTP Server.
Data
IP address 192.168.1.100
The default username of the switching plane is admin, and the default password
is Huawei12#$.
Software Tools
mini-sftp-server.exe: used to transfer files between different platforms, for
example, from a switch module to a PC. This tool is third-party software. You need
to prepare it by yourself.
Procedure
Step 1 Configure an SFTP server.
For details, see 8.21 Using SFTP to Transfer Files.
Step 2 Log in to the MX510.
For details about how to access the FC switching plane CLI, see 8.15 Logging In
to a Server Over a Network Port by Using PuTTY or 8.17 Logging In to a
– If you press Enter when the CLI prompts you to specify the directory for
storing the dump file, the dump file is automatically downloaded to the
default directory on the SFTP server.
----End
Operation Scenario
Use the CLI of a switch module (MX210/MX220) to collect FC switching plane
information.
This section applies to the CX210, CX220, CX912, and CX916. The FC switching
planes of the CX210 and CX912 are the MX210, and those of the CX220 and
CX916 are the MX220.
Prerequisites
Conditions
● The PC has been connected to the management network port of the server by
using a network cable.
● The mini-sftp-server.exe software has been obtained.
Data
IP address 10.77.77.77
The default username of the switching plane is admin, and the default password
is Huawei12#$.
Software Tools
Procedure
Step 1 Configure an SFTP server.
For details about how to access the FC switching plane CLI, see 8.15 Logging In
to a Server Over a Network Port by Using PuTTY or 8.17 Logging In to a
Compute Node, Passthrough Module, or Switch Module by Using the SOL
Function of the MM910.
Step 3 Run the ipaddrset command to set the management IP address and then run the
ipaddrshow command to check whether the IP address is correct.
● IPv4
FC_SW:admin> ipaddrset
Ethernet IP Address [10.77.77.77]:10.32.53.47
Ethernet Subnetmask [255.255.255.0]:255.255.240.0
Fibre Channel IP Addresss [none]:
Fibre Channel Subnetmask [none]:
Gateway IP Address [0.0.0.0]:10.32.48.1
DHCP [Off]:
IP address is being changed...Done.
FC_SW:admin> ipaddrshow
FC_SW:admin> ipaddrshow
Ethernet IP Address: 10.32.53.47
Ethernet Subnetmask: 255.255.240.0
Fibre Channel IP Addresss: none
Fibre Channel Subnetmask: none
Gateway IP Address 10.32.48.1
DHCP: Off
● IPv6
FC_SW:admin> ipaddrset -ipv6 --add fd00:60:69bc:82:205:33ff:fed7:f6fe/64
IP address is being changed...Done.
FC_SW:admin> ipaddrshow
SWITCH
Ethernet IP Address: 10.20.24.55
Ethernet Subnetmask: 255.255.240.0
Gateway IP Address: 10.20.16.1
DHCP: Off
IPv6 Autoconfiguration Enabled: No
Local IPv6 Addresses:
static fd00:60:69bc:82:205:33ff:fed7:f6fe/64 preferred
IPv6 Gateways: fe80:21b:3dff:fe0b:7800 fe80:21b:edff:fe0b:2400
NOTE
The current environment uses IPv4 addresses. You do not need to set the IPv6 address.
2. Set the log collection parameters as prompted and start log collection.
– Host IP or Host Name: specifies the address for storing logs on the
target device (the SFTP server IP address).
– User Name: specifies the username for logging in to the target device
(the SFTP server username).
– Password: specifies the password for logging in to the target device (the
SFTP server password).
– Protocol: specifies the transfer protocol. Set this parameter to sftp.
– Remote Directory: specifies the directory for storing log files on the SFTP
server. Create the /support directory in the home directory of the SFTP
server, and set Remote Directory to /support.
(Optional) When "Do you want to continue with CRA (Y/N)" is displayed,
enter n to start collecting logs.
----End
Using FusionDirector
● FusionDirector has been installed on the MM920 and can be used to collect
chassis information.
● After FusionDirector manages the chassis of the MM921, you can use
FusionDirector to collect information.
Step 6 Select the switch modules whose logs you want to export and click OK.
After the task is complete, decompress the downloaded package to obtain switch
module logs.
----End
NOTICE
NOTICE
V2 rack servers See the Huawei Rack Server Alarm Handling (iMana
200).
V2/V3/V5 rack See the FusionServer Pro Rack Server iBMC Alarm
servers Handling.
X6000 See the FusionServer Pro X6000 Server iBMC (Earlier
than V250) Alarm Handling or X6000 Server Alarm
Handling (iMana 200).
X8000 See the X8000 Server V100R001 Alarm Reference.
Atlas 800 AI See the Atlas 800 AI Training Server iBMC (V3.01.00.00
training server or Later) Alarm Handling (Model 9010).
(model 9010)
2488H V5, 5885H V5, 1288X V5, 2288X V5, 2288 C V5, Atlas 800 AI inference
server (model 3010), Atlas 800 AI training server (model 9010). Table 5-3
describes the status and meanings of the fault diagnosis LED. For details about the
position of the fault diagnosis LED on each server, see the server user guide.
Figure 5-1 shows the position of the fault diagnosis LED on an RH1288 V3 server.
For details about how to rectify the fault, see the corresponding alarm handling
manual.
3. On the Documentation tab page, choose Operation & Maintenance > User
Guide.
4. View the required user guide.
Process
Figure 5-2 shows the process for checking the indicators.
Step 2 View iMana 200 or iBMC system event logs (SELs) to locate faults.
Step 3 Check the status indicators of the components.
● Table 5-5, Table 5-6, Table 5-7, Table 5-8, Table 5-9, and Table 5-10
describe the meanings of the SAS/SATA drive status indicator, NVMe drive
status indicator, M.2 FRU indicator, PSU status indicator, network port
indicator, and FlexIO card status indicator, and the corresponding handling
procedures.
● Table 5-11 describes the meanings of the indicators for each module of the
RH5885 V2, RH5885 V3, and RH5885H V3, and the corresponding handling
procedures.
● Table 5-12 describes the meanings of the indicators for each module of the
RH8100 and X6800, and the corresponding handling procedures.
● Table 5-13 describes the meanings of the aggregation network port
indicators on the X6000, X6800, and X6800 V5, and the corresponding
handling procedures.
● Table 5-14, Table 5-15, and Table 5-16 describe the meanings of the MM910
management module indicator, E9000 fan module indicator, and E9000 switch
module indicator, and the corresponding handling procedures.
● Table 5-17 and Table 5-18 describe the meanings of the fan module
indicator and network port indicator on the Atlas 800 training server (model
9010), and the corresponding handling procedures.
Steady green or Steady yellow The NVMe drive is Reseat the NVMe
off faulty. drive. If the
problem persists,
replace the NVMe
drive.
Blinking Data is
yellow being
transmitted
on the
network.
Blinking Data is
green being
transmitted
on the
network.
Blinking Data is
yellow being
transmitted
on the
network.
Blinking Data is
green being
transmitted
on the
network.
----End
Indicators Available Only on the RH5885 V2, RH5885 V3, and RH5885H V3
A switch module
that cannot be
stacked is
operating
properly.
A switch module
that cannot be
stacked is being
powered on.
Indicators Available Only on the Atlas 800 AI Training Server (Model 9010)
1288H V5, 1288X V5, 2288 One CPU in the CPU1 One PSU in any slot
V5, 2288C V5, 2288H V5, socket
2288X V5, 5288 V5, 5288X
V5 One DIMM in the
DIMM000(A) slot
2488 V5, 2488H V5, 5885H Two CPUs in the CPU1 None
V5 and CPU2 sockets
NOTE
● If a fault can be located using logs or tools, see "Handling Procedure". If a fault needs
to be rectified quickly onsite, see "Quick Recovery Method".
● For more fault symptoms and solutions, see the Computing Case Library. The
Computing Case Library is available only to Huawei engineers and partners.
A PSU is 1. Check the PSU indicator and 1. Check whether the current
faulty record any alarms on the configuration has sufficient
(the PSU iMana 200 or iBMC WebUI. For power supplies.
has no details, see 5.5 Checking ● If yes, services are not
power Indicators to Locate Faults. affected.
output NOTE
and the ● If no, contact Huawei
● For E9000 servers, record technical support.
health alarms on the MM910 WebUI.
indicator 2. Replace the faulty PSU with
2. Check whether an "AC lost"
is a spare PSU. Do not install
alarm is generated.
blinking the faulty PSU into a server
red). ● If yes, check that the power again.
cable is connected properly
and that the PDU is
supplying power properly.
● If no, go to 3.
3. Replace the PSU with a spare
PSU and check whether the
fault is rectified.
● If yes, no further action is
required.
● If no, go to 4.
4. Replace the PSU backplane or
replace the mainboard if no
PSU backplane is configured.
Check whether the fault is
rectified.
● If yes, no further action is
required.
● If no, contact Huawei
technical support.
The rack 1. Check whether the external Follow the handling procedure
server/ power supply to the rack server to replace any faulty modules.
Atlas 800 is normal.
AI ● If yes, go to 2.
inference
server ● If no, resolve this issue.
(model 2. Replace the PSU with a normal
3010)/ one and check whether the
Atlas 800 fault is rectified.
AI ● If yes, no further action is
training required.
server
(model ● If no, go to 3.
9010) is 3. Replace the mainboard and
not PSU backplane and check
powered whether the fault is rectified.
on (all ● If yes, no further action is
indicator required.
s are
● If no, contact Huawei
off).
technical support.
● If a fault can be located using logs or tools, see "Handling Procedure". If a fault
needs to be rectified quickly onsite, see "Quick Recovery Method".
● For more fault symptoms and solutions, see the Computing Case Library. The
Computing Case Library is available only to Huawei engineers and partners.
2. If the KVM connection is abnormal, you are advised to use Independent
Remote Console for login.
● If a fault can be located using logs or tools, see "Handling Procedure". If a fault needs
to be rectified quickly onsite, see "Quick Recovery Method".
● For more fault symptoms and solutions, see the Computing Case Library. The
Computing Case Library is available only to Huawei engineers and partners.
The server 1. View serial port logs to For a rack server/Atlas 800 AI
fails to determine whether the iMana inference server (model 3010)/
enter the 200 or iBMC has been Atlas 800 AI training server
standby repeatedly reset. (model 9010), perform the
mode If the iMana 200 or iBMC has following operations:
after it been repeatedly reset, the logs 1. Power off the server,
powers repeatedly record the remove and reinstall the
on. (The following information: power cables, power on the
power ### JFFS2 load complete: 1107083
server, and check whether
bytes loaded to 0x8b000000
indicator ## Booting kernel from Legacy Image the iMana 200 or iBMC is
is blinking at 8a000000 ... functioning correctly.
yellow for Image Name: linux-2.6.34
over 5 Image Type: ARM Linux Kernel ● If yes, upgrade the iMana
Image (uncompressed)
minutes.) Data Size: 1511292 Bytes = 1.4
200 or iBMC by using
MiB software of its current
Load Address: 86008000 version or a later version.
Entry Point: 86008000
Verifying Checksum ... OK ● If no, check the iMana
## Loading init Ramdisk from Legacy 200 or iBMC version. If
Image at 8b000000 ...
Image Name: Ramdisk Image the version is 1.91 or
Image Type: ARM Linux RAMDisk later, go to 2; otherwise,
Image (uncompressed) go to 3.
Data Size: 1107019 Bytes = 1.1
MiB 2. Keep the power cables
Load Address: 00000000 removed and add a jumper
Entry Point: 00000000
Verifying Checksum ... OK cap to the Clear_BMC_PW
Loading Kernel Image ... OK pin on the mainboard to
OK attempt to restore the
Starting kernel ... default settings of the
iMana 200 or iBMC. Then
NOTE
reconnect power cables.
● The CH140 and CH140 V3
compute nodes of the E9000 3. Replace the mainboard or
do not provide any serial BMC board.
ports. Directly ping the IP
address of the iMana 200 or
For an E9000 server, perform
iBMC. If the ping tests the following operations:
occasionally or always fail, use 1. Remove and reinstall the
the quick recovery method. If
the problem persists, contact
compute node and check
Huawei technical support. whether the iMana 200 or
iBMC is functioning
● During the iMana 200 or
iBMC startup process, the
correctly.
serial port on a server is used ● If yes, upgrade the iMana
by default. After the startup is 200 or iBMC by using
complete, the serial port is
switched for the system serial
software of its current
port. version or a later version.
● During the iBMC startup ● If no, check the iMana
process, the serial port on a 200 or iBMC version. If
server is used by default. After the version is 1.91 or
the startup is complete, the later, go to 2; otherwise,
serial port is switched for the
system serial port.
go to 3.
NOTE
● If a fault can be located using logs or tools, see "Handling Procedure". If a fault needs
to be rectified quickly onsite, see "Quick Recovery Method".
● For more fault symptoms and solutions, see the Computing Case Library. The
Computing Case Library is available only to Huawei engineers and partners.
NOTE
● If a fault can be located using logs or tools, see "Handling Procedure". If a fault needs
to be rectified quickly onsite, see "Quick Recovery Method".
● For more fault symptoms and solutions, see the Computing Case Library. The
Computing Case Library is available only to Huawei engineers and partners.
A RAID 1. Power off the server, swap the 1. If the redundant RAID array
controller drive that cannot be identified fails or no RAID array is
card fails with a normal drive, and configured, the related drive
to identify power on the server to check partitions are unavailable.
one or whether the drive is faulty. 2. Move the unidentified drives
more ● If the fault is caused by the or all drives in the RAID
drives. drive, replace the drive. array to a standby server.
● If the fault is caused by the Ensure that you retain their
drive slot, check whether order during this process
SAS cables are connected and attempt to back up
properly to all SAS ports on data.
the drive backplane. For 3. Follow the handling
details, see the server user procedure to replace any
guide. faulty modules.
● If the fault persists, go to 2.
2. Replace the RAID controller
card first, the SAS cables
second, and the drive
backplane third.
Note: If a fault occurs on the RH2288A V2 server, check whether the cable
connecting the mainboard to the power adapter board is connected properly.
Figure 5-3 shows the cable connection.
NOTE
● If a fault can be located using logs or tools, see "Handling Procedure". If a fault needs
to be rectified quickly onsite, see "Quick Recovery Method".
● For more fault symptoms and solutions, see the Computing Case Library. The
Computing Case Library is available only to Huawei engineers and partners.
A network 1. Ensure that the NIC type, NIC 1. If a visible NIC port
port is driver, OS, BIOS version, and becomes invisible when the
invisible. iMana 200 or iBMC version on server is running, and
the server or compute node services can be interrupted,
are compatible. power the server off and on.
● If you use a system that is If the fault persists, go to 2.
not listed in Computing 2. Insert the NIC into another
Product Compatibility PCIe slot and check whether
Checker, contact the OS the fault is rectified.
compatibility team. ● If the NIC is causing the
NOTE fault, replace the NIC.
You are advised to use the system
listed in Computing Product ● If the PCIe slot is causing
Compatibility Checker. the fault, replace the
mainboard.
● If the NIC firmware and
driver versions do not
match, upgrade them to
the matching versions.
2. To check whether the PCI
device of the NIC is visible, run
the lspci | grep -i eth*
command in Linux (or
equivalent in other operating
systems) and observe the
response.
● If yes, go to 4.
● If no, go to 3.
3. If the PCI device is invisible,
perform the following steps:
a. Check the logical topology
of the NIC. If the NIC PCI
bus does not have a CPU,
screw-in PCI cards
connected to the bus are
invisible.
b. Power the iMana 200 or
iBMC off and then on.
Check whether the fault
persists.
c. Insert the NIC you suspect
to be faulty into another
slot, and a normal NIC into
the slot you suspect to be
faulty. Then check which of
these cause the fault.
4. If the PCI device is visible but
its network port is invisible,
NOTE
For more fault symptoms and solutions, see the Computing Case Library. The Computing
Case Library is available only to Huawei engineers and partners.
The storage device 1. Connect to the switch and run the brocade:
fails to identify the switchshow command to query port connection
host World Wide status.
Port Name 2. If the switch fails to obtain the host WWPN, the host
(WWPN). bus adapter (HBA) cannot register with the switch. In
this case, do as follows:
a. Check that the HBA and the processor connected
to the PCIe bus are installed properly.
b. (Optional) Check the mapping between the HBAs
and switch modules for E9000 and E6000 servers.
c. Check the FC link between the HBA and the switch
by checking the optical module power, optical
fiber, and optical module compatibility. If E9000
servers are used, check the HBA work mode.
d. Ensure that the lpfc driver and firmware matching
the E9000 are installed.
e. If multiple switches are connected, check whether
the switch connection mode (AG or TR) is correct.
f. Collect the OS message logs and check lpfc driver
information for faults.
g. Collect log information of the switches.
3. If the HBA is successfully registered with the switch,
the switch obtains the host WWPN, but the storage
cannot identify host WWPNs, rectify the fault as
follows:
a. Check the FC links (optical cables and modules)
between the switch and the storage device.
b. Check whether the HBA and the storage ports are
in the same zone.
c. Check whether the zone configurations are the
same for switches from the same vendor.
d. Collect the OS message logs and check lpfc driver
information for faults.
e. Collect the log information of switches.
The storage device 1. Check whether the lpfc driver and firmware matching
has identified the the E9000 have been installed.
HBA WWPN, but 2. Collect the OS message logs and check lpfc driver
LUNs cannot be information for faults.
mapped to the host.
3. Collect log information of the switches.
4. If no faults are identified, faults may exist on the
storage device or OS SCSI application layer. Contact
the OS or storage device vendor.
Some multipath 1. Ensure that the installed lpfc driver and firmware
links of LUNs are match the E9000.
down. 2. Check for error codes on FC links between the HBA
and the storage device.
3. Collect the OS message log and check lpfc and
multipath driver information for faults.
4. Collect log information of the switches.
5. Contact the OS multipath driver vendor or storage
device vendor.
Poor data read/write 1. Check whether the installed lpfc driver and firmware
performance of match the E9000.
LUNs 2. Check for error codes on FC links between the HBA
and the storage device.
3. Run the iostat command on the host to query the I/O
delay and concurrent I/O operations.
4. Collect the OS message log and check the lpfc driver
information and the I/O queue depth configured for
the HAB driver.
5. Perform drive performance tests (read and write 100
GB and 100 MB files).
6. Contact storage analysis engineers.
Storage services are 1. Migrate all services, and safely power off the server.
affected but HBA Next, remove and reinstall the compute node, and
links are normal. power on the server. Then, check whether the fault is
rectified.
● If yes, no further action is required.
● If no, contact the storage vendor for quick fault
recovery.
2. Before contacting Huawei technical support, it is
recommended that you migrate services and collect
switch module logs, OS logs, LLD networking
information, and device time differences.
NOTE
For more fault symptoms and solutions, see the Computing Case Library. The Computing
Case Library is available only to Huawei engineers and partners.
Incorrect packets are generated Run the display interface command and
(running the display interface check CRC and Symbols.
command shows that the value 1. If the values of CRC and Symbols are not
of Total Error in the Input zero, perform the following operations:
area is not zero and keeps
increasing). ● Ensure that the optical cables are
connected properly to the faulty switch
module and the device it is directly
connected to.
● Check whether any optical cables are
damaged.
● Check whether the optical modules of
the faulty switch module and the device
it is directly connected to are working
properly.
● If there is a transmission device between
the switch module and its connected
device, check the transmission device
gateway for alarms.
2. If the values of CRC and Symbols are zero,
run the reboot command to restart the
switch module.
5.6.9 OS Faults
OS Installation Faults
Diagnose and rectify faults related to OS installation depending on the symptoms.
NOTE
For more fault symptoms and solutions, see the Computing Case Library. The Computing
Case Library is available only to Huawei engineers and partners.
OS Faults
If you have confirmed that faults are not caused by other factors, diagnose them
as follows:
Table 6-1 lists the software and firmware to be upgraded and reference
documents of servers.
Atlas 800 AI iBMC, BIOS, LCD, CPLD, and card 2. Choose a server model
inference firmware to access the product
server page.
(model 3. Click the Software
3010)/Atlas Download tab.
800 AI
training 4. Select the latest patch
server version.
(model 5. Download the required
9010) upgrade package.
7 Preventive Maintenance
NOTICE
Take protective measures to prevent ESD damage and any other damage to
servers during preventive maintenance.
7.1.1 Precautions
Familiarize yourself with the security icons listed in Table 7-1 before preventive
maintenance to reduce the chance of injury to yourself or damage to the
equipment. These security icons will be on some server components.
Icon Description
Indicates that this device can cause personal injury or can fail to
operate properly if it is not externally grounded. Each end of a
ground cable should be connected to a different device, and the
devices must be connected to ground points.
Indicates that this device can cause personal injury or can fail to
operate properly if it is not internally grounded. Each end of a
ground cable should be connected to different device
components, and the device must be connected to a ground
point.
To prevent any damage to the cables, take the following precautions before
inspecting the cable layout:
● Check that power cables meet the following requirements:
– The connector surface of each three-wire power ground cable is in a good
condition.
– All power cable types are correct.
– The insulation layer of each power cable is in a good condition.
● Keep cables slack and away from heat sources.
● Do not use excessive force to install or remove a cable.
● Install or remove a cable by holding its connectors.
● Do not twist or tear cables.
● Lay out and connect cables properly, and ensure that they are not in contact
with any components that are removable or replaceable.
For details, see 7.3 Huawei Server Inspection Report.
7.2.1 Precautions
● Obtain the customer's consent before inspecting servers. Without customers'
written authorization, do not modify server configurations, power on or off
servers, remove or insert components, or change cables.
● Before inspecting servers, obtain the iMana 200 or iBMC IP address, MM910
IP address and password of the root or Administrator user for each server to
be inspected. After inspecting servers, advise the customer to change the
password of the root or Administrator user as soon as possible.
● Power indicator
● UID indicator
● Network port status indicator
● Fan module indicators
● E9000 switch module indicators
● E9000 management module indicators
Inspection and log collection do not modify data, collect service data, or affect services, and
will delete the collection scripts and files when finished.
For details about the supported server models and inspection operations, see the
FusionServer Tools 2.0 SmartKit User Guide.
Prerequisites
You can log in to the iBMC WebUI.
Step 3 View the status of hardware, including drives, DIMMs, and sensors.
1. On the menu bar of the iBMC WebUI, choose Information.
2. In the navigation tree, choose System Info. On the right panel, click the
Storage tab and view hardware status information.
3. In the navigation tree, choose Real-Time Monitoring to view the CPU usage,
memory usage, and air intake vent temperature.
NOTE
– The RH5885 V3, RH5885H V3, and RH8100 V3 do not support display of the CPU
usage and memory usage.
– After iBMA 2.0 is installed and started on the server OS, the CPU usage is obtained
from the iBMA 2.0 and the CPU usage data is the same as the data collected on
the OS.
– If iBMA 2.0 is not installed on the server OS or iBMA 2.0 has not completely
started, the CPU usage data is obtained from the Intel Management Engine (ME).
The CPU usage is the average compute usage per second of all CPU cores
calculated by the CPU internal module.
– If iBMA 2.0 is not installed on the server OS, obtain the latest iBMA user guide and
software package, and install iBMA 2.0 by referring to the user guide.
4. In the navigation tree, choose Sensor Info to view the status of sensors.
----End
Procedure 2 (For iBMC V561 and Later or iBMC V3.01.00.00 and Later)
Step 1 Log in to the iBMC WebUI. For details, see 8.9 Logging In to the iBMC WebUI.
Step 3 View the status of hardware, including drives, DIMMs, and sensors.
1. In the navigation tree, choose System > System Info. Click Memory to view
the detailed memory information.
2. In the navigation tree, choose System > System Info. Click Sensors Info to
view the sensor status.
3. In the navigation tree, choose System > Storage Management to view the
status of hardware such as system drive.
4. In the navigation tree, choose System > Performance Monitoring to view the
CPU usage, memory usage, and drive usage.
----End
Customer
Name
Equipment Eq
Room Address uip
me
nt
Ro
om
Na
me
Equipment Ph
Room Director on
e
Nu
mb
er
Time of
Inspection
Inspected By Phone
Numb
er
Service Hotline
Enterprise China 4008229999
Region:
Huawei 8008303118/02981770177
engineers and
partners:
Inspecting Servers
View the inspection report generated by SmartKit to check server health status. An
item has passed the inspection if the value of Result for the item is OK in the
report.
Insp Ph Date
ecte on
d By e
N
u
m
be
r
In P Date
sp h
ec o
te n
d e
By N
u
m
b
er
8 Common Operations
NOTE
Check the first two digits of the product SN before reading the following information.
● If the first two digits of the product SN are 02 or 03, see Figure 8-1.
NOTE
No. Description
1 SN ID (two characters).
No. Description
No. Description
Obtaining a Product SN
Use one of the following methods to obtain a product SN:
● Use SmartKit.
Use the server inspection function of SmartKit to obtain ESNs in batches. For
details about the product SN, see "Asset Inspection Information" > "Board SN"
in the inspection report.
● View the product label.
A product label is attached to each Huawei server. You can view the product
label to obtain its ESN. The product label position varies with the Huawei
server model. For details, see the user guide of a specific server.
– Figure 8-3 shows the product SN of a rack server.
– Figure 8-6 shows the product SN of an X6800. In Figure 8-6, (1) is the
product label of the server, and (2) is the product label of a server node.
– Figure 8-7 shows the product SN of an E9000. In Figure 8-7, (1) is the
product label of the server, and (2) is the product label of a compute
node.
The product labels of switch modules and MM910s are on their ejector
levers.
● Use the iMana 200 WebUI.
NOTE
a. Log in to the iMana 200 WebUI. For details, see 8.8 Logging In to the
iMana 200 WebUI.
b. On the Overview page, view the product SN of the server, as shown in
Figure 8-8.
This method applies only to E9000 servers whose MM910 version is (U54) 2.20 or
later.
a. Log in to the MM910 WebUI. For details, see 8.11 Logging In to the
MM910 WebUI.
b. Choose Chassis Information > Manufacturing Information and view
the product SN of the server, as shown in Figure 8-11.
● This method applies only to E9000 servers whose management module is the
MM920/MM921.
● Before the operations, add the MM920/MM921 to FusionDirector.
e. Click the Device tab and click Server, Management Module, and Switch
Module respectively to view the SNs of the compute node, management
module, and switch module, as shown in Figure 8-14.
Procedure
Step 1 Use PuTTY to log in to the server. For details, see 8.15 Logging In to a Server
Over a Network Port by Using PuTTY or 8.17 Logging In to a Compute Node,
Passthrough Module, or Switch Module by Using the SOL Function of the
MM910.
Step 2 On the iMana 200 CLI, run the imtool command (for versions earlier than 7.01) or
the ipmcset -t maintenance -d imtool command (for 7.01 and later versions).
Information similar to the following is displayed:
root@BMC:/#ipmcset -t maintenance -d imtool
tar: removing leading '/' from member names
Tar result information success.
iMana:/->
Step 3 Use a cross-platform file transfer tool to connect to the iMana 200 IP address.
In this document, WinSCP is used as the cross-platform file transfer tool. For
details, see 8.19 Using WinSCP to Transfer Files.
Step 4 Download the tar.gz package in the /tmp directory on iMana 200 to a directory on
the local PC. See Figure 8-15.
----End
Table 8-1 One-click information collection by the iBMC for each server
Server Series One-Click Information Description
Collection
X6800
----End
Procedure 2 (For iBMC V561 and Later or iBMC V3.01.00.00 and Later)
Step 1 Log in to the iBMC WebUI. For details, see 8.9 Logging In to the iBMC WebUI.
Step 2 Choose Home. The Home page is displayed, as shown in Figure 8-17 or Figure
8-18.
Step 3 Click One-Click Info Collection in the Shortcuts area to download the collected
maintenance information.
----End
Procedure
Step 1 Log in to the MM910 WebUI. For details, see 8.11 Logging In to the MM910
WebUI.
Step 2 Choose System Management on the menu bar, choose SEL Information in the
navigation tree, and click the SMM tab and then the One touch collect tab.
Step 3 On the log collection page, choose Collect All > Start.
Log collection takes about 20 minutes. When log collection is complete, a log file
named one_touch_info_all.tar.gz is displayed in the File Name area.
Step 4 Click the log file name and download it to the local PC as prompted.
NOTE
For MM910 earlier than (U54) 2.20, you need to collect logs of both the active and standby
HMMs.
----End
Procedure
Step 1 Log in to the MM910 WebUI. For details, see 8.11 Logging In to the MM910
WebUI.
Step 2 Choose System Management > Information Collection, and set log collection
parameters.
● Select MM for Collected from.
● Select One-click full collection for Collected content.
Log collection takes about 20 minutes. When log collection is complete, a log file
named one_touch_info_all.tar.gz is displayed in the File Name area.
Step 4 In the dialog box displayed, download the log file to the local PC as prompted. (In
some browsers, the log file is automatically saved in the default directory.)
----End
Prerequisites
The MM920 or MM921 has been managed by FusionDirector.
Procedure
Step 1 Log in to the FusionDirector WebUI. For details, see 8.12 Logging In to the
FusionDirector WebUI.
Step 2 Choose Menu > Alarms and Logs > Log. The Log page is displayed.
Step 3 Click Collect Log. In the displayed dialog box, click OK.
The Task area is displayed on the right of the page, showing the progress and
status of the log collecting task.
When the task is complete, a message indicating success is displayed.
Step 4 Click Export Log to export the log information to a local directory.
----End
NOTE
Use the MM510 CLI to collect information about the MM510 and heterogeneous nodes in
batches. To collect information about the server, MM510, and heterogeneous nodes in
batches, use the iBMC. For details, see 8.3 Using iBMC to Collect Information in Batches.
Prerequisites
You have logged in to the CLI of the MM510. For details, see 8.13 Logging In to
the MM510 CLI.
Example
# One-click information collection
iBMC:/->ipmcget -d diaginfo
Download diagnose info to /tmp/ successfully.
Prerequisites
Conditions
If the remote control function is required, ensure that the OS, browser, and Java
Runtime Environment (JRE) of the required versions have been installed on the
local PC. Table 8-2 shows the system configuration requirements of the local PC.
OS Software Version
OS Software Version
NOTE
If the JRE does not meet requirements, download and install a proper Java version by
referring to Table 8-2.
Data
Table 8-3 lists the required data before you log in to the iBMC WebUI.
Procedure
Step 1 Connect the local PC to the iMana 200 management network port on the server
by using a crossover cable or twisted pair cable.
Figure 8-19 shows the network diagram.
Step 3 In the address box, enter the iMana 200 address in the format of https://IP
address of the iMana 200 management network port on the server (for example,
https://192.168.2.100).
NOTE
● If the message "There is a problem with this website's security certificate" is displayed,
click Continue to this website (not recommended).
● If the Security Alert dialog box indicating a certificate error is displayed, click Yes.
Step 5 On the iMana 200 login page, enter the username and password.
NOTE
The user account will be locked after five consecutive login failures caused by incorrect
passwords. If your user account is locked, log in again 5 minutes later.
You can click Reset to clear the information entered on the User Login page.
The Overview page is displayed. The login username is displayed in the upper
right corner of the page.
----End
Prerequisites
Conditions
The local PC that uses the remote control function must have the Java runtime
environment (JRE) and the browser of the required version. For details, see the
corresponding iBMC User Guide.
Data
Table 8-4 lists the required data before you log in to the iBMC WebUI.
Step 1 Connect the local PC to the iBMC management network port on the server by
using a crossover cable or twisted pair cable.
Step 3 In the address box, enter the IP address of the server iBMC management network
port (for example, https://192.168.2.100) and press Enter.
● If the message "There is a problem with this website's security certificate" is displayed,
click Continue to this website (not recommended).
● If the Security Alert dialog box indicating a certificate error is displayed, click Yes.
Step 4 On the login page, enter the username and password for logging in to the iBMC
WebUI.
NOTE
The user account will be locked after five consecutive login failures with wrong passwords.
If your user account is locked, log in again 5 minutes later.
----End
Procedure 2 (For iBMC V561 and Later or iBMC V3.01.00.00 and Later)
This section uses a PC running Windows 7 and Internet Explorer 11 as an example.
Step 1 Open Internet Explorer, enter the iBMC management network port address
https://ipaddress/ in the address box, and press Enter.
NOTE
NOTE
If a website security alert is displayed, you can ignore this message or perform any of the
following to shield this alert:
● Import a trust certificate and a root certificate to the iBMC. For details, see "Importing
the iBMC Trust and Root Certificates" in the corresponding iBMC User Guide.
● If no trust certificate is available and network security can be ensured, add the iBMC to
the Exception Site List on Java Control Panel or reduce the Java security level. This
operation, however, poses security risks. Exercise caution when performing this
operation.
Step 3 On the login page, enter the username and password for logging in to the iBMC
WebUI.
Step 4 Select Local iBMC from the Domain drop-down list.
Step 5 Click Log In.
The Home page is displayed.
----End
Data
The following data is required:
● IP address of the server to be connected
● User name for logging in to the server to be connected. The default username
is admin.
● User password for logging in to the server to be connected. The default user
password is Huawei12#$.
Tool
Java plug-in: This tool is third-party software. You need to prepare it by yourself.
JRE 1.8 or later is required.
Procedure
Step 1 Connect a client (for example, a local PC) to the management network port of the
management module by using a network cable.
Step 2 In this displayed security alert dialog box, click Allow to allow web access.
Step 3 In the displayed security alert dialog box, select Do not block this program.
Step 4 In the address box of the PC browser, enter https://IP address of the FC switching
plane and press Enter.
The login dialog box is displayed, as shown in Figure 8-25.
Step 5 Enter the username and password, and click Add Fabric.
----End
NOTE
● The user account will be locked if incorrect passwords are entered for five consecutive
times. The user account will be automatically unlocked in 5 minutes, but cannot be
forcibly unlocked. If you attempt to enter a password again within 5 minutes, the lock
duration is reset to 5 minutes no matter whether the entered password is correct.
● The WebUI of the standby MM910 (displayed as "This is the standby MM.") does not
display component installation status. After logging in to the WebUI of the standby
MM910, you can view the status of the active MM910 and perform the following
operations for the standby MM910: Set the DHCP parameters and a static IP address,
set and query the thresholds and hysteresis of threshold sensors, collect system
operating information, and upgrade the management software. To perform other
operations, log in to the WebUI of the active MM910.
Data
You have obtained the following data:
● Username for logging in to the server to be connected. The default username
is root.
● User password for logging in to the server to be connected. The default user
password is Huawei12#$.
Procedure
Step 1 Connect the Ethernet port on the local PC to the MGMT ports on the active and
standby MM910s over the local area network (LAN).
NOTICE
If the active MM910 MGMT port has been connected to the network by using a
network cable and the client needs to be directly connected to the MM910, do not
directly disconnect the network cable from the active MM910 MGMT port that has
been connected to the network. Otherwise, an active/standby MM910 switchover
will be triggered, which may cause network interruption. You are advised to
connect the client to the active MM910 STACK port in the chassis by using a
network cable. If the active MM910 STACK port in the chassis has been connected
to the MGMT port in another chassis, use an idle active MM910 STACK port in
another chassis.
NOTE
Step 2 Set the IP address and subnet mask or route information for the local PC so that
the local PC can communicate with the MM910 properly.
Step 3 On the menu bar of Internet Explorer, choose Tools > Internet Options.
The Internet Options dialog box is displayed.
NOTE
This section uses a PC running Windows 7 and Internet Explorer 8.0 as an example.
Figure 8-28 Logging in to the HMM WebUI (MM910 (U54) 2.20 or later)
Figure 8-29 Logging in to the HMM WebUI (MM910 earlier than (U54) 2.20)
----End
Prerequisites
Conditions
Precautions
Procedure
Step 1 Connect the Ethernet port of the PC to a management network port of the active
or standby MM920/MM921 over the LAN.
The 10GE optical port and MGMT port on the MM920/MM921 panel are
management network ports. This section uses the MGMT port as an example.
Step 2 Set an IP address and a subnet mask or add route information for the PC so that
the PC can communicate with FusionDirector.
Step 3 Open the browser, enter https://ipaddr in the address box, and press Enter.
NOTE
● ipaddr indicates the address used to access the FusionDirector WebUI. It can be in either
of the following formats:
– IPv4 address in dotted-decimal format XXX.XXX.XXX.XXX.
– Fully qualified domain name (FQDN) of FusionDirector.
● The browser may display a message indicating that the website has a security certificate
error. Ignore this error and continue the login if the IP address is correct.
Password Specifies the password of the user. For security purposes, change
the password periodically.
NOTE
● If the username or password is incorrect, you need to enter a verification code in the
second login attempt. If the verification code is not clear, click to refresh the
verification code.
● If you enter incorrect passwords for three consecutive times, the account will be locked
for 5 minutes. If the account is locked, try again later or contact the administrator.
----End
Prerequisites
When logging in to the HMM CLI, ensure that:
● If you log in to the CLI over SSH, a maximum of five concurrent users are
supported.
● To log in to the CLI over the network port, you must connect the network port
on the configuration terminal to the network port on the server by using a
network cable, and ensure that the IP addresses of the two network ports are
on the same network segment.
● To log in to the CLI over the serial port, you must connect the serial ports of
the terminal and the server by using a serial cable.
Login Method
● Login over SSH
● Login over the local serial port
NOTE
● The HMM provides one default user Administrator, and the default password is on
the product nameplate.
● The system locks a user account if the user enters incorrect passwords for five
consecutive times. The user is automatically unlocked 5 minutes later, or an
administrator can unlock the user on the CLI.
● For security purposes, change the initial password after the first login and change
your password periodically.
At the initial startup of the HMM, wait for about 3 minutes before you log in to the CLI.
● If the client uses Windows:
a. Download and install the SSH client communication tool.
b. Connect the client to the management network port on the server.
c. Enter the IP address, username, and password of the management
network port on the client communication tool.
– Parity: None
– Stop bits: 1
– Flow control: None
Figure 8-35 lists the parameters to be specified.
Prerequisites
The RMC is operating properly.
Data
● IP address of the RMC management network port. The default IP address is
192.168.2.100.
● RMC user names and passwords
The RMC provides four default users:
– User root (default password: Huawei12#$)
– User admin (default password: Huawei12#$)
– User operator (default password: Huawei12#$)
– User taobao (default password: Huawei12#$)
Tool
A terminal tool (for example, PuTTY) has been installed on the PC. This tool is
third-party software. You need to prepare it by yourself. PuTTY 0.60 or later is
required for login over a serial port.
Document
For details about the RMC, see the X8000 Server RMC Command Reference.
The PuTTY window is displayed, prompting "login as:" for you to enter a user
name.
----End
Step 4 In the Host Name (or IP address) text box, enter the IP address of the RMC
management network port.
The PuTTY window is displayed, prompting "login as:" for you to enter a user
name.
----End
NOTE
The server in this section can be a management module, compute node, or switching plane.
Prerequisites
Conditions
The PC and the server or MM910/MM920/MM921 management network port
have been connected by using a network cable.
Data
You have obtained the following data:
● You have obtained the IP address of the server to be connected.
● You have obtained the user name and password for logging in to the server to
be connected.
Software Tools
PuTTY.exe: This tool is third-party software. You need to prepare it by yourself.
Procedure
Step 1 Set an IP address and a subnet mask or add route information for the PC so that
the PC can properly communicate with the server.
You can run the Ping Server IP address command on the PC CLI to check the
communication between the PC and the server.
Step 2 Double-click PuTTY.exe.
The PuTTY Configuration window is displayed, as shown in Figure 8-38.
Configure Host Name and Saved Sessions, and click Save. You can double-click the saved
record under Saved Sessions to log in to the server the next time.
Step 4 (Optional) After logging in to the Ethernet plane by using PuTTY, if you fail to
delete characters on the CLI by using the Backspace key, choose Terminal >
Keyboard, and select Control-H under The Backspace key, as shown in Figure
8-39.
NOTE
● If this is your first login to the server, the PuTTY Security Alert dialog box is displayed.
Click Yes to proceed.
● If an incorrect user name or password is entered, you must set up a new PuTTY session.
----End
By default, the server serial port is the OS serial port. For details about how to redirect the
server serial port, see "Querying and Redirecting the Serial Port (serialdir)" in the iBMC
User Guide.
Scenarios
Use PuTTY to log in to the server over a serial port in either of the following
scenarios:
The server in this section can be a management module, compute node, or switching plane.
Prerequisites
Conditions
Data
You have obtained the user name and password for logging in to the server to be
connected.
Software Tools
Procedure
Step 1 Double-click PuTTY.exe.
Step 2 In the navigation tree on the left, choose Connection > Serial.
● Stop bits: 1
● Parity: None
● Flow control: None
In COMN, N indicates the serial port number, and the value is an integer.
Step 4 In the navigation tree, choose Session.
Step 5 Select Connection type in Serial, as shown in Figure 8-40.
----End
Prerequisites
Conditions
● You have logged in to the MM910 CLI by using the floating IP address of the
MM910.
● There is no jumper cap over the pins on the mainboard of the compute node,
passthrough module, or switch module.
Data
● User name and password for logging in to the management module. The
default user name of the MM910 is root, and the default password is
Huawei12#$.
● User name and password for logging in to the compute node to be
connected. The default user name is root, and the password is Huawei12#$.
● Password for logging in to the passthrough module or switch module to be
connected The default password is Huawei12#$.
Procedure
Step 1 Use an SSH tool and the floating IP address of the MM910 to log in to the
MM910 CLI.
In this document, PuTTY is used as the SSH tool. For details, see 8.15 Logging In
to a Server Over a Network Port by Using PuTTY.
telnet 0 1101
*=====================================================================*
* Welcome to SMM SOL Server *
* Please log in with SMM account and password. *
*=====================================================================*
user name:
NOTICE
If you need to disconnect the service terminal or server power after logging in to
the SOL screen, exit the SOL screen first. Otherwise, re-logging in to the SOL
screen will fail.
*=====================================================================================
======================
please input the SOL Blade1~Blade16(1 ~ 16), Blade1A~Blade16A(17 ~ 32), Swi1~Swi4(33 ~ 36) and
COM#(n)
press Ctrl+R to return
*=====================================================================================
======================
Blade1~Blade16(1 ~ 16)
Blade1A~Blade16A(17 ~ 32)
Swi1~Swi4(33 ~ 36)
Please input your choice:
Step 4 Enter the slot number of the compute node, passthrough module, or switch
module, and press Enter.
● If you enter a compute node slot number, the following serial port
information is displayed:
1 systemcom
2 RAIDcom
3 BMCcom
4 Exboardcom
Or
1 SYS COM
2 BMC COM
Or
1 systemcom
2 BMCcom
● If you enter a switch module slot number, the following serial port
information is displayed:
1 BMCcom
2 fabriccom
3 basecom
4 FCcom
Or
1 BMCcom
2 fabriccom
Or
1 BMCcom
2 fabriccom
3 basecom
● If you enter a passthrough module slot number, the following serial port
information is displayed:
1 BMCcom
Step 5 Enter the value representing the serial port to be connected, and press Enter.
The serial port screen is displayed. On this screen, you can perform operations
such as configuration and query.
NOTE
You can press Ctrl+R once to return to the slot number selection screen shown in Step 3, or
press Ctrl+R twice to exit the SOL screen.
----End
Prerequisites
Conditions
Data
Procedure
Step 1 Use an SSH tool and the floating IP address of the MM920/MM921 to log in to
the CLI.
In this document, PuTTY is used as the SSH tool. For details, see 8.15 Logging In
to a Server Over a Network Port by Using PuTTY.
Step 2 Run the ipmcget -l bladeN -t SOL -d cominfo or ipmcget -l swiN -t SOL -d
cominfo command to query the SOL port information of the compute node, pass
through module, or switch module.
Step 3 Run the ipmcset -l bladeN -t sol -d activate -v com_value or ipmcset -l swiN -t
sol -d activate -v com_value command to enter the serial port input interface.
Step 4 Enter the username and password as prompted.
----End
Prerequisites
Conditions
The Secure File Transfer Protocol (SFTP) service has been enabled on the
destination device.
Data
You have obtained the following data:
● You have obtained the IP address of the server to be connected.
● You have obtained the user name and password for logging in to the server to
be connected.
Software Tools
WinSCP.exe: This tool is third-party software. You need to prepare it by yourself.
Procedure
Step 1 Open the WinSCP folder, and double-click WinSCP.exe.
The WinSCP Login dialog box is displayed, as shown in Figure 8-41.
NOTE
● Host name: Enter the IP address of the server to be connected. For example,
192.168.2.10.
● Port number: The default value is 22.
● User name: Enter the username. For example, admin123.
● Password: Enter the password. For example, admin123.
● Private key file: This parameter is left blank by default. Retain the default
value.
● Protocol: Retain the default option SFTP in the File protocol drop-down list,
and select Allow SCP fallback.
NOTE
● If a private key file is not selected at the first login, the warning message "Continue
connecting and add host key to cache" is displayed. Click Yes. The WinSCP file transfer
window is displayed.
● On Windows 7, C:\Users\Administrator\Documents on the local PC is opened in the
left pane, and /root on the server is opened in the right pane by default.
Step 4 In the left and right panes, create, delete, or copy folders in specific directories as
required.
----End
Prerequisites
● A PC is connected to the server by using a serial cable.
● WFTPD has been installed.
Software Tools
wftpd32.exe: used to transfer files between different platforms, for example, from
a PC to a switching plane of a switch module. This tool is third-party software. You
need to prepare it by yourself.
Procedure
Step 1 Double-click wftpd32.exe.
Step 3 Select all check boxes except Winsock Calls, and click OK.
Step 5 Click New User. In the displayed dialog box, enter a new username (for example,
vxworks) and click OK.
Step 6 Enter a new password (for example, vxworks) in the New Password and Verify
Password text boxes, and click OK.
Step 7 Copy the upgrade file to a directory (for example, D:\FTP) on the PC.
NOTE
Step 8 Select vxworks from the User Name combo box, and enter the upgrade file
directory (for example, D:\FTP) in the Home Directory text box. See Figure 8-43.
----End
Prerequisites
The SFTP service has been enabled on the destination device.
Software Tools
mini-sftp-server.exe (free software)
Procedure
Step 1 Double-click mini-sftp-server.exe.
The Core FTP mini-sftp-server dialog box is displayed, as shown in Figure 8-44.
----End
9 Other Resources
News
For notices about product life cycles, warnings, and updates, visit Support >
Bulletins > Product Bulletins.
Cases
For details about existing cases, see the Computing Case Library.
NOTE
The Computing Case Library is available only to Huawei engineers and partners.
▪ Hotline: 400-822-9999
▪ Email: support_e@huawei.com
– Enterprise customers outside China can obtain the customer service
information from: Global Service Hotline.
– Carrier customers in China can contact Huawei in the following ways:
▪ Hotline: 400-830-2118
▪ Email: support@huawei.com
– Carrier customers outside China can obtain the customer service
information from: Global TAC Information.
● Contact the technical support personnel of the local Huawei office.