Professional Documents
Culture Documents
Introduction To Basic
Troubleshooting
BCN-B
PHYSICAL DESIGN OF ONE MODULE
2x USB
1x RJ-45 Software download
Hardware maintenance
Debugging
interfaces
• CFPUs and EIPUs are the only add-in cards that have
Transport and IP Layer ref. TCP/IP Protocol
• CFPUs are directly linked to LAN1 ports for OAM
connection to OMS and NetAct
• EIPUs are directly linked to SFP11 to SFP22
O&M
CONNECTIVITY
• Know the EIPUs in Quantity and Positions
>show hardware inventory list brief
Know which EIPUs are for which Roles and connected
to which SFP ports
> show networking interface
USEFUL
• Check Backplane Inter-Module Cabling COMMANDS TO
1. Go to directory /opt/nokiasiemens/SS_QNTools/script
2. Execute command ./zcablecheck.sh UPDATE
• How to check mcRNC clustering (BACKPLANE) KNOWLEDGE AND
1. Go to directory /opt/nokiasiemens/SS_QNTools/script
2. ./zbackb.sh TROUBLESHOOTING
• How to check SFP port state
Note: Each BCN1=m10=1; BCN2=m20=1; BCN3=m30=3; BCN4=m40=4 etc
#zjane 1 ZLAI
#zjane 2 ZLAI
#zjane 3 ZLAI
#zjane 4 ZLAI
etc
• mcRNC HW STATUS
1. Check status of add-in cards
#hwcli --t
2. Check all add-in cards and managed objects
#hwcli
BCN-B
All add-in cards in the mcRNC look the same – CFPU, CSPU, USPU & EIPU
Processor add-in
card (BMPP2-B) –
Octeon 2 variant B
• Located at the front of the BCN module in the cooling air inlet
• Prevent dust from accumulating inside the equipment
• Meets the NEBS GR 63 CORE and GR 78 CORE requirements
Network
Architecture
Check Associations for all external connectivity (ACESS (IUB) & CORE)
>show troubleshooting z-commands zassoc
Check Bi-directional Forward Detection status
> show troubleshooting z-commands zbfd
Check IUB Transport
> show troubleshooting z-commands zbts
List measurement files waiting for transfer to OMS
> show troubleshooting z-commands zifo meafile -f
List measurement files in temporary working directory
Z-Commands
> show troubleshooting z-commands zifo meafile -r
List measurement files transferred to OMS
> show troubleshooting z-commands zifo meafile -t
Check SFP Port Status
> show troubleshooting z-commands zjane box 10 zlai
Check SFP Interface to Port Mapping
> show troubleshooting z-commands zjane box 10 zmap
Check Switch (Main & Extension running configuration)
> show troubleshooting z-commands zjane box 10 zr
Check L2 Packet Count Statistics
> show troubleshooting z-commands zjane box 10 zs port portnumber 0/7 targetswitch e
Check Detailed L2 Packet Count Statistics
> show troubleshooting z-commands zjane box 10 zss show-switch-port-counters port-name portname SFP13
Check Up Times of BCN and ADD-IN cards
> show troubleshooting z-commands zjane box 10 zu
Check Memory Utilization – ctrl+c to end it
> show troubleshooting z-commands zmem
Check Recovery Groups States Z-Commands
> show troubleshooting z-commands zrg state -
-all - list all recovery groups
-d - list disabled recovery groups
-e - list enabled recovery groups
-l - list locked recovery groups
-u - list unlocked recovery groups
Symptom Data Reports
- This is one of the items you will need to provide when asking for support from Nokia or Support Team. It comes
under the category of Problem Reporting
- Can also be used to guide our own investigation and troubleshooting if you can read the logs
SYMPTOM DATA REPORTS
PRACTICAL SESSION
Standard symptom report is a framework used to collect symptom/behavior data
from Multicontroller RNC to support troubleshooting of problems.
Plugins are specific internal programs designed to do checks, monitor and collect
SYMPTOM DATA
logs. Some of the plugins are put into groups, thus it is possible to collect the REPORTS
symptom data on the group level too.
Most of the Multicontroller RNC configuration data and logs essential for
investigation of the problem can be collected with the standard symptom plugin
report group-RNC and subreport-MessageMonitor
group-RNC is a group plugin which contains all the plugins below;
subreport-rnchw
subreport-rncinfo
subreport-rncipconfig
subreport-rncsignaling
subreport-rncrnw
subreport-rnchas SYMPTOM DATA
subreport-rncalarm
subreport-rncuplane
REPORTS
subreport-rncmon
subreport-ipmgmt
subreport-networkresiliency
The name of the plugin defines its function and expected log.
The framework allows easy enhancement with additional plugins, if the available
standard plugins are not sufficient.
The collected symptoms data is stored in /mnt/backup/stdsymp directory.
The data must be collected as soon as possible after an abnormal situation has
taken place and before any recovery action is performed, such as Multicontroller
RNC restarts or replacing hardware.
This is important because the information stored about the problem
may get overwritten in the process of time or be lost because of
recovery actions.
The standard symptom report group-RNC is expected to be SYMPTOM DATA
completed in around 15 minutes depending on Multicontroller RNC configuration.
Recovery actions can be started, if needed, as soon as symptom report generation
REPORTS
is completed.
Save or collect the standard symptom reports.
To save (collect) the standard symptom report, enter the following command:
save symptom-report <name> include <plugin>
Often what is basic for Nokia when Collecting Standard Symptom Report is: COLLECTING
STANDARD
save symptom-report RNC260 include group-RNC include subreport-MessageMonitor
SYMPTOM
NB: REPORTS
(1) name
(a) cannot exceed 25 characters and must start with a letter
(b) cannot be duplicated
(2) IF you are in CFPU-0 and you collect symptom report, it will be store in
/mnt/backup/stdsymp directory under CFPU-0
List the symptom reports
To list all the standard symptom reports, enter the following command:
>show symptom-report all
2. Log into the CFPU node where the HDD is NOT faulty.
6. Wait until the hot swap LED turns into a solid blue. This may take a few seconds.
Removing AMC
7. Pull the hot swap handle again more firmly and slide the AMC out of the bay
Removing AMC
8. If you are not installing another AMC immediately, install an AMC filler into the
empty AMC bay.
This is to ensure adequate cooling and a proper EMC shield in the module.
Procedure
1. Place the AMC so that the faulty hard disk drive side is facing down. Unscrew the
four screws on the metal bracket of the AMC module, then turn the module over
carefully while holding the hard disk drive.
1. Connect the new hard disk drive to the SAS connector of HDSAM-A.
Connect the new hard disk drive to the SAS connector in the HDSAM-A by pushing it
gently
1 Check that the EMC gasket is correctly in place and that its contacts are clean. 2
Insert the AMC into the bay, sliding it along the guide rails as shown in the figure
below. Make sure that the AMC is firmly seated in the module’s connectors.
4. Enable network boot for the CFPU node with the new HDD.
Installing AMC HDD
To enable the network boot for the node with the new hard disk drive, enter the
following commands:
a) Log in as root.
> set user username root
b) b) Power on the node.
> hwcli -np on <CFPU-X>
Wait a few seconds before proceeding to the next step.
c) Reset the node.
> hwcli -nr -B 3 <CFPU-X>
d) Exit root.
> exit
Installing AMC HDD
5. Disable the watchdog on the CFPU node with the new HDD.
To disable the watchdog on the node with the new hard disk drive through SSH,
enter the following command: ssh \ wdctl -d
6. Initialize the new HDD from the other CFPU node where the hard disk is not faulty.
To initialize the new disk, enter the following fsclish command:
> initialize hw
The following output is displayed:
Hardware successfully initialized
Note: To run the initialization script and display the console output, the space bar
must be pressed several times after entering the command.
7. Reboot the CFPU node with the new HDD from the local HDD.
Enter the following commands: Installing AMC HDD
> set user username root
# hwcli -nr -B 2 <CFPU-X>
# exit
The node will restart and synchronize the Distributed Replicated Block Devices
(DRBD). You can enter the watch -n 10 cat /proc/drbd to see how the
synchronization is progressing. However, if the watch -n 10 cat /proc/drbd
command fails, the cat /proc/drbd command must be executed.
Note: Set user username root must first be executed before
watch -n 10 cat /proc/drbd.
Do not restart the node during the DRBD synchronization. The initialization process
of the new disk is not ready until the synchronization is successfully completed.
8. Check that serving and backup CMF (Cluster Management Functionality) are
working normally. This can be done by comparing Managed Objects CFPU-0 and
CFPU-1. Enter:
show cmf status recovery-unit node-name <mo-name>
Compare the blocks and they should match for both managed objects.
9. Unlock the node with the new hard disk drive.
Enter: set has unlock managed-object
Example:
> set has unlock managed-object /CFPU-1
The following output is displayed: /CFPU-1 unlocked successfully.
Note: Hard-disk failure alarm will be cleared manually and observed over a period for
confirmation.
Thank you