Professional Documents
Culture Documents
3g Troubleshooting Capacity PDF Free
3g Troubleshooting Capacity PDF Free
Acknowledgements
The following individuals are responsible for contribution to the specifications, design and implementations
represented in the various revisions:
Declan Quinn (FSC)
Justin Clayden (West)
Dominador Galicinao (West)
Tim Zhang (FSC)
Revision Code
The revision number will reflect the modifications by following the format Rev. x, y, where
X is the first digit, incremented for changes of substance, i.e. technical/procedural issues.
Y is the second digit, incremented when editorial only changes have been incorporated.
All draft/preliminary versions are 0.n; the first final version is Revision 1.0.
Revision History
Rev. Date Author Information
1.0 5/22/2009 Declan Quinn/Justin Initial Draft
Clayden (WR)
1.1 6/5/2009 Declan Quinn 1. Minor typo corrections
2. Addition of Scope section
3. Addition of 384/128 RAB Reduction for expanding
capacity
Table of Contents
1. Introduction............................................................................................................................4
1.1. Purpose & Scope 4
1.2. Definitions for this Document 4
2. Capacity..................................................................................................................................7
2.1. T-PIM Reports 7
2.2. Scope 7
2.2.1. In Scope 7
2.2.2. Out of Scope 7
2.3. Troubleshooting Flowchart 8
3. Radio/Air Interface Capacity.......................................................................................................9
3.1.1. RNC/Market/Region Level Reporting and Overview 13
3.1.1.1. Admission Control Issues 13
3.1.1.2. DL Power/DL Channelization Codes Issues 13
3.1.1.3. Soft Handover Overhead (SHO) 14
3.1.1.4. Cell Congestion 14
3.1.1.5. Received Total Wideband Power (RTWP) 15
3.1.2. Cell/RBS Analysis 16
3.1.2.1. Failures due to Admission Control 16
1.3.1.2.1. Voice Admission Control Failures 16
1.3.1.2.2. HSDPA Interactive Admission Control Failures 17
1.3.1.2.3. PS Interactive Admission Control Failures 17
3.1.2.2. Failures due to Lack of DL Power/DL Channelization Codes 17
3.1.2.3. High Soft/Softer Handover Overhead 18
3.1.2.4. Cell Congestion 18
3.1.2.5. High RTWP 19
4. RBS/Baseband Capacity...........................................................................................................21
4.1.1. RNC/Market/Region Level Reporting and Overview 25
4.1.1.1. RAB failures due a Lack of hardware resources 25
4.1.1.2. RRC denied - Insufficient Licensed Capacity 25
4.1.1.3. RRC denied – Node Blocking 25
4.1.2. RBS Analysis 25
4.1.2.1. Lack of hardware resources /Insufficient Licensed Capacity 25
4.1.2.2. RRC denied – Node Blocking 25
5. Transport/Backhaul Capacity....................................................................................................26
5.1.1. RNC/Market/Region Level Reporting and Overview 35
5.1.2. Iub/Transport Link Analysis 35
5.1.2.1. RRC and RAB TN Congestion/Blocking (All Service Types) 35
5.1.2.2. Iub Congestion (UL and DL) 35
5.1.2.3. AAL2 QoS A – D Setup Failures 36
5.1.2.4. ATM Lost Cells (Transmitted and Received) 36
6. RNC Capacity.........................................................................................................................37
7. Capacity Management Tool......................................................................................................40
8. Troubleshooting Tools.............................................................................................................41
9. References.................................................................................................................................42
1. Introduction
1.1. Purpose & Scope
The intent of this document is to provide UMTS Trouble Shooting and Optimization from KPI and
Counter perspectives for Ericsson (E///) Capacity and provide detailed analysis strategies for
identifying reason for the KPI trends and offering guidelines for improving performance.
The KPI/Counters described here are applicable to the P6 release of the Ericsson UTRAN.
This document is not all inclusive and is only intended to provide a quick cook book to understand
available E/// for trouble shooting and optimization best practices Guideline Document. For any
information not covered here, the Ericsson product documentation (CPI/ALEX Libraries) should be
referenced.
AS Active Set
BH Busy Hour
CN Core Network
DL Downlink
IE Information Element
LA Location Area
RA Routing Area
RB Radio Bearer
RBS Radio Base Station – another name for the
Term or Acronym Definition
Node B
RF Radio Frequency
RL Radio Link
TRX Transceiver
TX Transmit
UE User Equipment
UL Uplink
2.2. Scope
2.2.1. In Scope
The scope of this document includes methods to improve capacity by optimization of the existing
resources.
Capacity Issues
Radio/Air RBS/ Transmission
RNC
Interface Baseband /Backhaul
Capacity
Capacity
Capacity Capacity
Failures due to
Lack of DL RRC denied -
Power/DL Insufficient MP Load
Channelization Iub Congestion
Licensed Trending
Codes Capacity
High RTWP
Ericsson Capacity
Flowchart.pptx
3. Radio/Air Interface Capacity
The main metrics for the Radio and Air Interface Capacity are contained in the following reports in T-
PIM:
Accessibility
Quality
Congestion
The main report for identifying and troubleshooting Radio and Air Interface capacity issues in T-PIM is
the Accessibility Report. This contains the Admission Control and Lack of Downlink Power/
Channelization Codes Access Failure counters.
Most of these KPI’s are made up of single counters taken from the RNC or RBS. The list of Radio/Air
Interface KPI’s are shown below:
Individual counter description for those counters not found below can be found in the ALEX
Libraries.
10000
8000
6000
4000
2000
0
05.01.2009 05.02.2009 05.03.2009 05.04.2009 05.05.2009 05.06.2009 05.07.2009 05.08.2009 05.09.2009 05.10.2009
Sum of PS Interactive RAB Failures due to Admission Control Sum of Voice RAB Failures due to Admission Control Sum of HSDPA Interactive RAB Failures due to Admission Control
The Admission Control failure reasons can be broken down by bearer type. Individual troubleshooting
for these issues should be carried out on an individual cell level.
3500
3000
2500
2000
1500
1000
500
0
05.01.2009 05.02.2009 05.03.2009 05.04.2009 05.05.2009 05.06.2009 05.07.2009 05.08.2009 05.09.2009 05.10.2009
Sum of RAB failures - Lack of DL power Sum of RAB failures - Lack of DL Channelization code
Soft Handover Overhead (SHO)
One metric which can be used for capacity is the SHO metric. This shows when a UE has one or more
Radio Links (RL).
Soft Handover Ratio
1.8 44.5
1.6
44
1.4
43.5
1.2
43
1
0.8
42.5
0.6
42
0.4
41.5
0.2
0 41
05.02.2009 05.03.2009 05.04.2009 05.05.2009 05.06.2009 05.07.2009 05.08.2009 05.09.2009 05.10.2009 05.11.2009
The % of UE’s in SHO can also be used to determine the possible capacity requirement. The limitation is
a trade-off between the capacity used by a single UE compared to the risk of dropping the call by losing
the Radio Link if there is only one in the Active Set.
Cell Congestion
This metric gives an indication of when the cell is in congestion in either the UL or the DL. This may be an
indication of High UL RTWP (UL Congestion), High TX Power [Non-HS Capable Cell] (DL Congestion), Non
HS Dl DCH Power Overload (Dl Congestion) or HS Overload (DL Congestion).
These can be found in the Common Measurement reports. A number of actions may be taken
automatically by the RNC to reduce congestion including the rate downswitch of Non-Guaranteed
services, release of calls occurring over the Iur and release of calls served on the RNC.
Cell Congestion - Aggregated
2500
2000
1500
1000
500
0
05.03.2009 05.04.2009 05.05.2009 05.06.2009 05.07.2009 05.08.2009 05.09.2009 05.10.2009 05.11.2009 05.12.2009
Sum of Cell Congestion DL + UL (sec) Sum of Total time the cell was congested in DL Sum of Total time the cell was congested in UL
On an RNC level, the Cell Congestion counter will be rolled up to provide an overall figure for the RNC’s
Cell’s Congestion. Much like the other metrics, this is useful for reporting at a high level, but cell level
analysis is required for troubleshooting.
Typically, the RTWP should be in the general range of -100 - -110 dBm.
-106.3662
Total
-106.5793
-106.7925
3.1.2. Cell/RBS Analysis
All of the metrics shown above should be used on a Cell level to accurately identify the worst offending
cells. The Worst Offenders in an RNC/Market/Region level should be ranked by the following metrics:
Number of Failures due to Admission Control
Number of Failures due to Lack of DL Power/DL Channelization Codes
High Soft Handover Overhead
Amount of time a cell is congested (UL, DL or Combined)
Cells with High RTWP (Hot Cells i.e. > -100 dBm)
Cell Congestion
Cell congestion in the UL and DL can be counted for a number of issues. For UL Time in Congestion, the
primary cause is High RTWP. This will be dealt with in Section .
For DL Time in Congestion, the main causes are:
High TX Power [Non-HS Capable Cell]
Non HS Dl DCH Power Overload
HS Overload
The following steps are a suggestion for troubleshooting:
1. Investigation of the Failures due to lack of DL Power (similar to )
a. Discussion with Regional team over a possible reduction in CPICH to reduce the overall
Power Per RL in the cell
2. Investigation into the transport counters to determine if there is congestion or resource
availability issues on the Iub. This may cause users to retain their RSBs for longer periods than
necessary
3. Investigation into the number of users/HS RAB’s in the cell to determine if the traffic is high
a. Work with the Regional Capacity team to determine if a new site will be required or if a
case for a 2nd carrier can be used for additional capacity
After these steps are performed, monitoring should continue on a cluster of cells around the affected
cell to determine the performance improvement.
High RTWP
RTWP gives total amount of UL power received by the Node B in the carrier (5 MHz) frequency. This
includes the following:
Power received from all the UEs in the Node-B’s vicinity
Any internal and/or external interference
Thermal noise
System Noise Figure (Including BTS and Antenna system)
High RTWP can cause a number of issues such as:
A Reduction in Node-B Sensitivity
An Imbalance between downlink and uplink
A Reduction in the UL capacity
A Reduction in a UE’s battery life
UE may need to transmit more power which would consume more battery
The following steps are a suggestion for troubleshooting:
1. Audit and correct the Incorrect ulAttenuation parameter
a. Note that RBS 3518 / 3418 with remote RRUs, typically, do not have a main feeder and
hence ulAttenuation parameter should be calculated based on Jumper cable length only
b. But if a 3518 /3418 has RRUs installed close to the Baseband unit, it might have main
feeder cable. Accurate feeder length needs to be identified to calculate ulAttenuation
and electricalulDelay parameters
2. If there is an External TMA (Non Ericsson ASC), the ExternalTMA Mo must be correctly
configured
a. The ulGain parameter must also be set correctly for these sectors
b. Sites without TMA must have the MO ExternalTma deleted
c. Incorrect settings for internalPower can also affect the TMA functions – If it is set to NO,
the TMA may be powered off
3. Audit the CIQ Database and rectify any incorrect configuration issues
a. Incorrect electricalulDelay and ulTrafficDelay parameter can have an indirect impact on
RTWP. Incorrect delay could reduce the Macro diversity gain which might make UE’s to
transmit more power
After these steps are performed, monitoring should continue on a cluster of cells around the affected
cell to determine the performance improvement.
4. RBS/Baseband Capacity
Other than the Radio and Air Interface Capacity metrics, a number of RBS Baseband elements can cause
capacity issues. Counter and metrics to indicate these issues are contained in the following reports in T-
PIM:
Accessibility
Node B
HSDPA Node B
The main report for identifying Baseband capacity issues where they relate to Radio in T-PIM is the
Accessibility Report. This contains the RAB Failure counters due to a lack of Ul or DL hardware,
insufficient licensed capacity or Node Blocking.
The Node B Report also displays the usage % of the RAX and TX Board Channel elements. This is an
average usage over the period the report is run.
Most of these KPI’s are made us of single counters taken from the RNC or RBS. The list of main RBS
Baseband KPI’s are shown below:
Individual counter description for those not found below can be found in the ALEX Libraries.
Individual counter description for those not found below can be found in the ALEX Libraries.
100000
80000
60000
40000
20000
In the second chart, following the RNC upgrade, we can drill down to an hourly level and the issue only
occurs for a short period (One ROP @ 0000). This may have been related to an upgrade or parameter
changes on the RNC.
RRC Reject Due to MP Load Control - Post RNC Upgrade
45000
40000
35000
30000
25000
20000
15000
10000
5000
Total
The MP Load can also be checked to determine if there were any issues with RNC Load during this
period.
7. Capacity Management Tool
Capacity Management tools and methods should also be used in improving capacity. These items
include:
– OSS-RC
– Measurement Results Recording – WCDMA (WMRR)
– This is a special measurement report run on any number of cells in the RNC and run for less
than 24 hours. This data provides:
• BLER for all service combinations
• UE Tx Power
• DL CPICH Ec/No & RSCP
xConfig should also be used to ensure all parameters are consistent with the FSC Baseline Set unless
otherwise agreed for performance reasons.
8. Troubleshooting Tools
The following tools can be used for troubleshooting:
Tektronix K-18
This is a protocol analyzer used for analysis of the Iu and Iub links and the layer 1, 2 and
3 messaging. This can be used for further analysis of the call flows
More detailed information on this can be found in the Tektronix Documentation
GPEH
The General Performance Event Handling (GPEH) tool is a feature in the Ericsson OSS
that provides capability similar to a protocol analyzer.
This can be run on an RNC level
More detail on this tool can be found in ALEX
CTR
The Call Trace (CTR) tool is a feature in the Ericsson OSS that provides capability similar
to the GPEH Tool
This can be run on a Cell level for specific cell troubleshooting. These files can be read in
Actix
More detail on this tool can be found in ALEX
UETR
The User Equipment Trace (UETR) tool is a feature in the Ericsson OSS that provides
capability similar to the GPEH Tool
This is run on a particular IMSI for specific issue troubleshooting, typically a test SIM
attempting to recreate problem conditions. These files can be read in Actix
More detail on this tool can be found in ALEX
9. References
1. 3GPP TS 25.331 V 5.19.0 “UMTS Radio Resource Control Protocol Specification”.
2. Ericsson ALEX
3. Allan Orbigo, Christophe Vidal, .
4. Alejandro Aguirre, Sireesha Panchagnula Ericsson UTRAN Parameters.
5. 3GPP TS 25.304 V 5.9.0 “UMTS UE Procedures in Idle Mode and Procedures for Cell Reselection
in Connected Mode”.
6. UMTS Network KPI, U12 UMTS Network KPI v6.14 080406TMO LR.doc
7. UMTS Network KPI Level 2, UMTS Network KPI Level-2_v1_20070205ERI_Updated.doc
8. UMTS RNC LCS KPI definitions and Formulas, Michael Gebretsadik
9. 3GPP TS 25.331, UMTS RRC protocol specification.
10. 3GPP TS 25.211 V 5.8.0, “UMTS Physical channels and mapping of transport channels onto
physical channels”.
11. 3GPP TS 25.413 V 5.12.0, “UTRAN Iu Interface RANAP Signaling”.
12. 3GPP TS 25.214, ”UMTS Physical Layer Procedures FDD”.
13. Ericsson Product Documentation, EN/LZN 733 0017 R4A.
14. Ericsson Product Documentation, “Guidelines for LA/RA/URA planning”
15. Ericsson Product Documentation 58/1551-AXD 105 03/1 Uen G, “Performance Statistics RNC
3810”.
16. Nabeel Lughmani, Pradeep Singh, Tim Zhang, “Paging Performance Guidelines”.
17. Sireesha Panchagnula, “UMTS Paging Concepts”.