Professional Documents
Culture Documents
TROUBLESHOOTIN
G
OVERVIEW
Objective
Agenda
CPP O&m Concepts
Protocols
O&m Client Services
Counters Overview
Performance Management
Iub over ATM
Initial Counters
Iub Analysis
Fail After Admission
IP Iub Throughput
Questions?
OBJECTIVE
Main idea is introduce to the transport engineer the basic concepts of
troubleshooting on Iub interface, by presenting initial counters and
KPIs, that could help to define which area needs further
investigations.
CPP is the Connectivity Packet Platform on which are based the following
nodes: RNC, RBS, MGW, RXI.
Information are read and stored into a SQL database on a daily basis.
PROTOCOLS
Protocols used for accessing these services:
› http
› unsecure protocols (unencrypted): telnet, ftp, iiop
› secure protocols (encrypted): ssh, sftp, ssliop
NODE
Hyper RS232
Terminal
HTTP (80)
MoShell Ethernet TCP/IP FTP (21) / SFTP (22) File system
or
IPoverATM
CM (Configuration Mgmt)
IIOP (56834) MIB
/ FM (Fault Mgmt)
SSL IOP (56836)
PM (Performance Mgmt)
Scanners
Figure 1 - Protocols
The O&M client services
RNC reserves a CID and the relevant bandwidth, and forwards the
establish request message through the AP. It will contain, the
allocated CID, the traffic descriptors and QoS
CID
iUb over atm
Because of standardization constrains, no more than 248 AAL2
connections can be simultaneously established on a single AAL2
path: more than 248 connections can be established between two
adjacent nodes if more than one AAL2 path is configured.
When an AAL2 connection is allocated on an AAL2 path, a Channel
Identifier (CID) is reserved and assigned by the node that is
originating or forwarding the AAL2 connection request.
Flow Control:
The Flow Control function has been conceived to dynamically adapt
transmission rate of Best Effort services to Iub available bandwidth by
reducing transmission rate during Iub congestion situations
Initial counter check
Recommended to check in an initial investigation as they will give clues
on whether the source of the problem is transport network based.
Aal2Ap pmUnSuccOutConnsLocalQosClassA/B/C/D
Aal2Ap pmUnSuccInConnsLocalQosClassA/B/C/D
Aal2Ap pmUnSuccOutConnsRemoteQosClassA/B/C/D
Aal2Ap pmUnSuccInConnsRemoteQosClassA/B/C/D
Initial counter check
The following counters show the BW utilization.
Iub interface
UniSaalTp pmNoOfLocalCongestions
NbapCommon pmNoOfDiscardedNbapMessages
Iublink pmTotalTimeIublinkCongestedDl
Iu/Iur interface
NniSaalTp pmNoOfLocalCongestions
Iub interface
UniSaalTp pmLinkInServiceTime
Iu/Iur interface
NniSaalTp pmLinkInServiceTime
Initial counter check
The following counter shows if Iub Bandwidth is limiting HS services, measured
in %.
OBS. if > 75% cause could be Iub capacity or Radio limitations.
IubDataStreams pmCapAllocIubHsLimitingRatioSpi<xx>
› ImaGroup pmGrUasIma
› E1PhyspathTerm,
E1Ttp,E3PhysPathterm pmEs
pmSes
pmUas
› Os155SpiTtp pmMsEs
pmMsSes
pmMsUas
pmMsBbe
› Vc12Ttp,Vc4Ttp pmVcEs
pmVcSes
pmVcUas
Iub analysis
The followingAAL2 flowchart
Setup Failure summarises an Iub link analysis
procedure based on AAL2 Setup failure rate
OK
Strict Admission Traffic examination.
No AAL2 Setup Failure
Counters
Aal2Ap::pmUnSuccOutConnsLocalQoSClass<x> (A/B/C/D)
Number of unsuccessful attempts to allocate AAL2 resources during
establishment of outgoing connections on this Access Point (AP). Caused by
Rejects in Connections Admission Control (CAC).
Aal2Ap::pmUnSuccOutConnsRemoteQoSClass<x> (A/B/C/D)
Number of unsuccessful establishments of outgoing connections on this AAL2
Access Point (AP).
Aal2Ap::pmSuccOutConnsRemoteQosClass<x> (A/B/C/D)
Number of successful establishments of outgoing connections on this AAL2
Access Point (AP).
AAL2 Setup Failure Rate
[ AAL2 _ Fail _ Rate _ Local _ ClassA]%
pmUnSuccOu
KPIs tConnsLocalQoSClassA *100%
pmSuccOutConns Re moteQoSClassA pmUnSuccOu tConnsLocalQoSClassA pmUnSuccOu tConns Re moteQoSClassA
There is a second method using Erlang Counters, that won’t be demonstrated on this
presentation.
Counters
Aal2Ap:: pmExisTransConns
The number of existing connections for the Access Point (AP) existing in the node.. Gauge
Counter
Aal2Ap:: pmExisOrigConns
Number of existing connections for the Access Point (AP) originating in this node.
Gauge Counter.
Aal2Ap:: pmExisTermConns
Number of existing connections for the Access Point (AP) terminating in this node.
Gauge Counter.
KPI
CID Utilization Estimate
[ pmExisOrigConns pmExisTerm Conns pmExisTran sConns ]
Average _ No _ Connections
n
Note: if the RXI is a pure AAL2 switching node, then the pmExisOrigConns and
pmExisTermConns counters can be discounted as there can be no originated
or terminated connections in the node, only transiting connections.
Counters
VplTp :: pm Re ceivedAtmCells
AAL2 _ VP _ Utilisation _ Rx * 100%
Meas _ Length( s ) * ingressAtm PCR
TN quality Physical Layer Quality
Several counters are available to monitor the availability and the quality of
physical and IMA terminations in CPP nodes.
Errored Seconds (ES): seconds with block errors during the PM interval.
These counters are incremented for each second where one or more blocks
with one or more errors are received.
HSFrameDelayDistribution pmHsDataFrameDelayIubSpi xx
This counter indicates the percentage of times where Iub congestion has occurred per SPI
(Scheduling Priority Indicator).
Experience has shown that in high loaded Iub cases, this counter could reach values of
about 65–75%.
Flow Control
Low HS Throughput Site Analysis Study Case
Counters were extracted and graphs plotted to
illustrate the HS Frame Loss Ratio and HSLimitIub
KPIs over time
Flow Control
Examining the KPIs resulting graphs below, it was evident that the channel
normally reserved for ClassA traffic (vc39), was experiencing abnormally high
bandwidth utilization.
The ClassB&C traffic channels (vc50 & vc51) were experiencing abnormally low
utilization (next slide).
Flow Control
Enhanced Uplink Congestion KPIs
Flow Control
pmEdchDataFramesLost
Eul _ Frame _ Loss _ Ratio * 100%
pmEdchDataFrames Re ceived pmEdchDataFramesLost
refers to an RRC/RAB setup failure that occurs after the user has been
admitted to the network.
An RRC failure that occurs after the initial admission could be if the user
wanted to upswitch to a higher rate while on an existing call and the upswitch
could not be achieved, due to lack of resources (Radio or Transport). This
would be perceived by the user as a slow connection.
On the other hand, a RAB setup failure would be perceived by the user as a
failure to setup a call.
Failure After Admission
In general, high ‘Failure After Admission’ occurrences are mainly due to:
pmSumBestPsHsAdchRabEstablish
AvNrHsUser sPerCell
pmSamplesBestPsHsAdchRabEstablish