PS Faults Analysis and Troubleshooting
PS Faults Analysis and Troubleshooting
www.huawei.com
Page2
Network
Sort Evaluate Items Evaluate Software Priority
Element
MS High
Application Layer TEMS; CDS; Probe; Dingli
EGPRS (GPRS)
Throughput ATU
MS High
Definition: Ping Delay KPI is the KPI from begin to end, which means the time delay from
sending ping request with test equipment to receiving ping reply with test server.
Key information confirm: Ping delay issue need to confirm 3 important information firstly:
Ping packet size
Normally, ping delay increase along with ping packet size.
Server position (destination address)
Server position have important affect to ping Delay, normally, ping delay in intranet is
quite less than which of internet. The server position in internet have quite different in
different position (inside country or outside country).
Whether exist background data transmission
If ping in upload or download, ping delay will be much longer than ping without upload or
download.
Evaluate Method:
DT/CQT: If evaluation is in moving state, all cells related to test need to be analyzed.
Statistics mean value/distribution: if statistics distribution, analysis focuses on delay
abnormal point.
Resource: PDCH resource, transfers resource and quality, channel multiplex rate, channel preemptions etc. will affect PS
throughput.
Overlay: Show Um quality, as BEP distribution.
Traffic Model: Resource is limit, traffic change lead to user resource change, then affect PS throughput.
Sort 1 factor according to Forward Check, Sort 2 factor according to Backward Check
Resource: PDCH channel number, Abis, Gb transfers resource, coding scheme will affect PS throughput.
Overlay: Show Um quality, as CQI distribution.
Traffic Model: Resource is limit, traffic change lead to user resource change, then affect PS throughput.
Sort 1 factor according to Forward Check, Sort 2 factor according to Backward Check
Abnormal operation: Operation in BSC (operation log record), like PDCH Channel Priority Type, transfers network
or core network reform action will affect PS ping delay.
Abnormal alarm: Common alarms, such as E1/T1 frame out-of-sync alarm, will affect the ping delay, in stipulate
action will check it.
Parameter: Common algorithm parameter or channel parameter affect PS data transmission, see PS data
transmission parameter checklist.
Version check: Commonly algorithm implement or compatibility issue, software bug which release as early warning
or version release notes.
Notebook/Server performance and windows setting: Bad performance or incorrect window size setting will affect
TCP layer data process and Ack feedback duly will affect PS ping delay.
CN+ Route: Packet loss due to core network factors, delay too long, overload etc. will affect ping delay.
Test mode: Test whether include upload or download which will affect PS ping delay.
Sort 1 factor according to Forward Check, Sort 2 factor according to Backward Check
Sort 1 factor according to Forward Check, Sort 2 factor according to Backward Check
Action 4: Version check Unconditional enter Check exist early warning, RN affect data transmission issue.
Action 5: Parameter
Unconditional enter Check exist abnormal configuration affect data transmission issue.
check
Action 6: Resource Check include: PDCH channel resource, transfers resource and quality, channel
Unconditional enter
analysis multiplex rate, channel preemptions etc. will affect PS throughput.
Action 7: Traffic model Analysis CS and PS traffic volume change, confirm throughput/volume change is
Unconditional enter
analysis whether relate to traffic model.
Action 8: Overlay
STEP 3: analysis Action 6 Transfers OK, then enter Analysis Um factor whether affect data transmission issue.
Backward
Check 1) DT throughput unconditional enter;
Action 9: Single user 2) Counter throughput and complaint AnalysisL1, L2, L3 and application layer, confirm PS data transmission issue
throughput trace data issue, first analysis counter data, then signal, L1, C/I change, L2 TBF establishment, radio block re-transmit, application
analysis analysis single user trace data (Signal layer delay and packet loss which affect throughput.
analysis part is optional)
Action 10: Single user Ping delay issue enter (Signal analysis Analysis whole delay, confirm GB, Abis, UM affect to ping delay, Confirm long
delay trace data analysis part is optional) delay part.
Step 4: Confirm issue solved Unconditional enter Confirm issue whether solved.
If project is swap, original network formula and mapping formula need to be provided; in some
condition, original network counter handbook is needed too.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential 20
Page 20
Forward Check (Action 2: Abnormal operation; Action 3: Alarm; Action 4: Version)
Input Output Follow-up
OMStar+ solution
1. Parameter
return (Abnormal
1. BSC MML Configuration and Manual 1. Check
operation);
license; Result
2. M2000 output Alarm Log;
check 2. Alarm resume;
3. Early warning
3. M2000 output Operation Log elusion perform
Case 3 (<Allocate One PDCH for Signaling Data> Parameter Configuration): A project
swap Ericsson, LLC layer weighted throughput cannot reach the standard, executed
parameter check, then found parameter: Allocate One PDCH for Signaling Data
/ALLOCSINGLEPDCHFORSIGNALLING is open, cause whole throughput is affected.
Close the parameter and throughput was improved.
[Issue description] G project HW area LLC layer throughput lower than area in other manufacturer, need optimized.
[Issue analysis]
STEP 1: Issue Confirmation and sort
Confirm Whole-net issue, decreased about 20K than other manufacturer.
STEP 2: Forward Check
PDCH dynamic
transform failure,
most of it are
because of PDCH
ratio thresh, and
reclaimed busy
dynamic PDCHs
are less, show no
CS preemptions.
Case: M project swap 160 2G sites, before swap, DT throughput is 100kbit/s, after swap, at
the same area, DT throughput is about 80kbit/s, DT mode is FTP download. Particularly see
Case 11.
Note: DT throughput action 1~8 are almost the same as counter throughput (Forward Check add
server, notebook, test mode check to PS parameter checklist), differences are analysis sites or cell
distinguish.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential 32
Page 32
DT Throughput Backward Analysis (Action 9: L1 analysis)
L1 analysis Result
L2 analysis Result
TCP layer data exist disorder, but Gb data N (U) is continuous, which proved disorder exist in other
network element upward BSC
For this issue, the main reason is other network element upward BSC issue. Forward analysis cannot find issue
real reason, but according to backward analysis located other network element upward BSC issue fast. This
Issue Need other network element upward BSC to check data transfers issue.
Note: Ping Delay Issue action 1~8 are almost the same as counter throughput (Forward Check add
server, notebook, test mode check to PS parameter checklist), differences are analysis sites or cell
distinguish.
SNDCP SNDCP
LLC LLC
Relay
RLC RLC BSSGP BSSGP
MAC MAC NS NS
GSM PF GSMPF L1 L1
MS BSS SGSN
Um Abis
4. Abis interface analysis: Check
Gb
3. Gb interface analysis:
5. Um interface analysis: C/I value, whether there is delay, jitter, packet Check whether there is loss
BEP, level quality, TBF loss, retransmission at the RLC and sequence error of
establishment failure, and call drop layer, and low High Coding packets.
Scheme Rate.
Analysis rules:
1. Check whether there
are problems, such as the
loss and sequence error of
packet before the packet
capture point.
2. Use the TEMS Log tool
to summarize the overall
delay in pinging tests.
Overroll Ping
Delay Data
Note: Actions 1-8 for troubleshooting ping delay problems are similar to those for troubleshooting the throughput-
related counter problems, except that the sites and cells to be analyzed are different. New items in the forward
analysis are server, laptop, and test method, which are all included in the PS Parameter Checklist.
Case 7: The ping delay fails to meet the specified limit because the server in office S fails to respond to the
ping request.
Problem description: According to the ping delay test result, the success rate is only about 80%, which is
below the acceptance standard.
Root cause: The server fails to respond to the ping request, causing failed ping test.
Follow-up solution: Locate the transmission problem at the core network side.
Case 8: The ping delay fails to meet the specified limit because N (u) errors occur in the downlink LLC
PDU data over the Gb interface in office M.
Problem description: According to the ping delay test result, the success rate is below the acceptance
standard (85%).
Root cause: Signaling analysis results show that a large number of N (u) errors occur over the Gb
interface, causing failure of assembling ICMP packets in the upper system and low success rate of ping
service.
Follow-up solution: Rearrange the sequence of downlink LLC PDU data to resolve the problem.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential 46
Page 46
Ping Delay Problem Backward Analysis
(Action 10: Abis Interface Analysis)
Input Output
1. Results of analysis
Follow-up
of Abis interface solution
transmission quality
Original traffic statistics, PS single FMA or and resources For more
subscriber signaling, and TEMS log. PS 2. Results of analysis details, see the
signaling of the delay, jitter, sections about
packet loss rate in the analysis of
analysis the Abis over IP
throughput
Analysis rules: tool or log networking mode.
1. Check whether the Abis interface 3. Transmission data statistics.
analysis
transmission quality is good and at RLC layer
whether transmission resources are tool 4. Coding scheme
sufficient. rate of ping service
2. Check whether the delay, jitter,
packet loss rate are normal in the Abis
over IP networking mode.
3. Check whether there is data
transmission at RLC layer during
pinging.
4. Check whether the coding scheme
rate is excessively low during pinging.
Cases to be collected
Case 9: The ping delay fails to meet the specified limit because the RLC operates in the unacknowledged mode in office Y.
Problem description: According to the ping delay test result, the success rate is below the acceptance standard (80%).
Root cause: The SIM card for test is registered to be the unacknowledged RLC mode and lost packets cannot be
retransmitted over the Um interface. Therefore, there is difficulty in successfully pinging and assembling packets at the LLC
layer, causing failure of ping service.
Follow-up solution: modify the SIM card registration information to resolve this problem.
Analysis Report
on PS Data Transmission P
Notify the frontline personnel to map these parameters onto original settings. Increase
the value of parameters by two to three kbit/s. Then perform backward analysis.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential 51
Page 51
Case 10: LLC Throughput in Office M Below the Standard (2)
The original network is configured with the BPDCH. That is, GPRS channels are only used for GPRS services. Therefore, the
average number of simultaneous TBFs is 1. In contrast, in Huawei network, channels for GPRS and EGPRS are not
configured separately, the downlink multiplex rate is generally about 4.
The red line represents the Maximum Rate Threshold of PDCH in a Cell (60%). The PDCH
usage rate almost reaches the threshold. A large number of application failures due to the
threshold indicates that PDCH resources are not sufficient.
The sum of FR TCH Occupied Rate, HR TCH Occupied Rate, and PDCH Occupied Rate is
almost 100%, which indicates that Um interface resource is not sufficient.
Throughput
Action Throughput
Specified Action Before
Time After Action
Action
Action 1: KPI
definition and
mapping
Action 2: Abnormal
operation check
Action 3: Abnormal
alarm check 2011-8-26 20 kbit/s 23 kbit/s
Action 4: Version
check
Action 5: Parameter
check
Action 6: Resource
analysis
Problem description: The FTP download rate in office F is below the standard.
Problem analysis
Step 1: Problem identification and classification
According the drive test result, the throughput in only one site is below the standard.
Step 2: Forward analysis
Action 9: L1 analysis
C/I statistics indicate that if the mean C/I is low, Um interface data transmission is affected. High
coding scheme rate cannot be used unless C/I is above 30 to improve the data transmission rate
during the drive test. The frontline personnel are advised to optimize the Um interface by
interference suppression and carrier adjustment.
Action 9: L2 analysis
According to the result of signaling analysis at the BSC side, TBF access is resumed after TBF drop.
RLC/MAC data blocks are frequently retransmitted over the Um interface, causing
low transmission rate and drops.
According to the result of L2 signaling analysis, Um interface transmission quality is
poor, causing significant retransmission, drops, and low transmission rate. You are
advised to identify the Um interface transmission problem.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential 57
Page 57
Contents
Step1: Establish project, input counter, Step 2: Choose configuration or performance counter,
configuration, alarm etc. Data choose needed analysis counter or KPI in right list, show
as figures.
Step 2: Right-click
the selected
signaling and
choose the required
function such as
statistics, export, or
extraction function.
Wireshark can analyze .cap files at any time and analyze the data transmission of LLC, BSSGP, and TCP layers.
Currently, the analysis of drive test signaling is performed by using the following method: Convert the Gb interface
signaling to .cap files by using the signaling analysis tool and view the signaling transmission by using Wireshark.
1 ) Ping Analyse ( 1 ) Open the file , click QoS->ping (2) Select the 5-elements-data to be analysed.
Analyse
Page68
Numerator: amount of MAC-D data received by the NodeB on the Iub In the formula of NSN, the
interface, except the packets discarded due to flow control. In at least denominator is greater than that of
the following four scenarios, the NSN's throughput measured is higher Huawei, but the numerator is
sum ([RECEIVED_HS_MACD_BITS] - than that of Huawei: greater than that of Huawei's first
[DISCARDED_HS_MACD_BITS]) / sum 1. The serving cell changes across NodeBs. formula in some scenarios and
Throughput
NSN ([ALLO_DUR_HS_DSCH_FLOW_INT] + 2. The number of HARQs exceeds the maximum number of NodeB smaller than that of Huawei's
per user
[ALLO_DUR_HS_DSCH_FLOW_BGR]) / 100)) retransmission times. second formula in most cases. The
Unit: Kbit/s 3. Data is discarded if timeout occurs when the NodeB buffers data. throughput measured using the
4. A user releases a call. formula of NSN is lower than that
Denominator: duration of flow allocation in at the MAC-D layer. The using the formula provided by
time precession is 10 ms. Huawei.
Throughput
ALU None None N/A N/A
per user
(EDCH_DATA_SCELL_UL+EDCH_DATA_NSC_S_EDCH_UL+EDC
H_DATA_NSC_NS_EDCH_UL)*
8*0.001/(DUR_HSUPA_USERS_1_OR_2
+DUR_HSUPA_USERS_3_OR_4+DUR_HSUPA_USERS_5_OR_6+
DUR_HSUPA_USERS_7_OR_8+DUR_HSUPA_USERS_9_OR_10+
DUR_HSUPA_USERS_11_OR_12+DUR_HSUPA_USERS_13_OR_ The data traffic of the
14+DUR_HSUPA_USERS_15_OR_16+DUR_HSUPA_USERS_17_ numerator measures that at
OR_18+DUR_HSUPA_USERS_19_OR_20+DURA_HSUPA_USERS Numerator: MAC-E SDU traffic
NSN Cell throughput NodeB the MAC-E layer, which is
_21_TO_24+DURA_HSUPA_USERS_25_TO_28+DURA_HSUPA_U Denominator: online duration of users in a
SERS_29_TO_32+DURA_HSUPA_USERS_33_TO_36+DURA_HS measurement period (accuracy: 1s) greater than that at the
UPA_USERS_37_TO_40+DURA_HSUPA_USERS_41_TO_44+DUR MAC-D layer.
A_HSUPA_USERS_45_TO_48+DURA_HSUPA_USERS_49_TO_52
+DURA_HSUPA_USERS_53_TO_56+DURA_HSUPA_USERS_57_
TO_60+DURA_HSUPA_USERS_61_TO_64+DURA_HSUPA_USER
S_65_TO_72)
Unit: Kbit/s
SUM(EDCH_DATA_SCELL_UL) * 8 /{SUM(MACE_PDU_RETR_00_COUNTER +
MACE_PDU_RETR_01_COUNTER*2 + MACE_PDU_RETR_02_COUNTER*3 +
MACE_PDU_RETR_03_COUNTER*4 + MACE_PDU_RETR_04_COUNTER*5 +
MACE_PDU_RETR_05_COUNTER*6 + MACE_PDU_RETR_06_COUNTER*7 +
MACE_PDU_RETR_07_COUNTER*8 + MACE_PDU_RETR_08_COUNTER*9 +
MACE_PDU_RETR_09_COUNTER*10 + MACE_PDU_RETR_10_COUNTER*11 +
MACE_PDU_RETR_11_COUNTER*12 +
The data traffic of the numerator
MACE_PDU_RETR_12_COUNTER*13)*0.01*(MACE_PDUS_10MS_TTI /(MACE_PDUS_10MS_TTI + The formula of NSN measures the
Throughput MACE_PDUS_2MS_TTI))}+{SUM(MACE_PDU_RETR_00_COUNTER + measures that at the MAC-E
NSN MACE_PDU_RETR_01_COUNTER*2 + MACE_PDU_RETR_02_COUNTER*3 + throughput of SDU at the NodeB
per user MACE_PDU_RETR_03_COUNTER*4 + MACE_PDU_RETR_04_COUNTER*5 + layer, which is greater than that at
MACE_PDU_RETR_05_COUNTER*6 + MACE_PDU_RETR_06_COUNTER*7 + MAC-E/MAC-I layer.
MACE_PDU_RETR_07_COUNTER*8 + MACE_PDU_RETR_08_COUNTER*9 + the RLC layer.
MACE_PDU_RETR_09_COUNTER*10 + MACE_PDU_RETR_10_COUNTER*11 +
MACE_PDU_RETR_11_COUNTER*12 +
MACE_PDU_RETR_12_COUNTER*13)*0.002*(MACE_PDUS_2MS_TTI /(MACE_PDUS_10MS_TTI +
MACE_PDUS_2MS_TTI))}
Unit: Kbit/s
Throughput
ALU None None N/A N/A
per user
Laptop
Dumeter, NetMeter High
computer
Application layer
HSDPA
throughput Probe, Dingli, XCAL,
UE High
Qvoice, Tems, QXDM
Laptop
Dumeter, NetMeter High
computer
Application layer
HSUPA
throughput Probe, Dingli, XCAL,
UE High
Qvoice, Tems, QXDM
Windows command:
HSPA Ping delay UE High
ping ipaddress
• Definition: Ping delay, is an end-to-end KPI that indicates the delay from when the test equipment
sends a ping request message to when received the ping reply message from server.
• Confirmation of key information: Confirm the following key information for any ping delay
problem:
• Ping packet size
Generally, when the size of a ping packet increases, the ping delay also increases.
• Server location (target address)
The server location has a great impact on ping delay. The ping delay in an internal network is much lower than that
in an external network.
The ping delay in an external network where the server is located varies greatly depending on the server location (at
home or abroad).
• Whether data transmission is involved
If a ping operation is performed during data upload or download, the ping delay is much higher than that when data
upload or download is not performed.
• Evaluation method:
• Moving/fixed-point: If the ping delay is evaluated in the moving status, all cells involved need to be
analyzed.
• Statistical average value/distribution: If distribution is measured, focus on analyzing the points
when delay is abnormal.
Ping delay Ping delay KPI Measure the ping delay by using the ping Test shows that the ping
requirement command in Windows system. delay does not meet the
requirement specified in
the contract or is longer
than those of peer
vendors.
Abnormal operation: including operations of RNC/NodeB (operation logs). For example, modifying the IUPS route,
transmission network, or core network have a great impact on HSPA data transmission.
Abnormal alarm: Common alarms such as alarms about packet loss during transmission affect the throughput, which are
tested in the SOP.
Parameter: common algorithm parameters and channel parameters affecting HSPA data transmission. For details, see the
checklist of HSPA data transmission related parameters.
Known problem: generally algorithm implementation or compatibility problems. Software bugs are indicated through early
warning or release notes.
Transmission configuration: Transmission bandwidth, DSCP configuration, and transmission mapping of the service type
affect HSPA data transmission.
Perform a forward check for factors at level 1 and a reverse check for factors at level 2.
Server Test
Abnormal Transmission Terminal Portable performance
Abnormal alarm Parameter Known problem performance and CN mode
operation configuration performance and TCP window
TCP window
Abnormal operation: including operations of RNC/NodeB (operation logs). For example, modifying the IUPS route and rectifying the
transmission network or core network have a great impact on HSPA data transmission.
Abnormal alarm: Common alarms such as alarms about packet loss during transmission affect the throughput, which are tested in the
SOP.
Parameter: common algorithm parameters and channel parameters affecting HSPA data transmission. For details, see the checklist of
HSPA data transmission related parameters.
Known problem: generally algorithm implementation or compatibility problems. Software bugs are indicated through early warning or
release notes.
Transmission configuration: Transmission bandwidth, DSCP configuration, and transmission mapping of the service type affect HSPA
data transmission.
Laptop computer/server performance and TCP window: Poor performance of the laptop computer or server or improper window
setting affects data processing and return of ACK messages at the TCP layer and the HSPA throughput.
CN: Factors of the core network such as packet loss, high delay, and overload affect the HSPA throughput.
Test mode: The test mode such as single thread test mode or multi-thread test mode affects the HSPA throughput.
Resource: Code words, power (uplink RTWP and downlink TCP), CE, bandwidth, and board load affect the HSPA throughput.
Coverage: air interface quality, for example, CQI distribution
Traffic model: Resources are limited. Traffic change causes change of user resources, which then affects the HSPA throughput.
Perform a forward check to identify level-1 factors and a reverse check to identify level-2 factors.
Server Test
Abnormal Known Transmission Terminal Portable performance
Abnormal alarm Parameter performance and CN mode
operation problem configuration performance and TCP window
TCP window
Abnormal operation: including operations of RNC/NodeB (operation logs). For example, modifying the IUPS route and rectifying the
transmission network or core network have a great impact on HSPA data transmission.
Abnormal alarm: Common alarms such as alarms about packet loss during transmission affect the throughput, which are tested in the
SOP.
Parameter: common algorithm parameters and channel parameters affecting HSPA data transmission. For details, see the checklist of
HSPA data transmission related parameters.
Known problem: generally algorithm implementation or compatibility problems. Software bugs are indicated through early warning or
release notes.
Transmission configuration: Transmission bandwidth, DSCP configuration, and transmission mapping of the service type affect HSPA
data transmission.
Laptop computer/server performance and TCP window: Poor performance of the laptop computer or server or improper window
setting affects data processing and return of ACK messages at the TCP layer and the HSPA ping delay.
CN: Factors of the core network such as packet loss, high delay, and overload affect the HSPA ping delay.
Test mode: Whether upload or download is enabled during the test affects the HSPA ping delay.
Resource: Code words, power (uplink RTWP and downlink TCP), CE, bandwidth, and board load affect the HSPA ping delay.
Coverage: air interface quality, for example, CQI distribution
Traffic model: Resources are limited. Traffic change causes change of user resources, which then affects the HSPA ping delay.
Perform a forward check to identify level-1 factors and a reverse check to identify level-2 factors.
R99 PS data transmission troubleshooting covers four steps, as listed in the following table.
Step Action Entry Condition Purpose
Step 1: Confirm and classify the problem. None similar to that for HSPA data transmission troubleshooting
Step 2: Action 2: Check alarms. None similar to that for HSPA data transmission troubleshooting
Perform a Action 3: Check early
None similar to that for HSPA data transmission troubleshooting
forward check. warnings.
Action 4: Check parameters
None similar to that for HSPA data transmission troubleshooting
(including licenses).
Action 5: Check
None similar to that for HSPA data transmission troubleshooting
transmission configurations.
Action 6: Analyze resources. None similar to that for HSPA data transmission troubleshooting
According to the analysis on root causes of online problems in 2012, more than 80% of problems can
be located following the standard actions.
Any abnormal operations, alarms, and early warnings can be checked based on the SOP.
Case 2 (incorrect setting of the R99 DCCC 4a threshold): In busy hours, the number of HSUPA
users exceeds the allowed value, that is, 20. When HSPA users move from the DRD of an R99 cell
to an HSPA cells, the users are forced to go back R99 cell. The initial rate of the R99 cell is 64 Kbit/s,
whereas the R99 DCCC 4a threshold is set to 4 Kbit/s and the baseline value is 1 Kbit/s. As a result,
the uplink rate cannot rise to 384 Kbit/s, affecting the HSDPA rate.
1. Use the FMA tool to convert the configuration of Node B from xml
to mml (see the appendix).
2. For the configuration of RNC, use the FMA tool to filter the
configuration of NodeB.
See case 3: The HSUPA Rate Drops in Office L Because the Transmission Configuration
Does not Comply with Specifications
The number of
HSDPA RAB
establishment
attempts or the
number of
successfully
established HSDPA
RABs increases
since April 6 and is
larger than that
before upgrade.
Consistent Inconsistent
before and before and after
after swap swap
HSUPA: Compare the proportion of HSUPA users whose UPH is limited in the cell (VS.HSUPA.MaxPwrLmtUserRatio).
Output Solution
1. Scope of sites or
UE tracing data Input cells where the Perform a
or throughput throughput problem is
screenshot UE forward
detected in a drive test
analysis.
tracing 2. Test mode of
throughput in a drive
Analysis rule: provided based on tool test: moving or fixed
drive test rules, which are point, FTP or HTTP
specified by the customer or a 3. Confirm the problem
third party symptom: test on
whether the
throughput meets the
requirement or single
point test for locating
the problem.
Case: 160 3G sites in office M are swapped. The throughput obtained in a drive test in
moving mode is 730 Kbit/s before the swap, and the throughput obtained in a drive test in the
same area is about 600 Kbit/s after the swap. The drive test is performed in FTP download
mode. For details, see case 12.
Input
RNC CDT
Output Solution
Analysis Rules for HSPA Single User Tracing Data.rar FMA 1. Check results
of UE 1. Replace the
1. Signaling for checking UE
capabilities:
FMA capabilities terminal of the
RRC_CONNECT_SETUP_CMP 2. Check result related
2. Signaling for checking the of the category.
registration rate: registration rate 2. Set the
RANNAP_RAB_ASSIGNMENT_REQ registration rate
3. Check results
3. Network capability check (ADT again on the
URES) of network
capabilities HLR.
3. Check
Analysis rules:
1. Observe the signaling field hsdsch-
whether related
physical-layer-category and UE boards of the
capability information. NodeB support
2. Observe the uplink and downlink • Terminal capability the
registration rates from the maxBitrate HSDPA/HSPA+
field and ExtendedMaxBitrate field. If the • Terminal registration
capability.
rates are lower than the required rates, • Network capability
an exception occurs.
3. In the resource audit response
message, check whether related service
feature capabilities are supported.
Note: Actions 1 to 8 for the throughput problems detected through a drive test are basically the
same as those for the throughput problems detected through traffic measurement except that the
sites and cells to be analyzed are different. In addition, the forward check covers the check on
servers, laptop computers, and test mode, which are included in the HSPA parameter checklist.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential 102
Page 102
Reverse Analysis on HSDPA Throughput Problems Detected Through
a Drive Test (Action 9: Analyze L1 Signaling)
Output Solution
RNC CDT Input See the 1. Check RLC 1. Change the
appendix BO is 0 or not. test to a multi-
for RNC 2. Check result thread or packet
1. L2 RLC Bo statistical chart CDT of RLC injection test.
2. PS L2 RB statistics analysis retransmission 2. Modify the flow
3. PS L2 FP packet statistics and control algorithm
(HSPA).
DownlinkWindow to the static flow
Analysis rules: FMA sfullnum control algorithm.
Change the test
3. Check results
1. If the value of RLC Bo is 0, an place to the one
of uplink air
exception occurs. Follow the handling where the
interface
measures. If the value of RLC Bo is not coverage is good.
bandwidth and
0, go to the next step. 3. Perform a
allocated
2. If the value of ulSufilistnum packet injection
bandwidth for
increases continuously, an exception test.
NodeB
occurs. Packet loss may occur on the
Iub interface or retransmission is
performed over the air interface. If the
value of DownlinkWindowsfullnum is
not 0 and increases continuously, an
exception occurs. Packets are blocked
in the RLC sending window on the RNC
side and cannot be sent.
3. If the count of Harqfailure increases
continuously, an exception occurs.
Related items of TNLIub can be used to
collect statistics of packet loss and
delay on the Iub interface in the uplink
of R99 or UPA. If the number of lost
packets or the delay increases
continuously, an exception occurs.
RNC CDT
Input Output Solution
1. Enable RNC CDT by using 1. Change the path
FMA and choose Tool > User 1. Check to another APN path
plain extraction to obtain the results of and modify the flow
Pcap file in the same directory. FMA packet loss control algorithm to
Import TranExpert. on devices the static flow
TranEx above or control algorithm.
Analysis rule: pert below RNC 2. Modify the
Check the uplink packet loss at 2. Check window for servers
the point where packets are results of and laptop
captured and the analysis results performance computers. Replace
of delay on the RAN side. of servers servers and laptop
If the results show that packet and laptop computers with the
loss occurs in the uplink of the computers ones that have
packet capture point, packet loss better performance.
occurs in devices above the
RNC.
Generally, the measured delay is
about 100 ms. If the delay is
above 200 ms and fluctuates
greatly, an exception occurs.
Case 6: The Peak Rate Is Low During the DC Competitive Test in Office G (Next Page)
After the MTU of routers on the CN is changed to 1500 bytes, the peak rate becomes stable and is around 40 Mbit/s.
This parameter is related to transmission equipment on the CN. It takes more time and efforts to check this
parameter than other parameters related to the RNC and NodeB. Reverse analysis shows that the MTU is limited.
This problem can be located through forward parameter analysis.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential Page 108108
Reverse Analysis on HSUPA Throughput Problems Detected Through
a Drive Test (Action 9: Analyze L3 Signaling)
Note: Actions 1 to 8 for the throughput problems detected through a drive test are basically the
same as those for the throughput problems detected through traffic measurement except that the
sites and cells to be analyzed are different. In addition, the forward check covers the check on
servers, laptop computers, and test mode, which are included in the HSPA parameter checklist.
NodeB CDT
Input See the Output Solution
appendix
1. Check result 1. Set ROT to a
1. DL HSUPA RlEnhanced for NodeB of load on cells reasonable
2. Trans Iub HSUPA CDT CDT 2. value, set the
You can see the values of OutBw analysis Retransmission background
(outbound bandwidth allocated to (HSUPA). 3. Check results noise to the
transmission equipment) and Loss (lost of radio actual value, and
packets on the Iub interface) by using FMA resources (CE, suppress
the FMA tool. power, and external system
traffic volume) interference.
limitation 2. Change the
test place to the
Analysis rules: 4. Check results
one where the
1. Check whether the value of of transmission
coverage is
uhwDelayCurLoad reaches 75%. If printing
good.
yes, uplink overload occurs.
3. a. Update CE
2. Check whether the proportion of Hi
resources; b.
whose value is 0 exceeds 10%. If yes,
Change the test
packets are frequently retransmitted.
place to the one
3. If the scheduling reason is (0, 21),
where the
the number of CE resources is
coverage is
insufficient. If the value of
good; c. Adopt
rLScheInfo1.ucUph is smaller than 31,
the uplink packet
the power is insufficient. If the value of
injection mode.
rLScheInfo1.ucTebs is smaller than 25
4. Disable
and fluctuates, the UE buffer is
HSUPA flow
insufficient.
control.
4. Check whether packet loss or a
delay problem occurs. If yes, an
exception occurs (see related
description of HSDPA).
Case 7: The HSUPA Rate Is Low Due to Packet Loss on the Iub Interface
at a Site of Office T (Next Page)
Analysis on scheduling at L1
Based on the analysis on L1 signaling, the rate is low because flow control is enabled, which is possibly
caused by packet loss or delay on the Iub interface. To locate the root cause, analyze L2 signaling.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential 114
Page 114
Case 7: The HSUPA Rate in Office T Is Low (Continued)
After the ping command is run on the NodeB LMT and continuous ping detection is enabled, the ping
command can be run successfully when the ping packet size is set to 1400 but fails when the ping
packet size is to set to 1470. The MTU of intermediate transmission equipment is set to an improper
value, that is, the value is smaller than 1500. After the MTU is set to 1500 bytes, the HSUPA rate
increase to 2 Mbit/s and remains stable.
This parameter is related to intermediate transmission equipment. It takes more time and efforts
to check this parameter than other parameters related to the RNC and NodeB. Reverse analysis
shows that the MTU is set improperly. This problem can be located through forward parameter
analysis.
After the preventive measures are taken, the throughput increases by 20%, which is close to those
of peer vendors but is not enough to win the first place. To find a solution to the problem, perform a
reverse check.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential 116
Page 116
Case 8: The DPA Throughput Is Lower Than Those of Peer Vendors (Continued)
Uplink RTWP:
In busy hours, the RTWP is high in 11.3% of cells.
After field engineers execute Action 6, the throughput detected through a drive test
increases by 18%, which is higher than those of peer vendors.
The R99 service volume drops, and the The traffic is transferred normally, and no
Action 7: Analyze traffic models.
HSDPA service rises. actions are required.
Action 8: Analyze coverage. N/A
Action 9: Analyze the traced data about the single
N/A
user throughput.
Action 10: Analyze the traced data about the single
N/A
user delay.
Problem description: The R99 PS service volume rises after upgrades at site F.
Problem analysis
Step 1: Confirm and classify the problem.
The problem is confirmed to be an RNC-level problem. After upgrades, the downlink R99 PS traffic rises drastically.
The items to be checked in forward check are the same as those for
throughput problems detected through traffic measurement and through a
drive test. In the reverse check, you need to check the resources, coverage,
and traffic models. If a drive test is required, analyze signaling at L1, L2, L3,
and application layer.
Analysis rules:
1. Check the analysis results of
packet loss in the uplink and
delay on the RNC.
If packet loss occurs in the
uplink, packets are lost on
routers above the RNC.
2. The general delay measured
is around 100 ms. If the delay is
higher than 200 ms and
fluctuates greatly, an exception
occurs.
Note: Actions 1 to 8 for ping delay problems are basically the same as those for the throughput problems detected through
traffic measurement except the sites and cells to be analyzed are different. In addition, the forward check covers the check
on servers, laptop computers, and test mode, which are included in the HSPA parameter checklist.
Output Solution
RNC CDT Input
1. Whether
FMA RB
reassignment
Analysis rules: is performed 1. Change the test
1. Check whether RB place.
2. Bearer
reassignment is performed during 2. Modify the
channel
the ping test. setting of the
2. Check whether the channel channel parameter.
where ping packets are
transmitted is the correct channel
to be tested during the ping test.
Case 9: The ping delay does not meet the requirements because the state transition
policy is unreasonable in office M.
Problem description: The ping delay is 191 ms before the swap and 274 ms after the
swap, which does not meet the requirements and affects the swap and acceptance.
Root cause: The state transition policy is different from that of Ericsson. The DCH/HSDPA
state threshold is high. A 32-byte ping packet can be carried on only the FACH. The
normal delay on the FACH is around 260 ms.
Solution: Reduce the DCH/HSDPA state threshold. UEs still transit from the DCH to the
FACH state from time to time. After the H2F optimization function is enabled, the problem
is solved.
Output
Probe/QXDM UE log
Input Solution
NodeB CDT
1. Whether
the PDU is
Probe sent as 1. Change the test
FMA expected place.
Analysis rules: 2. Packet 2. Modify the
1. According to the size of ping retransmission setting of the
packets, check whether the PDU rate in the channel parameter.
is sent as expected. uplink and
2. Check the retransmission rate downlink
of ping packets on the Uu
interface.
Action 10: Analyze the traced data about the single user delay.
Analysis on the uplink CDT data shows that the uplink retransmission rate is high. It is
confirmed that the problem is caused by software defects.
Field engineers confirm that the delay is introduced on the Uu interface, and R&D
engineers analyze why the delay is introduced.
Output Solution
Transmission delay Input
on the Iub interface
RNC CDT
FMA Transmission Check the
delay on the transmission
Iub interface configuration.
Analysis rules: Packet loss
1. Compare the delay data with rate at L2
the baseline value to check
whether the delay variation and
high delay exist.
2. Check whether packet loss
occurs at L2.
2. This slide describes the general analysis procedure. For details, see the HSPA Data
Transmission Troubleshooting Guide on the support.huawei.com website.
After the parameters are set to baseline values, the throughput increases by 3% to 4%, which is
still 20 Kbit/s lower than the throughput before swap. Then perform a reverse analysis.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential 136
Page 136
Case 12: The Throughput Detected Through a Drive Test in Office M Does Not
Meet the Requirement (Continued)
Analyze the distribution of online users by using the OMStar. The ratio of uplink 2 ms users to uplink 10 ms users is about
7:3. Most terminals support the 2 ms policy. The HSUPA 2 ms policy configured on the live network does not comply with
the baseline configuration and needs to be optimized.
Scheduling probability 55% before swap Scheduling probability 40% after swap
Action 1: Check
abnormal operations.
Problem
Tool Name Data Source
Type
RNC script, RNC traffic statistics, RNC
OMStar network evaluation tool
license, and NodeB license
RNC script, RNC traffic statistics,
Network- OMStar network optimization tool NodeB license, and NodeB traffic
level statistics
performance
OMStar SOP tool RNC script and alarm logs
Alarm logs, operation logs, PCHR, and
FMA
RNC traffic statistics
WIRESHARK TCP/IP packet capture
Data
transmission TranExpert TCP/IP packet capture
for a single QXDM UE QXDM log
user
PROBE UE PROBE log
Page151
Page152
Actions 3 to 6:
Air interface
Checking basic
differences
elements
Coding differences
Resource
differences
Actions 7 to 12:
Use the Assistant
Use the Probe and Isolate causes for Channel and Analyzing the
Extend for data
Optis for DTs. low throughput. interference corresponding
analysis.
differences causes
Handover
differences
The actions provided in X solutions are formulated and methodically organized based on cases
and issue characteristics identification by following the principle for minimal costs.
You are advised to select points with SINRs greater than 24 to test whether the peak throughput can be achieved at such points.
This is the most effective way to exclude causes for low DT throughput caused by PCs, terminals, and SIM cards.
You are advised to conduct a DT during off-peak hours to reduce background users' impact on DT users. During peak hours,
background users occupy more scheduling resources. As a result, interference from neighboring cells increases.
Air interface conditions are affected by test routes and direction. In addition, vehicle speed may also affect air interface conditions
due to frequency deviation effect. Therefore, the external environment should be identical.
You can download the Assistant Extend from the following URL:
http://3ms.huawei.com/hi/group/9261/thread_4222425.html?mapId=3417611&for_statistic_from=all_group_forum
When the bandwidth keeps unchanged, the throughput is mainly affected by radio environment and product capabilities.
SINR_CDF SINR vs DL RLC DL RLC Throughput_CDF
100.00% 100.00%
35000 Throughput
90.00% 90.00%
30000
+
80.00% 80.00%
=
70.00% 25000 70.00%
60.00% 20000 60.00%
50.00% 15000 50.00%
40.00% 40.00%
10000
30.00% 30.00%
20.00% 5000 20.00%
10.00% 0 10.00%
0.00% -10 -5 0 5 10 15 20 25 30 0.00%
-10 -5 0 5 10 15 20 25 30
Affecting factors of the throughput curve: 0 10000 20000 30000
Affecting factors of the radio environment Devices: repeaters + channels + alarms + data transmission Throughput is affected by both radio
(SINR): + a server + UE capabilities environment and product capabilities.
Coverage: low SINR due to coverage restriction Power configuration (Pa, Pb) Therefore, throughput can be optimized
Interference: intra-RAT and inter-RAT interference Optimization on parameters and features (product capability) from these two aspects.
Focus on external events that are not listed in this table but chronologically correlated with the KPI deterioration. In practices, a brief understanding of the
event, rather than event digging-out, is enough for analysis.
Release Notes, warnings, and avoidance guides are available at Huawei support website.
Air interface Air interface parameter signaling is obtained from the DT data to compare parameter mappings during
comparison tests on commercial networks. The FMA can extract air interface parameter configuration
parameters information contained in L3 signaling from Probe logs or QXDM logs.
During analysis of non-eNodeB device faults, the network topology can be analyzed to provide
Non-eNodeB check or optimization suggestions for operators based on actual experience. As for throughput differences
between radio networks or transmission networks, parameter differences (transmission parameters, EPC-
parameters related parameters, and TCP parameters) between different networks must be analyzed, especially for
MTU parameters. For details, see the attachments.
This action involves 55 core parameters affect data transmission in the entire network for X Solutions for Parameter Check
and Optimization on the LTE Network. This document mainly describes the impact of parameters related to non-eNodeB
devices and provides 38 parameters that need manual checks, associated configurations, and balancing
configurations. For other parameter checks, see X Solutions for Parameter Check and Optimization on the LTE
Network.
For detailed parameter description and impact, see Attachment – Alarm Checklist and Attachment – Parameter
Checklist.
Downlink throughput
increases by 2% to 4% at
Set CqiAdjAlgoSwitch.Ad
the cell edge but does not
AdaptiveStepVarySw aptiveStepVarySwitch Accelerate the MCS adjustment and optimization. All
increase at the cell center.
itch to ON. -1
The average throughput
increases by 2.
Note: For all throughput-related parameters, see X Solutions for Parameter Check and Optimization on the LTE Network of the latest
version.
For problems related to the air interface and coverage, this document only describes problem symptoms. For detailed handling methods,
see X Solutions for Basic LTE Network Planning and Optimization.
For uplink interference, it is recommended that you observe the ChMeas.Pwr.Cell counter and analyze corresponding counters. If the
interference recorded in traffic statistics is greater than –110 dBm, interference exists.
The eNodeB is configured with 2T2R in some networks. In scenarios of metros and shopping malls,
1T1R repeater repeaters are used for coverage holes, which causes 1T1R. As a result, the Rank2 ratios vary sharply by
area in the same cell during DTs.
Whether a UE reports Rank2 information is affected by the SINR. From live network applications, some
Intra-site MOD3 areas may experience intra-site MOD3 interference. Because time synchronization is naturally used for
interference the same site, strong MO3 interference reduces the SINR in the RS domain and a low RANK2 ratio if
optimization on overlapping coverage is not good.
Time When time synchronization is used, the inter-site pilot interference increases, which reduces the SINR and
synchronization affects the Rank2 ratio.
1. Standing wave ratio (SWR) alarm: When the SWR is high, the transmit channel may be closed even
though 2T2R is configured. As a result, only Rank1 can be used.
Antenna assurance
2. Cross feeder connection: causes imbalance between two receive antennas of the UE, which affects the
Rank2 ratio.
This parameter is used for setting the initial MIMO type: 1) When this parameter is set to TM2, the
downlink throughput rate is decreased because only one data flow can be used compared with the
InitialMimoType
setting of ADAPTIVE. 2) When this parameter is set to ADAPTIVE, the downlink throughput rate is
increased compared with the settings of TM2.
MIMO mode during data transmission. When this parameter is set to NO_ADAPTIVE, set downlink MIMO
to OL_ADAPTIVE for the UE based on the FixedMimoMode parameter, enabling adaptive open loop
MimoAdaptiveSwitc
transmission. When this parameter is set to CL_ADAPTIVE and the adaptive closed loop transmission is
h
set to OC_ADAPTIVE, the UE can switch over between open loop adaptive MIMO and closed loop MIMO.
The switchover depends on the UE capability.
MIMO mode selection in the fixed MIMO mode. When this parameter is set to TM2, all UEs use the TM2
mode, only one data flow is transferred. In this case, downlink peak rate is half of that with the setting of
FixedMimoMode TM3 or TM4 (two antenna ports configured on the network side) or lower (four antenna ports configured
on the network side, and the downlink throughput rate is greatly lower than that with the setting of TM3
or TM4.
The following figure shows the method of locating problems related RBs and Grant.
Throughput restriction Postpaid cards are recommended because the throughput is restricted by the Online
on the EPC charging system (OCS) when prepaid cards are used.
Transmission Determine whether faults exist on the EPC using
restriction the UDP packet injection.
Other NEs AMBR Check the AMBR rate contained in the S1AP_INITIAL_CONTEXT_SETUP_REQ message
restriction traced over the SI interface.
Message subscription
on the U2000
Whether ANR is
enabled
Abnormal events (call drops, handovers, Whether UEs perform inter- Whether inter-frequency
frequency measurement handovers are triggered
reestablishments, and ping-pong handovers)
Whether event A2 threshold
is properly configured
The receive Probe N/A If analysis results of action 2 show that interference exists on uplink
performance is data/U2000 receive channels, use Probe data to check whether the TX power of the
poor due to uplink tracing UE greatly increases, and check whether the SINR of uplink receive
interference. results signals is low on the U2000. If these two conditions are met, uplink
interference exists and needs to be checked. Otherwise, the
interference is not the root cause for the poor receive
performance.
After channel/interference exceptions are confirmed, follow the instructions in X Solutions for Troubleshooting RF
Channel Faults in the LTE Network to check channels and detect and eliminate the interference.
Handover conditions are met but the handovers are Values of PDCCH Grant Count are centralized at 750 or
not performed in time. 875.
S1-U
SGi
public network
S9312 NE40E servers
UE+PC eNodeB UGW
Larger RTT delay exerts severer impact on TCP rate. The RTT delay decreases at the TX end than that from the TX end.
You are advised to isolate and segment the long delay to check the delay source.
In addition, you can determine whether data transmission between the eNodeB and the EPC is normal. For details, see XXX.
Page177
1 Background
3 Analysis Method
4 Tools Introduction
5 Case
1 Background
3 Analysis Method
4 Tools Introduction
5 Case
To ensure satisfactory LTE throughput, the following actions for effective routine network maintenance and
optimization must be taken:
Action 1: Parameter Check Periodically check network parameters to ensure feature parameter
consistency and that feature parameters are set to the values
recommended for the current version. For details, see X Solutions for
Parameter Check in the LTE Network.
Action 2: Alarm Check Periodically check network alarms and clear those that may affect
output performance. For details about various types of alarms, see
Attachment 3 Alarm Checklist.
1 Background
3 Analysis Method
4 Tools Introduction
5 Case
Customer OSS
Coverage/Modulation
CQI/SINR
TCP/IP Performance
MCS/IBLER/MIMO Bandwidth/
Grant/RB
Latency/Packet
RF
Drop User QoS
Power Configuration MTU
MIMO 1T2R\2T2R\ TCP Parameter
HSS KPI Statistics Error
OL\CL KPI Mapping Error
Router
IP IP IP
Power Network Network Network
MME FireWall
QoS/Capacity Capacity/CPU Load
AMBR
Feature/Parameter
QCI 6/8/9
Cat
License/Alarm PS Core Network TCP Win
Abnormal Operation Speed Limit(CAR)
Software defect Feature(TCP Proxy/DPI)
Hardware defect Charge Policy
Cache (F5)
Every phase of the E2E process affects PS transmission performance. It is necessary to understand
the E2E key factors and possible causes affecting LTE PS performance.
If the RS power decreases by 3 dB, the value of PA is changed from –3 to 0, but the RS SINR
PA and PB does not decrease by 3 dB, the UE measurement increases by 3 dB according to the value of
PA. As a result, the CQI value is greater than its actual value.
Wide-band CQI and sub- When the best sub-band CQI is reported, the measured CQI value is greater than its actual
UE's CQI
band CQI value.
reporting
Aperiodic CQI reporting is more reliable, and the PUCCH may incorrectly detect messages in
the event of periodic CQI reporting. In normal cases, aperiodic CQI reporting is superior to
Periodic and aperiodic
periodic CQI reporting. The CQI value becomes higher if the number of aperiodic CQI reports is
greater than that of periodic CQI reports.
A larger value of eNodeB detection indicates more occupied PUSCH indicators, less incorrect
Delta offset CQI index
CQI detection, and higher CQI reliability.
When false SR detection occurs, the eNodeB receives a random CQI value, which is small in
False SR detection
eNodeB most cases.
detection When incorrect CQI detection occurs, a random CQI value, which is small in most cases, is
Incorrect CQI detection
reported.
The actions provided in X solutions are formulated and methodically organized based on cases and issue characteristics identification by
following the principle for minimal costs.
Check whether the KPI calculation formulas Analyze the differences in KPI calculation formulas
Action 1: Check KPI
used on the live network are the same as between Huawei and its competitors if there are any.
definitions.
those on the FMA.
Check for associated KPIs in which differences are
Focus on deteriorating KPIs. detected between Huawei and its competitors.
Action 2: Identify the Perform DTs to demarcate and locate issues (for
issue scope and details, see action 13).
Check for historical operations at the time
demarcate issues. of KPI deterioration.
Check for new faults and alarms.
Focus on the differences in air interface parameters
Check for known issues in the current between Huawei and its competitors.
Actions 3 to 6: Check version.
Check whether parameters are set to their optimal
basic elements. Check the impact of parameter values to ensure transmission performance.
reconfiguration on traffic statistics.
Check consistency between network parameters.
Actions 3 to 6 are included in the technical guide, and therefore are not detailed in this document.
Instead of being performed only in the event of issues, actions 3 to 6 can be used as routine network maintenance
measures.
Prerequisi
Step Action Purpose
te
Check whether the KPIs formulas used by Huawei are correct or the same as those used
Action 1: Check KPI by Huawei competitors.
definitions. None The methods of calculating traffic KPIs are incorrect.
Step 1: Check whether the number of cells in traffic statistics and the collection period differs
Identify and before and after the deterioration.
demarcate 1. 3 Characters: "geographical"(TOP Cell)+"time"+"Frequency Band"(Dual Band)
issues. Characters.
Action 2: Identify the
2. Execute associated traffic statistics analysis according to issue scenarios(deterioration
issue scope and None
or comparison testing).
demarcate issues.
3. Associated traffic statistics analysis: Coverage/Modulation and Coding, Resource and
Capacity, insufficient data source, RF Channel and Interference.
Action 3: Check
external events 1. Check the impact of abnormal eNodeB-side operations.
and historical 2. Check the impact of modifying the following factors on the service throughput: RF
antennas, transmission/TCP, EPC/datacom equipment, and user QoS.
operations Deterioratio
3. Determine the impact of external events (press conferences for new mobile phones
(including n scenarios
and great gatherings).
eNodeB-side 4. Check whether certain neighboring sites or cells are deactivated, sites are
operations and disconnected, or access is prohibited.
other operations).
Step 2:
Check basic Action 4: Check for None Check for alarms affecting the traffic services
elements. faults and alarms.
Action 5: Check for If deterioration occurs after an upgrade, check changes in the current Release Notes and
None
known issues. resolved issue in next version release Notes.
1. Parameters include eNodeB parameters, the AMBR, TCP, TCP window length, and
Internet exit bandwidth.
Action 6: Check 2. In deterioration scenarios, evaluate the impact of parameter changes.
None
parameters. 3. In comparison testing, check differences in parameter settings between Huawei and its
competitors and check whether parameter settings meet the requirement of achieving
the optimum throughput.
Action 1 is performed on any condition. Checking KPI definitions in the first place helps avoid a
waste of time on analyzing invalid data. Focus on the following information during the KPI definition
check (for details, see Attachment 1 technical guide):
Use the FMA to perform a secondary check on all traffic statistics provided by telecom operators.
Ensure that the KPI mapping is correct before a KPI comparison between Huawei and its competitors. For details about the
KPI mapping, see the network swapping SOP 3.0, which can be downloaded from link.
Verify that KPI calculation formulas are correct. When a KPI has multiple formulas, the values calculated by different
formulas vary. For example, the formula stipulated in 3GPP specifications is recommended for UE throughput.
Check the method of collecting traffic statistics: average value of each cell, or weighted average value of the entire
network.
Check consistency between traffic statistics, including whether the number of sites and that of cells remain unchanged
before and after the deterioration, and whether the collection period is the same before and after the deterioration.
[(L.Thrp.bits.DL/(Report_Period x
Spectrum Efficiency
60)]/1000000/[L.ChMeas.PRB.DL.Used.Avg/(Syst NA
DL_formula1 Reflects the cell throughput/Hz
em bandwidth x 5)]/System bandwidth②
Furthermore 100%RB throughput can also reflect the Max
L.Thrp.bits.UL/(Report_Period x cell throughput when 100% RB Usage
Spectrum Efficiency
60)/1000000/[L.ChMeas.PRB.UL.Used.Avg/(Syste NA
UL_formula1
m bandwidth x 5)]/System bandwidth}
① The cell throughput is calculated by excluding SRB volumes because cell-level statistical period measures only the DRB period.
② System bandwidth x 5: For example FDD 10Mhz, System bandwidth x 5=10*5=50 RBs
Thp(excluding LastTTI)=(4K+4K)/2ms=4000Kbps
Thp(including LastTTI)=(4K+4K+2K)/3ms=3333Kbps
This action is performed after you have verified that the KPI calculation formulas and data are valid. This action enables you to demarcate an
issue to identify the cause, helping determine the subsequent action and avoid unnecessary analysis.
First, obtain basic information about each issue:
Issue occurring on the entire ›If KPIs deteriorate in TopN cells, do not check the NEs (including the EPC, FTP server, some transmission devices, and tested
network or in TopN cells UEs) shared between TopN cells and non-TopN cells. Focus on differences, such as those in eNodeB configurations,
transmission, channel quality, and interference.
Issue occurring in MOCN ›Air interface parameters on the eNodeB are set without distinguishing between PLMNs. If one operator's network is faulty but
networking another operator's network is normal, the issue does not occur in the air interface pipe for the eNodeB. Focus on differences,
such as changes in the configurations of the EPC and transmission equipment, between operators.
Issue occurring on a ›Obtain the changes in the UE throughput, UE quantity, and parameter settings on each frequency band, the coverage
multi-band network capability of each frequency band, and the impact of differences in UE's frequency band capability.
Perform associated KPI analysis after The FMA can automatically complete this action,
the preceding information is obtained. including:
Proportion of traffic Number of users Average CQI value PUSCH Handover IPPM delay and
or duration of the Traffic volume and CQI distribution interference success rate packet loss
last TTI CCE usage Average MCS index PUCCH Number of inter- Maximum FE/GE
Length distribution PRB usage value and CQI index interference frequency bandwidth
of incoming PDCP PUCCH RB distribution handovers
packets IBLER/RBLER
PDCCH DTX
Rank 2 proportion
Actions
This action aims to identify the possible causes of traffic model changes based on the following factors:
Scenario Element Evaluation Criteria Involved KPI Factors
① [Multi-band networking] UEs migrate among frequency
[S1 tracing] The proportions of AMBRs bands.
change significantly, especially low AMBRs
AMBR -- ② Networks are put into commercial use, users choose other
Deterioration (for example, 64 kbit/s) whose proportion
operators, or large gatherings are held.
increases.
③ UEs' QoS (QCI) changes.
① Functions such as CAR and DPI are enabled on the CN.
Proportion of traffic The proportion of traffic within the last TTI
DL LastTTI data volume Ratio ② Transmission performance decreases.
within the last TTI increases by over 5%.
③ The AMBR decreases.
Length of an
The average length of incoming PDCP
incoming PDCP Avg PDCP DL Packet Size Modifications are made to MTU transmission nodes.
packets decreases by 100 bytes.
packet
[S1 tracing] The proportion of UEs with low
Comparison testing
Channel problems affect transmission performance in the following ways: 1) The MIMO dual-codeword proportion is low. 2)
Interference mainly affects the MCS. 1) The MIMO dual-codeword proportion is low. 2) Interference mainly affects the MCS.
The preceding impact decreases spectral efficiency and the throughput in traffic statistics.
Elemen
Evaluation Criteria Involved KPI Factors
t
Internal/External interference
L.UL.Interference.Avg
[Traffic statistics] The cell interference Reconstruct RF antennas ①
Uplink Interference
The impact of handover issues on transmission performance is as follows: 1) Throughput decreases if handovers cannot be
completed in time or ping-pong handovers are triggered. 2) Improper inter-frequency measurement decreases throughput.
3) The performance of X2-based handovers is superior to that of S1-based handovers. 4) In multi-band networking,
throughput decreases if UE migration is caused by handover parameter reconfiguration.
Reconfiguration of handover
Handovers of The number of inter-frequency handovers changes by over Inter-Freq HO parameters (for example,
multiband UEs 10%. Success Rate thresholds, handover priorities,
and MLB)
For details about the methods, procedures, and tools related to DT data analysis, see X Solutions for PS Data Transmission
Optimization in the LTE Network (Drive Tests) (Click).
1 Background
3 Analysis Method
4 Tools Introduction
5 Case
Action 1: Check KPI Action 2: Identify the Action 3: Check Action 4: Check Action 5: Check Action 6: Check
definitions. issue scope and external events and faults and alarms. known version parameters.
demarcate issues. historical operations. issues.
KPI definitions on Traffic statistics eNodeB operation logs, alarm logs, fault logs, or BRD Release notes eNodeB configuration
Huawei OSS collected by the U2000 logs file
Source data KPI definitions on Traffic statistics Probe log
Huawei competitors' collected by the PRS QXDM log
OSS
Differences in KPI OMSTAR/FMA/
definitions between QCAT/Probe
Huawei and its
competitors
1. KPI calculation 1. Issue scope and Fault occurrence time, abnormal operations, abnormal Version changes 1.Differences in air-
formula check results TopN sites alarms, and fault information interface parameters
2. KPI value check 2. Associated KPI 2.Check results of
Output
Action 7: Analyze Action 8: Analyze Action 9: Analyze Action 10: Analyze Action 11: Analyze Action 12: Analyze Action 13: Perform
the traffic model. resources and coverage and the channels and handovers. transmission and DTs.
capacity. coding over the air interference. TCP.
interface.
Source data
tracing data of collected by the collected by the packet capturing logs for DTs
TopN sites Traffic statistics U2000 U2000 TMF files of IFTS
collected by the Traffic statistics Traffic statistics tracing data
U2000 collected by the collected by the
Traffic statistics PRS PRS
collected by the PRS RSSI or
interference
tracing data
capacity issues capacity issues 2. Causes of 2. Interference 2. Trend of the transmission 2. DT-based
2. PRB usage MCS index, within the number of 3. Packet disorder performance
and CCE IBLER, and system handovers during transmission analysis results
usage DTX changes 3. Interference 4. Transmission 3. Transmission
from eternal bandwidth limitation bandwidth
systems limitation
analysis results
Obtain the Assistant Extend and related operation guide from Link.
[FMA Tools] A new function is introduced to analyze the proportions of KPIs based on PDF traffic statistics.
The efficiency of analyzing the proportions of KPIs, such as the CQI and MCS index, improves significantly.
[FMA Tools] Tools > Parameter check > PS Performance Parameter check
[FMA Tools] CellDT/IFTS TTI analysis tool (for Huawei R&D and
GTAC engineers only)
1 Background
3 Analysis Method
4 Tools Introduction
5 Case
Problem Analysis:
Step 2 Check basic factors. Check for transmission alarms based on traffic KPIs.
The check result shows that the SCTP link clear alarm is generated
Action 4: Check parameters. None before this problem occurs. According to frontline personnel, the customer OK
reconstructed cell transmission.
Action 5: Checking for Known Issues None No Version defect is detected. OK
Compare configurations of normal and abnormal sites. No abnormal
Action 6: Check parameters. None parameter settings are found. Service parameters are modified before OK
and after deterioration.
[Problem Analysis]
Step 1 Identify and demarcate issues.
Prerequisit
Action Result Status
e
1) The mapping of KPI calculation formulas between Huawei and city A is correct.
Action 1: Check
None -----OK OK
KPI definitions.
2) Raw traffic statistics about Huawei network and city A are correct. -----OK
1) According to DTs, the average SINR is 11 and TopN cells with poor CQI values are
identified on Huawei network. The average SINR reaches 16 E network.
2) TopN cells: Check the CQI, rank 2 proportion, transmission performance, and
whether TopN cells can be identified.
Action 2:
3) Resources and capacity: For Huawei network, a large number of UEs are served
Identify the
by each site during peak hours, and the percentage of sites serving over 50 UEs
issue scope and None Abnormal
reaches 30%. For E network, the percentage of sites serving over 40 UEs reaches
demarcate
only 10%, and almost no site serves 90 or more UEs.
issues.
4) Transmission: According to traffic statistics and DTs, Huawei network encounters
serious transmission problems, which decrease transmission quality. In city A,
transmission quality is better, and the number of scheduling attempts in each cell
is greater than 900.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential 211
[Comparison Testing Case] Throughput on Huawei Network at Site
T Lower Than That in City A (2)
Resource and capacity
comparison On Huawei network, the throughput increases rapidly to 18
The number of UEs on Huawei network is
Mbit/s after the sites serving over 50 UEs are excluded.
twice that in city A.
The proportion of sites serving over 50 UEs on
Huawei network is 20% higher than that in city A.
Standard Standard
City Average Maximum Minimum Average Maximum Minimum Transmission
Deviation Deviation
comparison
City A 16.59 30 -5 6.68 -88.76 -53.75 -113.69 11.79
Throughput is limited
HW\\ M 11.59 29.95 -15.5 7.38 -85.65 -55.6 -116.66 9.58 possibly due to insufficient
transmission bandwidth
provision.
The average SINR in Huawei Morelia
area is 5 dB lower than that in city A.
Action 11: Analyze Performed based on traffic The spectral efficiency of Huawei is better than that of the
OK
coverage and coding. statistics analysis results
competitor.
Page215
eNodeB
The LTE network has certain requirements for the transmission delay to ensure sound network
performance. For example, HTTP DL/UL throughput, handover, and CSFB are all delay-sensitive.
On the other hand, many operators consider ping delay as an important indicator for evaluating
the end-to-end user experience of networks. The existence of the air interface also contributes to
the ping delay to a certain extent.
Measurement
method or tool
problems
Transmission 25%
problems above
the RAN
53%
Transmission problems above the RAN:
① Poor access network quality (microwave)
Parameter set-
② Improper server routing or transmission
Measurement tool and method ting problems (multiple hops)
22%
confirmation: ③ Long server delay (such as a server
① Measurement tool connection deployed outside China)
mode and tool model ④ Improper server configurations or poor
② Ping packet size, packet server performance (for example, packet
transmission interval, and loss and large jitter)
destination IP address
Tools & Methods Action 1: Confirm the measurement tool and method.
Appendixes
Using an incorrect method or tool will result in poor ping test performance or inaccurate test results. Therefore, the measurement tool and
method must be confirmed according to the following rules:
Method
Tool
UE Portable PC Software
Destination Number of Ping packet
Portable PC server/IP ping tests size
Test form: test in the
Types: data cards and performance: CPU
smart devices resources and memory
CMD command prompt address
window delivered with Different ping packet
Connection mode: Impacts of background Number of ping
the system, test using Different ping sizes: causing
local test, access programs: occupation of tests: required to be
the software embedded servers: Different differential air-
through the USB, or CPU resources and greater than or
with the ping function, or destination IP interface
access through the network bandwidth equal to 50 to
test using HRPing addresses result in performance and
radio access point resources prevent poor results
Different ping options, different IP routes. fragmentation
(AP) Statistics are inaccurate due to air interface
such as the interval and problems. As a
Different connection and the packet and transmission
protocol packet header result, final results
modes cause different transmission capability is fluctuation.
size, can be configured are affected.
transmission delays. insufficient when ping is
for different ping tools.
used every 5 ms.
* Some software, such as Speedtest, does not calculate the delay in the same way as the ICMP-based ping utility does. Therefore, the troubleshooting using such software
is different from that using the ping utility.
Operation log check: Check whether Alarm check of problematic cells: Check
current sites are subject to related whether related alarms exist. Analyze whether the problem symptom is the
operations. same as the symptom of precaution problems
or that of resolved problems in later versions.
Alarm check of neighboring cells: Check
Operation check of other NEs: Check
whether changes of the number of UEs
whether other NEs are subject to
are caused by alarms in neighboring
configuration changes, such as upgrade
cells. For problems in upgrade scenarios, check the
and cutover.
release notes to find suspicious version
changes.
Impact check of large events: Check Alarm check of other NEs: Check alarms
whether heavy-traffic scenarios, such as existing on related NEs.
gathering, exist on site.
Determine the problem scope and find characteristics of the problem to facilitate troubleshooting.
PreAllocationSwitch/SmartPreAllocationSwitch Whether to enable preallocation determines whether to trigger UL Grant through the SR. The SR period waiting time can decrease in the event of preallocation.
PreallocationSize/PreallocationSizeForSparse The number of scheduling times can decrease if preallocation is mapped with the ping packet size.
PreallocationMinPeriod/PreAllocMinPeriodForSparse The preallocation period determines the scheduling waiting duration of uplink packets.
SmartPreallocationDuration/SmartPreAllocDuraForSparse Preallocation gains can be obtained every time ping is performed if the preallocation duration is mapped with the ping task period.
If DRX is enabled, the scheduling waiting time needs to be woken up when the UE sends uplink data. Data cannot be transmitted until OD if DRX is in a silent
DRXPARAGROUP
period in the downlink.
LongDrxCycle This parameter determines the duration of waiting OD of downlink packets when the downlink packets are transmitted in the sleep time of DRX.
The ping process is an E2E process. Therefore, besides parameters of the eNodeB, other parameters, such as those of transmission equipment, CN equipment, and servers, also need
to be checked. Because the equipment is not always provided by Huawei, cooperation and confirmation with the operator are required.
Perform this action when the problem needs to be located based on inference and analysis. First, locate the problem by segment and determine whether the
delay problem is caused by delay of the air interface below the S1 interface or the interfaces above the S1 interface.
Procedure:
① Use the ping function to obtain the delay of the entire loopback.
② Calculate the delay from the eNodeB to the server by IFTS 149 tracing (tracing data can be directly obtained from the network side). For the detailed method of
collecting IFTS 149 tracing data, see Attachment 04 Analysis Guide for LTE 149 Trace Log. In addition, you can also run ping directly to test the network
communication from the eNodeB to the UGW.
③ Obtain the delay from the UE to the eNodeB by deducting the total delay by the delay from the eNodeB to the server. Obtain the total ping delay and the loopback
delay from the eNodeB's PDCP layer to the server's application layer by IFTS 149 tracing. Then deduct the total ping delay by the loopback delay to obtain the
delay from the UE to the eNodeB. In this way, it can be confirmed whether the problem is related to the air interface or above the air interface.
UE-to-eNodeB Delay Baseline eNodeB-to-Server Delay (Internal Network) eNodeB-to-Server Delay (External Network)
<15 ms < 5 ms Depending on the network environment of the server
④ Compare the delay from the eNodeB to the server and that from the UE to the eNodeB with baseline values to identify exceptions.
Long delay from the UE to the eNodeB Action 5: Analyze the delay from the UE to the eNodeB.
Problem segmentation result
Action 6: Analyze the transmission delay from the eNodeB
Long delay from the eNodeB to the server
to the server.
eNodeB Abstract
Perform action 5 if the segmentation results of action 4 show that there is a large delay from the UE to the eNodeB.
Segment the delay from the UE to the eNodeB, and measure the delay of each segment to locate the segment where the delay problem
occurs.
Above the S1
interface
Air interface
Application layer
of the UE
Ping delay
› Check whether preallocation configurations (including the period, data volume, and duration) meet the requirements of the ping
Analysis of the uplink
model when preallocation is enabled.
packet delay
› Check the current user's SR period (which can be obtained from signaling for RRC connections) when preallocation is disabled.
Analysis of the › Check whether DRX is enabled currently and whether settings of LongDrxCycle affect transmission of downlink packets.
downlink packet delay › Check whether congestion occurs due to heavy traffic based on traffic statistics.
› The radio conditions determine the air interface's encoding capability, which is a key factor in determining whether a ping packet
Coverage and encoding will be fragmented.
efficiency › The network coverage and interference are key factors in determining whether retransmission over the air interface will occur.
The delay caused by transmission above the air interface can be obtained by IFTS 149 tracing. However, the delay of transmission
above the air interface may involve multiple NEs, which also involves operator equipment and equipment provided by other vendors.
Therefore, cooperation is required during the troubleshooting. This action helps locate problems on the transmission side, but the root
cause needs to be located or the problem needs to be resolved under cooperation with the operator or the vendor of the malfunctioning
NE.
Procedure:
Transmission above the air interface consists of the following three segments:
(1) eNodeB to UGW
(2) UGW to Internet gateway
(3) Internet gateway to the server
The three segments are as follows:
(4) Access network
(5) Operator's backbone network
(6) Public Internet
If the server is an operator's internal server, the UGW directly connects to the server.
Segment 1 Segment 2 Segment 3
Mirror
server
S1-U
SGi
Name Description
Attachment 01 Technical Guide to PS Data Transmission (Ping Delay) Provides detailed guidelines for ping delay optimization and
Performance Optimization in the LTE Network V8.1.1 problem location step by step.
Attachment 02 LTE PS Data Transmission (Ping Delay) Problem Provides the template of a standard analysis report for ping
Analysis Report (Template) delay problem location.
Attachment 03 Data Transmission (Ping Delay) Parameter & Alarm Lists key parameters and alarms related to the ping delay on the
Checklist eNodeB.
Attachment 05 Guide to Capture Packets over the S1 Interface During Describes how to capture packets over the S1 interface during
Mirroring on BBU mirroring on the BBU.
Long transmission delays above IP addresses allocated to UEs by the UGW are different,
Ping delays of individual sites are
S1 interfaces at some sites, resulting in different IP routes for destination IP addresses.
1 longer than those of other sites
causing ping delays not to meet As a result, ping delays are long due to improper
during acceptance.
the requirements configurations of some routes.
Slow UE wake-up after DRX is The test ping delay after DRX is The wake-up duration from the DRX in a silent period to
2 enabled, causing a long ping enabled significantly becomes that in activation is long, resulting in a long delay. This
delay long. problem is a TopN UE compatibility problem.