Voipfuture Whitepaper CodecChoice 2020-1
Voipfuture Whitepaper CodecChoice 2020-1
2020
Introduction 3
4
signals for transmission or storage, while the decoder function
recreates the signal for playback or editing.
Hello Hello
Conversion Conversion
Analog/Digital Digital/Analog
Codec Codec
(Encode) (Decode)
Protocol Protocol
Stack Stack
Codecs are often optimized to encode human speech as
opposed to general audio signals. Human speech and human
hearing have well-known characteristics.
Most codecs relevant to landline and mobile VoIP telephony are standardized
by ITU-T and 3GPP. One notable exception is the Opus codec, which
was introduced by the IETF. The table lists some codecs and their key
characteristics.
The table (next page) is by no means complete and much more could be said
about each individual codec. Please refer to the respective standards for more
detailed information.
Codec Bandwidth Net Bitrate Usage
EVS NB, WB, 5.9 - 128 Currently being deployed in 4G mobile networks;
(Enhanced SWB, FB kbit/s mandatory codec for 5G voice services
Voice
Services)
G.711.1 NB,WB 64, 80, 96 Not widely supported, but used as HD codec by
kbit/s some fixed voice operators
G.726 NB 16, 24, 32, Widely supported by fixed network devices, but
40 kbit/s not widely used
Opus NB, WB, 6–510 kbit/s Widely supported by fixed network devices, but
SWB, FB not widely used Multi-mode audio codec defined
by the IETF in RFC 6716; required by WebRTC
implementations and slowly gaining support by
fixed network devices
4/11/2019 Page 1
10
0.010
Quality (R-Factor) per kilobit
AMR WB 12.65k
AMR 12.2k
5
0.005
G.726 32k
G.722
G.711
8
1/1
But also fixed network operators are – at least for on-net calls
– slowly moving to G.722 and G.711.1. Finally, IPX wholesale
services are enabling HD voice continuity in international calls.
(Estimating) Better
User Experience 9
The rise of wideband codecs creates new challenges for monitor-
ing voice quality and user experience. The Mean Opinion Score
(MOS) is considered the key metric for voice quality, yet few are
aware of its many different flavors.
MOS_NB
5.0
MOS_WB 10
4.5
4.0
Mean Opinion Score (MOS)
93
3.5
MOS_NB 4.41
3.0
MOS_WB 3.69
2.5
2.0
1.5
1.0
0 20 40 60 80 100 120 140
R-factor
1/1
bw=nb-swb;
br=16.4;
br=13.2-24.4;
Ranges may initially be specified by the caller. The callee may then accept or
not accept the EVS codec as a whole. If he accepts, he shall only answer with
ranges within the constraints given by the caller, as in this example:
CONTROL BY ENCODER
Finally, the EVS encoder performs bandwidth detection on the input signal
to apply a bandwidth decision logic. This means that the encoder may deci-
de to encode the audio in a lower bandwidth than determined by the input
sampling rate. For example, assume the input sampling frequency is 32 kHz.
If the bandwidth detection logic determines that there is no “energetically
meaningful” spectral content above 8 kHz, then the codec is operated in the
WB mode.
Interestingly, this codec decision does not seem to be constrained by the re-
sult of the SDP negotiation. Voipfuture analysed live VoLTE traffic and found
many examples of calls where SDP negotiations settled on EVS WB as band-
width mode, but the encoder delivered EVS NB encoded frames.
IMPACT ON MONITORING
All of this would be of little importance to VoIP monitoring, if the
used bandwidth and bitrate modes had no impact on the user
experience.
max wideband ⏵
EVS WB 24.4k 129
EVS WB 9.6k
EVS WB 7.2k 99
EVS NB 24.4k 93
EVS NB 16.4k 90
EVS NB 13.2k 87
EVS NB 9.6k 87
EVS NB 8.0k 83
EVS NB 7.2k 82
EVS NB 2.8k 80
⏴ max narrowband 1/1
This must be seen in light of the fact that very different factors
contribute to codec, codec bandwidth and bitrate selection,
which are not all under full control of a service provider. Quality
can change on a packet by packet basis – put to the extreme, user
satisfaction can change every 20ms.
For modern multi-mode HD voice codecs it is therefore not suf-
ficient to determine the codec based on the result of the SDP
negotiation.
15
CONCLUSION
For decades narrowband codecs, such as G.711, have been consi-
dered the gold standard of telephony. The current rise of modern
HD codecs is about to change this, which has a number of impli-
cations. For example, HD codecs change the user expectations,
because users quickly get used to new quality standards. This in
turns requires to speed up the transition to wideband codecs.
Most VoIP monitoring tools on the market are not capable of de-
tecting the codec based on analysis of the RTP payload. Qrystal
is a notable exception and provides service providers worldwide
with accurate information on the user satisfaction.
Voipfuture is a premium voice quality analytics vendor providing
tools for assessing, aggregating, analyzing, and visualizing voice
quality information. Voipfuture products offer a precise view on
media and control plane to communication service providers,
VoLTE carriers, wholesalers and enterprises.