Voice Codecs - GL Communications
Voice Codecs - GL Communications
com
Voice Codecs
GL Communications products support a variety of signaling and audio processing applications in both VoIP and TDM. Using these tools, one can
emulate, analyze, and troubleshoot audio signaling over both VoIP and TDM. Each of these tools support the following narrow-band, and wideband
(HD audio) codec standards:
G.711 App II (PCM µ-law/A-law with VAD) 64 kbps 8000 Yes Multiples of 10 ms
SPEEX (Narrow Band) 8 kbps 8000 Yes Fixed at 20 ms. Multiple Ptime
Not Supported
SPEEX (Wideband) 11.2 kbps 16000 Yes Fixed at 20 ms. Multiple Ptime
Not Supported
G.729 operates at a bit rate of 8 kbps with an encoding frame length of 10 ms and 5 ms look ahead, but there are extensions, commonly
designated as G.729a and G.729b. Annex A and Annex B Voice encoding using CS-ACELP (Conjugate-Structure Algebraic Code Excited Linear
(https://www.linkedin.com/company/glcommunicationsinc)
Prediction) 8 kbps, is the lowest bit rate ITU-T standard with toll quality. Annex A is a low-complexity version of the G.729 standard. Annex B
defines VAD/CNG/DTX (Voice Activity Detection/Comfort Noise Generator/Discontinuous Transmission) for G.729 and G.729A.
GSM-FR
GSM-FR is a Full Rate speech coder standardized by the European Telecommunications Standards Institute (ETSI) for compressing toll quality
speech (8000 samples / second) and was the first digital speech coding standard used in GSM digital mobile phone systems. The coder has a bit
rate of 13 kbps with an encoding frame length of 20 ms.
This coder uses the principle of Regular Pulse Excitation-Long Term Prediction-Linear Predictive coding. The coder works on a frame of 160
speech samples with an encoding frame length of 20 ms, and no look ahead is required.
GSM EFR
GSM-EFR (6.60) is an improved and hence the Extended version of GSM-FR (6.10) codec. With sampling frequency of 8000 samples/sec and
frame size of 31 bytes it achieves the bit rate of 12.2kbps with an encoding fixed frame length of 20 ms. Codec supports Voice Activity Detection
(VAD) to allow saving of bandwidth.
GSM HR
GSM HR 6.20 operates with sampling frequency of 8000 samples/sec. This codec outputs the frames of size 14 Bytes, which puts the bit rate of
encoder at 5.6kbps with an encoding frame length of 20 ms. Codec supports Voice Activity Detection (VAD) to allow saving of bandwidth.
G.726 (ADPCM)
This is an ADPCM (Adaptive Differential Pulse Code Modulation). Originally, a half-rate alternative to ITU-T G.711 and includes both the G.721 and
G.723 standards. G.726 compresses by converting between linear, A-law (used in Europe) or µ-Law (used in the U.S and Japan) PCM and 40, 32,
24 or 16 kbps with an encoding frame length of 10 ms.
G.726 with Voice Activity Detection (ADPCM)
Home (https://www.gl.com/index.html) | About (https://www.gl.com/about.html) info@gl.com
This is an ITU-T Adaptive differential pulse code modulation (ADPCM) voice codec, which transmits at bit rates of 16, 24, 32, and 40 kbps with an
(https://www.youtube.com/user/GLCommunicationsInc) | andCareers (https://www.gl.com/employment.html) (https://www.gl.com/inforequestform.php)
encoding frame length of 10 ms. It supports Voice Activity detection generates SID packets during Silence Period. ADPCM provides the
following functionality: | Downloads (https://www.gl.com/download_overview.php) | +1 (301) 670-4784 (tel:+1 (301) 670-4784)
(https://www.facebook.com/GLCommunicationsInc)
Voice mail recording and playback, which is a requirement for Internet voice mail.
(https://www.twitter.com/GLCommInc)
Voice transport for cellular, wireless, and cable markets.
High voice quality voice transport at 32 kbps.
(https://www.linkedin.com/company/glcommunicationsinc)
AMR operates at eight bit rates in the range of 4.75 to 12.2 kbps with an encoding frame length of 20 ms and was specifically designed to improve
link robustness.
AMR-WB provides improved speech quality because of a wider speech bandwidth that is of 50–7000 Hz compared to narrowband speech coders
which in general are optimized for POTS wireline quality of 300–3400 Hz.
EVRC-B is an enhancement to EVRC. EVRCB codec type compresses each 20 milliseconds of 8000 Hz, 16-bit sampled speech input into output
frames of one of the four rates:(1/8- 16 bits, ¼- 40 bits, ½- 80 bits, and 1- 171 bits with an encoding frame length of 20 ms). By default, 1/8 and 1
are selected as the minimum rate & maximum rate. There is option to select RTP packet format between Header Free Format and Bundled
Format. By default, Bundled Format is set.
Important enhancement in EVRC-B is the use of 1/4 rate frames that were not used in EVRC. This provides lower average data rates (ADRs)
compared to EVRC, for a given voice quality.
EVRC-C adds the feature of encoding wideband signals sampled at 16 kHz with signal bandwidth up to 7 kHz.
Enhanced Voice Services Codec (EVS)
Home (https://www.gl.com/index.html) | About (https://www.gl.com/about.html) info@gl.com
(https://www.youtube.com/user/GLCommunicationsInc)
EVS provides vastly improved voice quality, network capacity and| advanced
Careers (https://www.gl.com/employment.html)
features for voice services over LTE and other (https://www.gl.com/inforequestform.php)
radio access
technologies standardized by 3GPP. It is the first 3GPP conversational codec providing up to 20 kHz audio bandwidth, offering speech quality that(301) 670-4784)
| Downloads (https://www.gl.com/download_overview.php) | +1 (301) 670-4784 (tel:+1
of highest standard.
(https://www.facebook.com/GLCommunicationsInc)
(https://www.twitter.com/GLCommInc)
EVS codec includes a multi-rate audio codec, a source controlled variable bit-rate (SC-VBR) scheme, a VAD, a comfort noise generation (CNG)
system, and an error concealment (EC) mechanism to offset the effects of transmission errors resulting in lost packets. Its channel-aware mode
feature further improves frame/packet error resilience.
(https://www.linkedin.com/company/glcommunicationsinc)
OPUS
The Opus codec scales from 6 kbit/s narrowband mono speech to 510 kbit/s fullband stereo music. Supports both constant bitrate (CBR) and
variable bitrate (VBR). Provides audio bandwidth such as Narrow Band (8 kHz), Middle Band (12 kHz), Wideband (16 kHz), Super Wideband (24
kHz), and Full Band (48 kHz).
SMV
The Selectable Mode Vocoder (SMV) [2] compresses each 20 milliseconds of 8000 Hz, 16-bit sampled speech input into output frames of one of
the four different sizes: Rate 1 (171 bits), Rate 1/2 (80 bits), Rate 1/4 (40 bits), or Rate 1/8 (16 bits) with an encoding frame length of 20 ms. SMV
is the preferred speech codec standard for CDMA2000, and will be deployed in third generation handsets.
SPEEX NB and WB
SPEEX NB is based on CELP Narrowband (8 kHz with an encoding fixed frame length of 20 ms), open source codec specifically used for VoIP and
file-based applications
SPEEX WB Codec has a sampling rate of 16000 samples/sec with an encoding fixed frame length of 20 ms, which makes it a wide band codec.
This codec supports different codec options such as Sampling Rate, Variable Bit Rate, Voice Activity Detection and Perceptional Enhancement.
iLBC Codec
iLBC (internet Low Bitrate Codec) is a narrow band speech codec that operates at either 13.33 kbps with an encoding frame length of 30 ms or
15.20 kbps with an encoding length of 20 ms. Companies that are using iLBC in their commercial products include:
Applications/Soft phones: Skype, Nortel, Webex, Hotsip, Marratech, Gatelinx, K-Phone, XTen;
(https://www.linkedin.com/company/glcommunicationsinc)
Codec Bit Rate (Kbps) Number of bits per second which needs to be transmitted to deliver a voice call. (codec bit rate = codec sample
size / codec sample interval).
We can calculate bit-rate as follows: For G.711 – 64 kbps = (160 bytes * 8 bits) * (1/20 ms)
For G.729 – 8 kbps = (20 bytes * 8 bits) * (1/20 ms)
Voice Payload in Bytes The voice payload size represents the number of bytes (or bits) that are filled into a packet.
PPS PPS represents the number of packets that need to be transmitted every second in order to deliver the codec bit
rate. To retrieve the PPS you can just do 1/(voice payload in ms). For Example, 50 PPS=(1/20 ms), 33 PPS = (1/33
ms)
R-Factor Quality score based on various end point and network parameters. Includes codecs, packet loss, and delay.
Conversational The voice quality metric that measures voice quality based on transmission delay, burst packet loss, and burst loss
R-Factor recency.
Listening R-Factor The voice quality metric based only on burst packet loss and codec selection.
MOS-LQ Mean Opinion Score based on listening quality. Does not consider recency or delay. ITU-T P.862 Listening Quality
implementations.
MOS-CQ Mean Opinion Score based on conversational quality. Includes recency and delay effects.
MOS-Nom Nominal quality or maximum score for the codec selected. Similar to the G.107 E-model defaults
Recency A time factor used to weight scores based on the time from a burst packet loss to the end of the call or next packet
loss event.
Home (https://www.gl.com/index.html) | About (https://www.gl.com/about.html) info@gl.com
(https://www.twitter.com/GLCommInc)
The mean opinion score (MOS) is a commonly used method to determine the quality of speech. Every codec has a certain speech quality
characteristics. With MOS, the quality of a speech is rated on a scale of 1 (bad) to 5 (excellent).
(https://www.linkedin.com/company/glcommunicationsinc)
Recently. the speech quality estimates are based on the ITU G.107 E Model. These models considered the entire Ear-Mouth path and all relevant
conditions such as end-to-end level, echo, side tone, and frequency characteristics of the various path segments.
The E Model uses a computational method that includes factors such as noise, signal level, loudness ratings, impairments, delay, codec type, and
even network type to derive a quality score. This transmission quality rating is called as the ‘R’ factor. Over time and based on experience with
subjective and objective measurements, the E Model's R-Factor score was mapped to an equivalent Mean Opinion Score (Excellent to Bad) to
predict the quality of the “mouth to ear” (M2E) speech path. Scoring includes consideration for the type of subjective test used for scoring.
Passive/listening or active/conversational tests produce slightly different scores.
For IP networks, the score assumes ideal conditions outside the IP cloud and bases the scores on the relevant IP impairments such as packet loss,
latency, jitter, and even when these impairments occur over the duration of the call.
GSM-FR 3.57 73
GSM HR 3.53 72
GSM EFR Home (https://www.gl.com/index.html)
4.16 | About (https://www.gl.com/about.html)
91 info@gl.com
(https://www.twitter.com/GLCommInc)
G.726-24k 3.35 68
G.726-16k 2.82 57
(https://www.linkedin.com/company/glcommunicationsinc)
G.726-40k with VAD 4.16 91
(https://www.twitter.com/GLCommInc)
AMR WB Mode 6 (19.85) 4.18 107
AMR WB Mode 7 (23.05) 4.18 107
(https://www.linkedin.com/company/glcommunicationsinc)
AMR WB Mode 8 (23.85) 4.18 107
EVRC 3.94 83
(requires additional license)
EVRCB 3.98 84
(requires additional license)
SMV 3.88 81
Speex NB 4.16 91
ISDN Triggered Call Capture & Analysis Application (call-capture-and- RTP ToolBox™
analysis-over-t1e1.html) (over TDM)
(https://www.twitter.com/GLCommInc)
Voice Band Analyzer (voice-band-analysis.html) (VBA) MAPS™ SIGTRAN (maps-sigtran.html), MAPS™ ISDN SIGTRAN (maps-
isdn-sigtran-emulator.html)
(https://www.linkedin.com/company/glcommunicationsinc)
PacketScan™ (packetscan-all-ip-packet-analyzer.html) (over IP) MAPS™ BICC IP (bicc-protocol-emulation-using-maps.html), MAPS™
CAMEL IP (camel-application-part-cap-emulator-over-tdm-ip-using-
maps.html)
VQT Analysis for NB, WB, and SWB speech (voice-quality-testing-pesq- VQuad™ SIP (ip-softphone-testing.html)
polqa.html)
Buyer's Guide
Item Item Description
No.
Related Software
PKV100 PacketScan™ (packetscan-all-ip-packet-analyzer.html) (Online and |Offline)
Home (https://www.gl.com/index.html) About (https://www.gl.com/about.html) info@gl.com
LANGUAGE
English
(https://www.gl.com/index.html)
中文
(https://www.gl.com/cn/index.html)
COPYRIGHT © 2024 GL COMMUNICATIONS INC - ALL RIGHTS RESERVED. RSS FEED (HTTPS://WWW.GL.COM/RSS/GLFEEDS.XML)
PRIVACY POLICY (HTTPS://WWW.GL.COM/GL-PRIVACY-POLICY.HTML)
CONTACT (HTTPS://WWW.GL.COM/INFOREQUESTFORM.PHP?LISTNAME=IRF-
GBL)