An_Analysis_of_VoIP_Communication_and_Overview_of_
All content following this page was uploaded by Omar Farrok on 24 June 2017.
Abstract—In this paper, the latest development of G.729, an audio data compression algorithm, has been analyzed and implemented to save a noticeable amount of bandwidth. The effects of the settings AnnexB=yes and AnnexB=no have been observed for the G.729 codec, which has a standard bit rate of 8 kbps. As most internet subscribers in Asian countries have very low internet bandwidth, the goal of this analysis is to propose changes to some parameters of the system so that voice quality is kept within a tolerable limit using only 5 kbps to 6 kbps, whereas this codec normally uses at least 6.4 kbps. Practical tests in a real-life environment show that voice can be transmitted satisfactorily using less than 6.4 kbps by changing some of the parameters described in this paper.

Index Terms—G.729 codec, AnnexB=yes, AnnexB=no, RTP (real-time transport protocol), VAD (voice activity detector).

I. INTRODUCTION

The G.729 codec performs voice compression at bit rates that vary between 6.4 and 12.4 kbps [1]. It is an audio data compression algorithm for voice that compresses digital voice in packets of 10 milliseconds duration. It is officially described as coding of speech at 8 kbit/s using conjugate-structure algebraic code-excited linear prediction (CS-ACELP). Because of its low bandwidth requirements, G.729 is mostly used in Voice over Internet Protocol (VoIP) applications where bandwidth must be conserved. Standard G.729 operates at a bit rate of 8 kbit/s, but there are extensions that provide rates of 6.4 kbit/s (Annex D, F, H, I, C+) and 11.8 kbit/s (Annex E, G, H, I, C+) for worse and better speech quality, respectively.

To achieve good-quality low-bit-rate silence compression, a robust frame-based voice activity detector module is essential to detect inactive voice frames, also called silence or background noise frames. This allows bit-rate savings for coded speech, with average rates as low as 4 kb/s during normal speech conversation, while maintaining reproduction quality [2]. The G.729 codec is closely related to RTP [3]. This paper focuses on packet-level VoIP system performance. The analogue voice signals are digitized, compressed and then encoded into digital voice streams by the codec [4]. The VoIP protocol stack is illustrated in Fig. 1. Voice packets are transmitted over the IP network, and the reverse processes of decoding and depacketizing are accomplished at the receiver. A playout buffer is used by the receiver to smooth the speech by removing delay jitter. Packets arriving later than their playout time are simply discarded. Some other components, such as a voice/silence detector, loss/error concealment and an echo canceller, are also included in the system to enhance the functionality and performance of VoIP systems. The major metric for evaluating user-perceived voice quality is the Mean Opinion Score (MOS).

Application Layer   Voice
Transport Layer     RTP, RTCP, SIP, H.323
                    UDP
Network Layer       IP
Link Layer          NIC
Physical Layer      Ethernet

Fig. 1. VoIP protocol stack.

Voice codecs are standardized by the International Telecommunication Union-Telecommunication (ITU-T), such as G.729 at 8 kbps and G.723.1a at 5.3/6.3 kbps. The output voice stream then enters the packetizer, which generates constant bit rate (CBR) audio packets with RTP (RTCP)/UDP/IP headers. RTP and RTCP [5] are the Real-time Transport Protocol and the RTP Control Protocol, respectively, which are designed to support real-time multimedia applications with stringent delay constraints over the unreliable User Datagram Protocol (UDP). Besides these, call set-up signaling protocols, such as the Session Initiation Protocol (SIP), are used for establishing VoIP connections. SIP, defined in RFC 2543 of the Internet Engineering Task Force (IETF), is a signaling protocol for Internet applications, e.g. conferencing, telephony, event notification, and instant messaging.

II. DESCRIPTION OF THE SYSTEM

G.729 has been extended in Annex B (G.729b), which provides a silence compression method that enables a voice activity detection (VAD) module. The VAD is used to detect voice activity in the signal. Annex B also includes a discontinuous transmission (DTX) module, which decides when to update the background noise parameters for non-speech (noisy) frames. It uses 2-byte Silence Insertion Descriptor (SID) frames, transmitted to initiate comfort noise generation (CNG). If transmission is stopped and the link goes quiet because there is no speech, the receiving side might assume that the link has been

Manuscript received November 8, 2012; revised December 18, 2012. This work was arranged and supported by Zamir Telecom Limited. The authors would like to express their deepest sense of gratitude for the technical and financial support.
N. A. Shafi is with Zamir Telecom Limited, 4-6 Davenant Street, Unit # A, London E1 5NB, England (e-mail: nahid_apee@yahoo.com).
O. Farrok is with the Department of Electrical and Electronic Engineering, Ahsanullah University of Science and Technology (AUST), Dhaka-1208, Bangladesh (e-mail: omarruet@gmail.com).
M. M. Ali is with the Department of Electrical and Electronic Engineering, Rajshahi University of Engineering and Technology (RUET), Rajshahi-6204, Bangladesh (e-mail: mmali.ruet@gmail.com).
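As a rough illustration of the packetization described in Section I and the frame sizes mentioned in Section II, the following Python sketch estimates the codec bit rate and the resulting IP-level bandwidth of a G.729 stream. It is only a sketch under stated assumptions: the header sizes are the usual minimum sizes of RTP (12 bytes), UDP (8 bytes) and IPv4 (20 bytes) headers without options, and the number of frames per packet is an assumed setting, not a value taken from this paper. The function names are illustrative.

```python
# Sizes in bytes. Header sizes are the usual minimums (no options or
# extensions); they are assumptions for this sketch, not values from the paper.
RTP_HEADER = 12
UDP_HEADER = 8
IPV4_HEADER = 20

FRAME_MS = 10            # G.729 frame duration in milliseconds
SPEECH_FRAME_BYTES = 10  # 8 kbit/s * 10 ms = 80 bits = 10 bytes
SID_FRAME_BYTES = 2      # Annex B Silence Insertion Descriptor frame


def codec_bitrate_kbps(frame_bytes, frame_ms=FRAME_MS):
    """Bit rate of the bare codec payload (bits per millisecond == kbit/s)."""
    return frame_bytes * 8 / frame_ms


def ip_bandwidth_kbps(frames_per_packet=2, frame_bytes=SPEECH_FRAME_BYTES):
    """IP-level bandwidth when several codec frames share one RTP packet.

    frames_per_packet=2 (20 ms of speech per packet) is an assumed setting.
    """
    payload = frames_per_packet * frame_bytes
    packet = payload + RTP_HEADER + UDP_HEADER + IPV4_HEADER
    packets_per_second = 1000 / (frames_per_packet * FRAME_MS)
    return packet * 8 * packets_per_second / 1000


if __name__ == "__main__":
    print("codec rate (kbit/s):", codec_bitrate_kbps(SPEECH_FRAME_BYTES))
    print("IP rate, 2 frames/packet (kbit/s):", ip_bandwidth_kbps(2))
    print("IP rate, 1 frame/packet (kbit/s):", ip_bandwidth_kbps(1))
```

Under these assumptions the headers, not the speech payload, dominate the on-wire rate, which is why the number of frames carried per RTP packet has a large effect on the bandwidth actually consumed.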
International Journal of Computer Theory and Engineering, Vol. 5, No. 3, June 2013
Fig. 7. 2nd portion of call trace report for receiver end of VOS switch as AnnexB=no.
Fig. 9. 2nd portion of call trace report for transmitter end of VOS switch as AnnexB=no.
IV. CONCLUSION
G.729 has been extended with various features, commonly designated as G.729a and G.729b. Dual-tone multi-frequency (DTMF) signaling, fax transmissions, and high-quality audio cannot be transported reliably with this codec. DTMF requires the use of the RTP payload for DTMF digits, telephony tones, and telephony signals as specified in RFC 2833. Research for this article was done by means of a literature study and practical working knowledge in Bangladesh. The literature review aims to provide business management with a review of the development of VoIP in Asian countries. It included a study of published and internet articles, books and literature on the management of information systems [13]. The outcome of this VoIP research is that, with the G.729 codec set to AnnexB=no, if the audio packet size is less than 6.4 kbps, good voice quality is achieved in a low bandwidth of only 5 kbps to 6 kbps, which saves an average bandwidth of approximately 14%.
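The figure of approximately 14% can be sanity-checked with a short calculation. The Python sketch below assumes that the saving is measured relative to the 6.4 kbps baseline and that the average is the simple mean of the savings at 6 kbps and 5 kbps. This averaging is an assumption made here for illustration and is not stated explicitly in the paper; the function name is also illustrative.

```python
def saving_percent(baseline_kbps, reduced_kbps):
    """Relative bandwidth saving, in percent of the baseline rate."""
    return (baseline_kbps - reduced_kbps) / baseline_kbps * 100


BASELINE_KBPS = 6.4  # lowest standard G.729 rate quoted in the paper

# Savings at the two ends of the 5 kbps to 6 kbps range from the conclusion.
savings = [saving_percent(BASELINE_KBPS, rate) for rate in (6.0, 5.0)]

# Simple mean of the two end points (an assumption of this sketch).
average = sum(savings) / len(savings)

if __name__ == "__main__":
    print([round(s, 3) for s in savings], round(average, 2))
```

With these inputs the two end points come out at about 6.25% and 21.9%, and their mean is about 14.1%, which is consistent with the approximate value quoted above.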
Fig. 8. 1st portion of call trace report for transmitter end of VOS switch as AnnexB=no.

So, noise will be generated, which is undesired [12]. When the RTP packet size is decreased for a low-bandwidth network area, it is not possible to achieve good quality with the G.729 audio codec. If it is possible to decrease the frame size of the G.729 audio codec, then good voice quality may be obtained. Good voice quality is still not obtained on a low-bandwidth network with AnnexB=no, but AnnexB=no is better than AnnexB=yes. If the audio packet size is less than 6.4 kbps with AnnexB=no, good voice quality is possible in a low-bandwidth area. It is also possible to decrease the RTP packet size. So the total bandwidth saving with this technique ranges from ((6.4 - 6)/6.4) × 100% = 6.25% to

APPENDIX
VOS switch, VPS switch, various dialers, SQL database, firewall system, STM device, Cisco router and switch, Windows server and Linux server operating systems, etc.

ACKNOWLEDGMENT
This research work was inspired and supported by Zamir Telecom Limited, situated at 4-6 Davenant Street, Unit # A, London E1 5NB.

REFERENCES
[1] G.729 VoIP Compression Algorithm. [Online]. Available: http://www.adaptivedigital.com/product/vocoders/g729.htm
[2] ITU-T Recommendation, “G.729 Annex B: a silence compression scheme for use with G.729 optimized for V.70,” Digital Simultaneous Voice and Data Applications, vol. 35, no. 9, pp. 64-73, Sep. 1997.
[3] Cisco Systems. [Online]. Available: http://www.cisco.com/en/US/tech/tk652/tk698/technologies_tech_note09186a0080094ae2.shtml
[4] L. Cai, Y. Xiao, X. S. Shen, L. Cai, and J. W. Mark, “VoIP over WLAN: Voice capacity, admission control, QoS and MAC,” Int. J. Commun. Syst., vol. 19, pp. 491–508. Published online in Wiley InterScience. [Online]. Available: http://www.interscience.wiley.com
[5] H. Schulzrinne, S. Casner, R. Frederick, and V. Jacobson. RTP: A Transport Protocol for Real-Time Applications. [Online]. Available: ftp://ftp.ietf.org/rfc/rfc1889.txt
[6] G.729. [Online]. Available: http://en.wikipedia.org/wiki/G.729
[7] R. V. Cox and P. Kroon, “Low bit-rate speech coders for multimedia communications,” IEEE Communications Magazine, vol. 34, no. 12, pp. 34-41, December 1996.
[8] ITU-T Rec. G.729 Annex B, “A silence compression scheme for G.729 optimized for terminal conforming,” Tech. Rep., ITU-T, Oct. 1996.
[9] F. Beritelli, S. Casale, and G. Ruggeri, “Performance evaluation and comparison of ITU-T/ETSI voice activity detectors,” Viale Andrea Doria 6, 95125 Catania.
[10] Offer/Answer Considerations for G.723 Annex A and G.729 Annex B. [Online]. Available: http://tools.ietf.org/html/draft-muthu-payload-offer-answer-g723-g729-00
[11] ITU-T Recommendation G.729 Annex B, “A silence compression scheme for G.729 optimized for terminals conforming to Recommendation V.70,” November 1996.
[12] Recognizing and Categorizing Symptoms of Voice Quality Problems. [Online]. Available: http://www.cisco.com/en/US/tech/tk652/tk698/technologies_white_paper09186a00801545e4.shtml
[13] Voice over internet protocol (VoIP). African Journal of Business Management. 3. pp. 89-94. [Online]. Available: http://www.academicjournals.org/AJBM

N. A. Shafi was born in Rajshahi, Bangladesh, on July 23rd, 1985. He received the B.Sc. and M.Sc. degrees from the Department of Applied Physics and Electronic Engineering, Rajshahi University, Rajshahi, Bangladesh, in 2006 and 2007, respectively. He has been working as a System Engineer in the Network Operation Center of Zamir Telecom Limited, at its Bangladesh support end in Dhaka, Bangladesh, since September 1st, 2009. He is a member of the International Association of Computer Science and Information Technology (IACSIT). His current research interests include VoIP, antennas, microcontrollers and spintronics.

O. Farrok was born in Bangladesh on 16 July 1982. He received the Bachelor and Master degrees from the Department of Electrical and Electronic Engineering (EEE), Rajshahi University of Engineering & Technology, Rajshahi-6204, Bangladesh. He joined the Department of Electrical and Electronic Engineering, Ahsanullah University of Science and Technology (AUST), in 2007 and is serving there as an assistant professor. His current research interests include power electronics, digital electronics and communication engineering. He is a member of The Institution of Engineers, Bangladesh.

M. M. Ali was born in Bangladesh. He received the Bachelor degree from Engineering College, Rajshahi, Bangladesh. He received the Ph.D. degree from Japan. He is currently working as a professor in the Department of Electrical and Electronic Engineering (EEE), Rajshahi University of Engineering & Technology, Rajshahi-6204, Bangladesh. At present he is serving as the pro vice chancellor of this university. He is also a fellow of the Institution of Engineers, Bangladesh, and is working as Vice-Chairman in the department of academic and human resource development.